Error in context encoding implementation #3

@artificertxj1

I believe there are some major discrepancies between the implementation in this repo and the original paper. You will probably want to check the PixelCNN and Gated PixelCNN papers and implement the context predictor with masked convolutions and gated units. You don't need to split the encoded feature map into 3x3 pieces and average-pool them; a single stack of PixelCNN layers will do the job of aggregating features from above. In your implementation, the aggregated information is limited by the size of the sliced square pieces (3x3). One should try to use longer-range information to capture the "slow changing" structure (to quote the CPC paper).
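To make the receptive-field point concrete, here is a rough sketch in plain Python (no framework, so it is not a drop-in layer). It builds the 0/1 mask of a PixelCNN-style masked convolution and counts how many feature-map cells can reach one target cell after stacking several such layers. The names `causal_mask` and `receptive_field`, the 7x7 grid, and the choice to keep the centre cell visible in every layer are my own assumptions for illustration; they are not taken from this repo or from the paper.

```python
def causal_mask(k, mask_type="B"):
    """Return a k x k 0/1 mask for a PixelCNN-style masked convolution.

    Rows above the centre are fully visible, the centre row is visible only
    to the left of the centre. Type "B" also keeps the centre cell; type "A"
    hides it.
    """
    if k % 2 == 0:
        raise ValueError("kernel size must be odd")
    c = k // 2
    mask = [[0] * k for _ in range(k)]
    for i in range(k):
        for j in range(k):
            above = i < c
            left_in_row = i == c and j < c
            centre = i == c and j == c and mask_type == "B"
            if above or left_in_row or centre:
                mask[i][j] = 1
    return mask


def receptive_field(size, k, n_layers):
    """Cells of a size x size grid that can influence the bottom-centre cell
    after n_layers stacked masked convolutions.

    Assumption: every layer uses a type "B" mask, i.e. the context at a
    position is allowed to see the feature at that same position.
    """
    c = k // 2
    mask = causal_mask(k, "B")
    reach = {(size - 1, size // 2)}  # start from the target cell
    for _ in range(n_layers):
        grown = set()
        for r, col in reach:
            for i in range(k):
                for j in range(k):
                    if mask[i][j]:
                        rr, cc = r + i - c, col + j - c
                        if 0 <= rr < size and 0 <= cc < size:
                            grown.add((rr, cc))
        reach = grown
    return reach


if __name__ == "__main__":
    for row in causal_mask(3):
        print(row)
    for depth in (1, 2, 3):
        cells = receptive_field(size=7, k=3, n_layers=depth)
        print(depth, "layer(s):", len(cells), "cells visible")
```

The count grows with every extra layer, so a modest stack already sees well beyond a fixed 3x3 slice. A plain masked stack like this one still misses some cells up and to the right of the target, which is the blind spot the Gated PixelCNN paper addresses with separate vertical and horizontal stacks.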
