I'm trying to build a phrase table using the following command:
docker run --rm -v $PWD/mycorpus:/root/corpus build-pt corpus src_short tgt_short 3 > phrase-table.gz
It fails with this error: IOError: [Errno 2] No such file or directory: 'corpus.src_short'. I'm sure I'm making a silly mistake, but I haven't been able to fix it, even after hours of fiddling with paths and checking that my corpus files are present in the Docker volume.
I have two files under the mycorpus directory:
- corp.src_short
- corp.tgt_short
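A quick way to check which file names the command would look for is sketched below. It assumes the tool opens `<prefix>.<extension>`, which is what the error message suggests but is not confirmed. `check_corpus` is a hypothetical helper written for this post, not part of build-pt.

```shell
#!/bin/sh
# Hypothetical helper (not part of build-pt): for each extension given,
# report whether <dir>/<prefix>.<ext> exists as a regular file.
# Assumption: the tool builds its input names as <prefix>.<ext>.
check_corpus() {
    dir=$1
    prefix=$2
    shift 2
    status=0
    for ext in "$@"; do
        f="$dir/$prefix.$ext"
        if [ -f "$f" ]; then
            echo "found:   $f"
        else
            echo "missing: $f"
            status=1
        fi
    done
    return $status
}

# Prefix passed on the command line vs. prefix of the files on disk.
check_corpus mycorpus corpus src_short tgt_short
check_corpus mycorpus corp src_short tgt_short
```

Run from the directory that contains mycorpus, the first call checks for the names implied by the command (corpus.src_short, corpus.tgt_short) and the second checks for the names of the two files listed above.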
Can anyone provide a working example (paths and file names included) of how to correctly generate a phrase table?