This repository was archived by the owner on Jan 5, 2023. It is now read-only.

Release v1.2.0

ozancaglayan released this 20 Feb 14:10

· 651 commits to master since this release

4f56c04

Release Notes

You can now use $HOME and $USER in your configuration files.
Fixed an overflow error that would cause NMT with more than 255 tokens to fail.
METEOR worker process is now correctly killed after validations.
Many runs of an experiment are now suffixed with a unique random string instead of incremental integers to avoid race conditions in cluster setups.
Replaced utils.nn.get_network_topology() with a new Topology class that will parse the direction string of the model in a more smart way.
If CUDA_VISIBLE_DEVICES is set, the GPUManager will always honor it.
Dropped creation of temporary/advisory lock files under /tmp for GPU reservation.
Time measurements during training are now structered into batch overhead, training and evaluation timings.
Datasets
- Added TextDataset for standalone text file reading.
- Added OneHotDataset, a variant of TextDataset where the sequences are not prefixed/suffixed with <bos> and <eos> respectively.
- Added experimental MultiParallelDataset that merges an arbitrary number of parallel datasets together.
nmtpy translate
- .nodbl and .nounk suffixes are now added to output files for --avoid-double and --avoid-unk arguments respectively.
- A model-agnostic enough beam_search() is now separated out into its own file nmtpytorch/search.py.
- max_len default is increased to 200.

Assets 2