This repository was archived by the owner on Jan 5, 2023. It is now read-only.
Release v1.2.0
Release Notes
- You can now use `$HOME` and `$USER` in your configuration files.
- Fixed an overflow error that would cause NMT with more than 255 tokens to fail.
- METEOR worker process is now correctly killed after validations.
- Multiple runs of an experiment are now suffixed with a unique random string instead of an incremental integer to avoid race conditions in cluster setups.
- Replaced `utils.nn.get_network_topology()` with a new `Topology` class that parses the `direction` string of the model in a smarter way.
- If `CUDA_VISIBLE_DEVICES` is set, the `GPUManager` will always honor it.
- Dropped creation of temporary/advisory lock files under `/tmp` for GPU reservation.
- Time measurements during training are now structured into batch overhead, training and evaluation timings.
- Datasets
  - Added `TextDataset` for standalone text file reading.
  - Added `OneHotDataset`, a variant of `TextDataset` where the sequences are not prefixed/suffixed with `<bos>` and `<eos>` respectively.
  - Added experimental `MultiParallelDataset` that merges an arbitrary number of parallel datasets together.
- nmtpy translate
  - `.nodbl` and `.nounk` suffixes are now added to output files for the `--avoid-double` and `--avoid-unk` arguments respectively.
  - A sufficiently model-agnostic `beam_search()` is now separated out into its own file, `nmtpytorch/search.py`.
  - The `max_len` default is increased to 200.
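
The switch from incremental integers to random suffixes for run names can be illustrated with a small sketch. This is not the project's actual code: `unique_run_name` and `reserve_run_dir` are hypothetical helpers, and the suffix format is an assumption. The point is that an incremental scheme has to look up the highest existing index before picking the next one, so two jobs started at the same moment on a cluster can pick the same index, whereas a random suffix needs no such lookup.

```python
import secrets
from pathlib import Path


def unique_run_name(base: str, nbytes: int = 4) -> str:
    """Return `base` plus a random hex suffix (hypothetical naming scheme)."""
    # token_hex(nbytes) yields 2 * nbytes hexadecimal characters.
    return f"{base}-{secrets.token_hex(nbytes)}"


def reserve_run_dir(root: Path, base: str, max_tries: int = 10) -> Path:
    """Create and return a fresh run directory under `root`.

    mkdir(exist_ok=False) fails if the directory already exists, so in the
    unlikely event of a suffix collision we simply draw a new suffix.
    """
    for _ in range(max_tries):
        run_dir = root / unique_run_name(base)
        try:
            run_dir.mkdir(parents=True, exist_ok=False)
            return run_dir
        except FileExistsError:
            continue
    raise RuntimeError(f"could not reserve a run directory for {base!r}")
```

Two concurrent calls such as `reserve_run_dir(Path("runs"), "exp")` would each end up with their own directory without having to coordinate through a shared counter.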