This repository was archived by the owner on Jan 5, 2023. It is now read-only.
v1.3.0
- Added
Multi30kRawDatasetfor training end-to-end systems from raw images as input. - Added
NumpyDatasetto read.npy/.npztensor files as input features. - You can now pass
-Stonmtpy trainto produce shorter experiment files with not all the hyperparameters in file name. - New post-processing filter option
de-spmfor Google SentencePiece (SPM) processed files. sacrebleuis now a dependency as it is now accepted as an early-stopping metric.
It only makes sense to use it with SPM processed files since they are detokenized
once post-processed.- Added
sklearnas a dependency for some metrics. - Added
momentumandnesterovparameters to[train]section for SGD. ImageEncoderlayer is improved in many ways. Please see the code for further details.- Added unmerged upstream PR for
ModuleDict()support. METEORwill now fallback to English if language can not be detected from file suffixes.-fnow produces a separate numpy file for token frequencies when building vocabulary files withnmtpy-build-vocab.- Added new command
nmtpy testfor non beam-search inference modes. - Removed
nmtpy resumecommand and addedpretrained_fileoption for[train]to initialize model weights from a checkpoint. - Added
freeze_layersoption for[train]to give comma-separated list of layer name prefixes to freeze. - Improved seeding: seed is now printed in order to reproduce the results.
- Added IPython notebook for attention visualization.
- Layers
- New shallow
SimpleGRUDecoderlayer. TextEncoder: Ability to setmaxnormandgradscaleof embeddings and work with or without sorted-length batches.ConditionalDecoder: Make it work with GRU/LSTM, allow settingmaxnorm/gradscalefor embeddings.ConditionalMMDecoder: Same as above.
- New shallow
- nmtpy translate
--avoid-doubleand--avoid-unkremoved for now.- Added Google's length penalty normalization switch
--lp-alpha. - Added ensembling which is enabled automatically if you give more than 1 model checkpoints.
- New machine learning metric wrappers in
utils/ml_metrics.py:- Label-ranking average precision
lrap - Coverage error
- Mean reciprocal rank
- Label-ranking average precision