Refactor script to use 'overwrites' variable for command-line arguments in training scripts #1473
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
The goal of this PR is to add support for command line arguments to the bash training scripts. The
run_train.sh
had support foroverrides
, however, themultinode_trainer.slurm
script did not. Thisoverrides
flag add supports for commands like:sbatch multinode_trainer.slurm --job.description="TEST_RUN"
However, there is a problem with the current
overrides
implementation, when passing arguments with space such as"TEST RUN"
instead of"TEST_RUN"
then the variablejob.description
would only getTEST
as input and the training script throws an error for unrecognizing the argumentRUN
which is passed in a different line. To address this I simplify the code and directly pass the additional overrides through$@
. This solves the issue for commands such as:sbatch multinode_trainer.slurm --job.description="TEST RUN"