Skip to content

Commit 856f430

Browse files
authored
Update README.md
1 parent 967b19d commit 856f430

File tree

1 file changed

+2
-2
lines changed

1 file changed

+2
-2
lines changed

examples/asr_librispeech/README.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -79,7 +79,7 @@ If you're interested in training with DeepSpeed, refer to the script `finetune_w
7979
}
8080
```
8181

82-
Note that when using `zero-0`/`1`/`2`/`3`, the DeepSpeed model is saved as `pytorch_model.bin`
82+
Note that when using `zero-0`/`1`/`2`/`3`, the DeepSpeed model is saved as `pytorch_model.bin`, and you should change "++ckpt_path=$ckpt_path/model.pt" to " ++ckpt_path=$ckpt_path/pytorch_model.bin" in the script to use the model during inference.
8383
If you use bf16/fp16 training in DeepSpeed and encounter NaN in train/eval loss, check the autocast in `src/slam_llm/utils/deepspeed_utils.py`:
8484

8585
```python
@@ -96,4 +96,4 @@ You can refer to the paper for more results.
9696
journal={arXiv preprint arXiv:2402.08846},
9797
year={2024}
9898
}
99-
```
99+
```

0 commit comments

Comments
 (0)