-
Notifications
You must be signed in to change notification settings - Fork 3.4k
NVIDIA Megatron-LM Discussions
Sort by:
Latest activity
Categories, most helpful, and community links
Categories
Community links
Discussions
-
You must be logged in to vote 🙏 [QUESTION] How to release the model and optimizer memory manually?
staleNo activity in 60 days on issue or PR -
You must be logged in to vote 🙏 [QUESTION] How to set
stale--rotary-seq-len-interpolation-factorfor rope scaling?No activity in 60 days on issue or PR -
You must be logged in to vote 🙏 [QUESTION] How to re-initialize process group after destroy_process_group() ?
staleNo activity in 60 days on issue or PR -
You must be logged in to vote 🙏 How to split the dataset when running pretrain_bert.py
staleNo activity in 60 days on issue or PR -
You must be logged in to vote 🙏 [QUESTION] Why write a special LinearWithFrozenWeight?
staleNo activity in 60 days on issue or PR -
You must be logged in to vote 🙏 question about test_global_memory_buffer
staleNo activity in 60 days on issue or PR