Commit d4ec7c8

sanandaraj5597 and Selvaraj Anandaraj authored
Added gc_interval for manual garbage collection (#278)
Signed-off-by: Selvaraj Anandaraj <[email protected]>
Co-authored-by: Selvaraj Anandaraj <[email protected]>
1 parent 8d73d6c commit d4ec7c8

2 files changed: +2 -1 lines changed

launcher_scripts/conf/training/gpt3/175b.yaml

Lines changed: 1 addition & 0 deletions
@@ -156,6 +156,7 @@ model:
   use_cpu_initialization: False # Init weights on the CPU (slow for large models)
   onnx_safe: False # Use work-arounds for known problems with Torch ONNX exporter.
   apex_transformer_log_level: 30 # Python logging level displays logs with severity greater than or equal to this
+  gc_interval: 10 #Manual garbage collection
 
   # Nsys profiling options
   nsys_profile:

launcher_scripts/conf/training/llama/llama2_70b.yaml

Lines changed: 1 addition & 1 deletion
@@ -139,7 +139,7 @@ model:
   use_flash_attention: true
   overlap_p2p_comm: true
   batch_p2p_comm: false
-  gc_interval: 100
+  gc_interval: 10
   optim:
     name: distributed_fused_adam
     lr: 0.00015
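
The diff does not show how gc_interval is consumed by the training code. As a rough illustration only, the Python sketch below assumes the common pattern for manual garbage collection: automatic collection is disabled, and gc.collect() is called every gc_interval steps so that collection pauses happen at predictable points. The names should_collect, run_steps, and train_step are hypothetical and are not taken from this repository.

```python
import gc


def should_collect(step: int, gc_interval: int) -> bool:
    """Return True when a manual collection is due at this step.

    Assumption: a gc_interval of 0 (or less) means manual collection is off.
    """
    return gc_interval > 0 and step % gc_interval == 0


def run_steps(num_steps: int, gc_interval: int, train_step=lambda step: None):
    """Run a toy training loop and return the steps at which gc.collect() ran.

    train_step is a hypothetical placeholder for one real training iteration.
    """
    collected_at = []
    auto_gc_was_enabled = gc.isenabled()
    if gc_interval > 0:
        # Turn off automatic collection so it cannot fire mid-step.
        gc.disable()
    try:
        for step in range(1, num_steps + 1):
            train_step(step)
            if should_collect(step, gc_interval):
                gc.collect()
                collected_at.append(step)
    finally:
        # Restore the interpreter's previous GC setting.
        if auto_gc_was_enabled:
            gc.enable()
    return collected_at


if __name__ == "__main__":
    # With gc_interval: 10, a 30-step run collects at steps 10, 20 and 30.
    print(run_steps(30, 10))
```

Under this assumption, lowering gc_interval from 100 to 10 makes collections ten times as frequent, which trades a little more collection overhead for less garbage accumulating between collections.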
