Skip to content

Commit f6b8e08

Browse files
committed
remove unused epsilon-clip
1 parent d775bec commit f6b8e08

File tree

8 files changed

+0
-8
lines changed

8 files changed

+0
-8
lines changed

configs/grpo_che_config.yaml

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -47,7 +47,6 @@ grpo:
4747
discount: 0.9
4848
termination_threshold: -0.1
4949
reward_shift: -2.1
50-
epsilon_clip: null
5150

5251
# wandb
5352
wandb:

configs/grpo_he_config.yaml

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -47,7 +47,6 @@ grpo:
4747
discount: 0.9
4848
termination_threshold: -0.1
4949
reward_shift: -2.1
50-
epsilon_clip: null
5150

5251
# wandb
5352
wandb:

configs/grpo_mbpp_config.yaml

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -47,7 +47,6 @@ grpo:
4747
discount: 0.9
4848
termination_threshold: -0.1
4949
reward_shift: -2.1
50-
epsilon_clip: null
5150

5251
# wandb
5352
wandb:

configs/magrpo_che_config.yaml

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -48,7 +48,6 @@ magrpo:
4848
discount: 0.9
4949
termination_threshold: -0.2
5050
reward_shift: -4
51-
epsilon_clip: null
5251

5352
# wandb
5453
wandb:

configs/magrpo_he_config.yaml

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -46,7 +46,6 @@ magrpo:
4646
discount: 0.9
4747
termination_threshold: -0.2
4848
reward_shift: -4
49-
epsilon_clip: null
5049

5150
# wandb
5251
wandb:

configs/magrpo_mbpp_config.yaml

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -48,7 +48,6 @@ magrpo:
4848
discount: 0.9
4949
termination_threshold: -0.2
5050
reward_shift: -4
51-
epsilon_clip: null
5251

5352
# wandb
5453
wandb:

train_grpo.py

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -366,7 +366,6 @@ def _resolver(prompt: str):
366366
discount=grpo_config.get("discount", 0.9),
367367
joint_mode=grpo_config.get("joint_mode", "aligned"),
368368
termination_threshold=grpo_config.get("termination_threshold", None),
369-
epsilon_clip=grpo_config.get("epsilon_clip", None),
370369
)
371370

372371
# ------------------------------------------------------------------

train_magrpo.py

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -419,7 +419,6 @@ def _resolver(prompt: str):
419419
discount=magrpo_config.get("discount", 0.9),
420420
joint_mode=magrpo_config.get("joint_mode", "aligned"),
421421
termination_threshold=magrpo_config.get("termination_threshold", None),
422-
epsilon_clip=magrpo_config.get("epsilon_clip", None),
423422
)
424423

425424
# ------------------------------------------------------------------

0 commit comments

Comments
 (0)