generated from fastai/nbdev_template
-
Notifications
You must be signed in to change notification settings - Fork 2.1k
Pull requests: huggingface/trl
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Fix
clone_chat_template
vocab size and support PEFT instruction tuning
#3763
opened Jul 24, 2025 by
qgallouedec
Loading…
Prevent NCCL Device Conflicts Between vLLM Server and Trainers
#3762
opened Jul 23, 2025 by
CarlosArguilar
Loading…
5 tasks done
change doc for
num_iterations
and steps_per_generation
to hopefully make them more clear and differentiate between them more clearly
#3761
opened Jul 23, 2025 by
avishaiElmakies
Loading…
2 of 5 tasks
[GRPO] add support for pixel_attention_mask (smolvlm2) and image_sizes (llavanext)
#3760
opened Jul 23, 2025 by
kashif
Loading…
Dynamic sampling option in GRPO trainer based on DAPO paper
#3758
opened Jul 23, 2025 by
almeidava93
Loading…
2 of 5 tasks
🔔 Add deprecation warnings for
AlignPropTrainer
and DDPOTrainer
#3755
opened Jul 22, 2025 by
qgallouedec
Loading…
5 tasks
Add basic support for FSDP/Lora when using TRL/VLLM
#3735
opened Jul 14, 2025 by
ojh31
Loading…
5 tasks
Add warn0 utility and replace warnings.warn with rank-aware warnings in trainer
#3734
opened Jul 14, 2025 by
yafshar
Loading…
1 of 5 tasks
feat: Initial implementation of RePO trainer and components
#3655
opened Jun 26, 2025 by
celsowm
Loading…
5 tasks
Ensure Chat Template Safe Prompt Truncation
#3646
opened Jun 25, 2025 by
pramodith
Loading…
4 of 5 tasks
🔍 Add guidance on choosing
max_length
value and include visualizati…
#3630
opened Jun 22, 2025 by
qgallouedec
Loading…
5 tasks
ClearML logging of visualization in RewardTrainer evaluation
#3602
opened Jun 16, 2025 by
ioverho
Loading…
2 of 5 tasks
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.