Skip to content

Pull requests: huggingface/trl

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Add config_init_kwargs option in GRPOConfig
#4069 opened Sep 12, 2025 by hokuyama0106 Loading…
2 of 5 tasks
Add VLM support to RLOO trainer
#4067 opened Sep 11, 2025 by behroozazarkhalili Loading…
feat: Add NPU and XPU support for activation offloading
#4056 opened Sep 10, 2025 by zilongzheng Loading…
2 of 5 tasks
Enable XPU for vllm client
#4031 opened Sep 8, 2025 by jiqing-feng Loading…
vllm sleep mode support
#4028 opened Sep 8, 2025 by ved1beta Loading…
2 of 5 tasks
Fix: undefined current_gradient_accumulation_steps
#4014 opened Sep 5, 2025 by ysjprojects Loading…
2 of 5 tasks
Improve typing of SFT trainer
#4007 opened Sep 4, 2025 by cyyever Loading…
[GFPO]: implement GFPO in GRPOTrainer
#3989 opened Sep 1, 2025 by Peter-Chou Loading…
3 of 5 tasks
fix bug when using dataset streaming by accelerate
#3950 opened Aug 25, 2025 by kaixuanliu Loading…
🐳 Docker update
#3931 opened Aug 20, 2025 by qgallouedec Loading…
[SFTTrainer]: Check for assistant mask up to max_length
#3930 opened Aug 20, 2025 by pramodith Loading…
3 of 5 tasks
[DRAFT] Refactor DPO
#3906 opened Aug 15, 2025 by qgallouedec Draft
5 tasks
Test in distributed setting
#3902 opened Aug 15, 2025 by qgallouedec Loading…
5 tasks
ProTip! Filter pull requests by the default branch with base:main.