Pull requests: huggingface/trl


Pull requests list

[GRPO] Adds option to disable dropout
#3234 opened Apr 4, 2025 by edbeeching
overlong filtering
#3229 opened Apr 4, 2025 by shirinyamani
5 tasks
remove unused optimizer config in prepare_deepspeed
#3228 opened Apr 3, 2025 by liguohao96
1 of 5 tasks
Add a raw generate API to the vLLM server
#3227 opened Apr 3, 2025 by wilrop
5 tasks
Support iterable datasets in GRPO
#3226 opened Apr 3, 2025 by wilrop
5 tasks
Fix online DPO crash when model is a DataParallel object
#3225 opened Apr 3, 2025 by wilrop
5 tasks
Simplify logging text
#3219 opened Apr 3, 2025 by qgallouedec
5 tasks
🌊 Add error for iterable datasets in GRPOTrainer
#3216 opened Apr 2, 2025 by qgallouedec
5 tasks
update weight update process group
#3211 opened Apr 2, 2025 by ji-huazhong Draft
5 tasks
Adding sampling parameters for vllm generation
#3210 opened Apr 2, 2025 by shaipranesh2
GRPO: Scalable training with one LLM/node
#3186 opened Mar 31, 2025 by jglaser Draft
4 tasks
🏃 Faster CI
#3160 opened Mar 25, 2025 by qgallouedec
5 tasks
Fix: Compatibility for formatting_func returning a list
#3147 opened Mar 24, 2025 by YeFD
4 of 5 tasks
Fix length bias for Dr GRPO
#3138 opened Mar 23, 2025 by idoru
5 tasks
Extend BCO Trainer dataset format support
#3134 opened Mar 22, 2025 by reihig-ut
1 of 5 tasks
feat: Add Interleaved Trainer implementation
#3107 opened Mar 18, 2025 by ucalyptus2
3 tasks done