generated from fastai/nbdev_template
Pull requests: huggingface/trl
Pull requests list
fix wrong logit slicing in grpo _get_per_token_logps_and_entropies
#3682
opened Jul 2, 2025 by
ahatamiz
2 of 5 tasks
Restore the effect of liger_kernel's monkey_patch on global modules in UT.
#3680
opened Jul 2, 2025 by
YangKai0616
fix: support dict access in SFT Trainer
#3677
opened Jul 2, 2025 by
jannisborn
4 of 5 tasks
feat: Initial implementation of RePO trainer and components
#3655
opened Jun 26, 2025 by
celsowm
5 tasks
Faster position_ids computation for FFD packing
#3649
opened Jun 25, 2025 by
mariosasko
1 of 5 tasks
Ensure Chat Template Safe Prompt Truncation
#3646
opened Jun 25, 2025 by
pramodith
4 of 5 tasks
🔍 Add guidance on choosing max_length value and include visualizati…
#3630
opened Jun 22, 2025 by
qgallouedec
5 tasks
ClearML logging of visualization in RewardTrainer evaluation
#3602
opened Jun 16, 2025 by
ioverho
2 of 5 tasks
🎀 New defaults: gradient_checkpointing=True
#3510
opened May 29, 2025 by
qgallouedec
5 tasks
Add Bidirectional Knowledge Distillation Option to GKDTrainer
#3508
opened May 29, 2025 by
shaischaudhry
3 of 5 tasks