Skip to content

Actions: huggingface/trl

Actions

Hugging Face Issue Labeler

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
725 workflow runs
725 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

PPO grad_norm is 0.0
Hugging Face Issue Labeler #651: Issue #3961 opened by faker52
27s
sft_video_llm example fail
Hugging Face Issue Labeler #650: Issue #3958 opened by yao-matrix
22s
sft_gemma3 example doesn't work
Hugging Face Issue Labeler #649: Issue #3957 opened by yao-matrix
27s
Why not use AutoModel for ref_model in grpo trainer?
Hugging Face Issue Labeler #648: Issue #3948 opened by csshihao
30s
Can we avoid saving the optimization stage?
Hugging Face Issue Labeler #646: Issue #3944 opened by HelloWorldLTY
22s
Add support for RLPR
Hugging Face Issue Labeler #642: Issue #3928 opened by mitchelldehaven
23s
How to use trl-SFTTrainer to train Qwen-30B-A3B?
Hugging Face Issue Labeler #636: Issue #3918 opened by JeffWb
23s
[GRPO Trainer] Accuracy reward stays 0
Hugging Face Issue Labeler #633: Issue #3903 opened by Revist
25s
sft_gemma3 example fail
Hugging Face Issue Labeler #632: Issue #3901 opened by yao-matrix
30s
Bug in BFD packing
Hugging Face Issue Labeler #628: Issue #3887 opened by RicardoDominguez
29s