Skip to content

Actions: huggingface/trl

Actions

Hugging Face Issue Labeler

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
725 workflow runs
725 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

truncate mode for response
Hugging Face Issue Labeler #625: Issue #3878 opened by shiwanghua
23s
Dynamic Fine Tuning, an improvement of SFT
Hugging Face Issue Labeler #624: Issue #3877 opened by 1485840691
27s
Add reward functions to support RLCR
Hugging Face Issue Labeler #622: Issue #3871 opened by pramodith
32s
'LLMEngine' object has no attribute 'model_executor'
Hugging Face Issue Labeler #617: Issue #3859 opened by EvilCalf
40s
vllm prepends two BOS for LLama
Hugging Face Issue Labeler #614: Issue #3853 opened by wenquanlu
21s
Issues at GRPO with VLM
Hugging Face Issue Labeler #613: Issue #3847 opened by Fhrozen
27s
Ideas to Improve GRPO Training Speed
Hugging Face Issue Labeler #612: Issue #3846 opened by jp1924
26s
accelerate reducing the batch size and crashing GRPO
Hugging Face Issue Labeler #611: Issue #3842 opened by limlimg
31s
Wrong default clipping params for GSPO
Hugging Face Issue Labeler #609: Issue #3834 opened by pramodith
25s
TRL doesn't support gemma-3
Hugging Face Issue Labeler #607: Issue #3828 opened by awestover
41s
DataCollatorForCompletionOnlyLM
Hugging Face Issue Labeler #606: Issue #3827 opened by tejassaboo
21s
RLOOTrainer tldr experiments not reproducible
Hugging Face Issue Labeler #605: Issue #3825 opened by jltchiu
31s
Processing class does not have EOS token
Hugging Face Issue Labeler #603: Issue #3822 opened by debasisdwivedy
31s