Actions: huggingface/trl
Actions
725 workflow runs
725 workflow runs
GRPOTrainer
with top_entropy_quntile < 1
causes hang with multi gpu training
Hugging Face Issue Labeler
#643:
Issue #3933
opened
by
avishaiElmakies
apply_chat_template
behaviour for multimodal dataset between SFT and GRPO
Hugging Face Issue Labeler
#635:
Issue #3915
opened
by
ishaan-rawal-ai