What's Changed
- ♨️ [GRPO] Fix potential hang in get_high_entropy_mask by @akakakakakaa in #4041
- Aux loss is already included in the loss returned by Transformers by @pramodith in #4078
- Fix get_peft_model() so that prepare_model_for_kbit_training is not reapplied to an instance of PeftModel, which would freeze all the layers, by @Hoesu in #4081
- 🐯 fix: use_liger_kernel with IterableDataset by @jue-jue-zi in #4087
- [SFTTrainer]: Fix DFT Loss by @pramodith in #4112
- ⚡ Fix Flash Attention x Padding-Free loss by @qgallouedec in #4170
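The aux-loss fix (#4078) addresses a double-counting pattern: Transformers MoE models (e.g. those configured with output_router_logits=True) already fold the router auxiliary loss into the loss they return, so adding the aux loss again in the trainer counts it twice. A minimal sketch with illustrative numbers (the values and variable names are ours, not from the codebase):

```python
# Illustrative values standing in for a model's losses.
main_loss, aux_loss = 2.0, 0.5

# What the model's forward already returns: main loss + aux loss combined.
reported_loss = main_loss + aux_loss

# Old (buggy) pattern: adding the aux loss on top double-counts it.
double_counted = reported_loss + aux_loss

# Fixed behavior: use the returned loss as-is.
correct = reported_loss

print(double_counted, correct)  # 3.0 2.5
```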
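The get_peft_model() fix (#4081) concerns ordering: prepare_model_for_kbit_training freezes the base model's parameters, so running it again on a model that has already been wrapped as a PeftModel freezes the adapter weights too, leaving nothing trainable. The sketch below uses a simplified stand-in for the freezing step (plain torch, not the real PEFT API) to show the bug pattern:

```python
import torch.nn as nn

# Simplified stand-in for prepare_model_for_kbit_training: it freezes
# every parameter of the model it is given.
def prepare_for_kbit_training(model: nn.Module) -> nn.Module:
    for p in model.parameters():
        p.requires_grad = False
    return model

base = nn.Linear(4, 4)
prepare_for_kbit_training(base)    # correct order: freeze the base first

adapter = nn.Linear(4, 4)          # stand-in for a trainable LoRA adapter
wrapped = nn.Sequential(base, adapter)

# Bug pattern fixed in #4081: preparing the already-wrapped model again
# freezes the adapter as well, so no parameter remains trainable.
prepare_for_kbit_training(wrapped)
print(any(p.requires_grad for p in wrapped.parameters()))  # False
```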
New Contributors
Full Changelog: v0.23.0...v0.23.1