Release v0.3.0: Initial release · hiyouga/EasyR1

What's Changed

update readme by @hiyouga in #4
[readme] update readme by @hiyouga in #5
[worker] fix small models by @hiyouga in #14
feat: swanlab examples by @Zeyi-Lin in #13
[example] add ReMax support by @Shenzhi-Wang in #20
fix:vllm length by @AL-377 in #18
fix: math reward fn by @yueyang130 in #26
[readme] update readme by @hiyouga in #29
Fix template issue by @wzq016 in #31
[example] fix length by @hiyouga in #32
[readme] update hardware requirement by @hiyouga in #33
[worker] fix model attn init by @hiyouga in #37
Witness the Aha Moment on Counting Task by @BUAADreamer in #38
[example] fix clevr example by @hiyouga in #47
Fix: save processor for VLMs by @wzq016 in #48
[perf] support padding-free training for VLMs by @hiyouga in #61
[readme] update readme by @hiyouga in #62
[readme] add fig explain by @hiyouga in #64
[readme] update fig by @hiyouga in #65
[trainer] support resume ckpt by @hiyouga in #66
[config] update default config by @hiyouga in #68
[readme] update wechat by @hiyouga in #71
[env] fix memory leak & enable vLLM v1 by @hiyouga in #73
[readme] update readme by @hiyouga in #75
[readme] update readme by @hiyouga in #80
Add new baseline GeoQA8k from R1V by @chenllliang in #86
[feat] support freeze vision tower by @hiyouga in #99
[config] increase prompt length by @hiyouga in #100
update readme - add ## Awesome Work using EasyR1 by @LengSicong in #101
Add the work Vision-R1 that uses EasyR1 by @Osilly in #102
fix:OOM by @dirtyDan0 in #111
[trainer] verify arg by @hiyouga in #112
[misc] sync feat from upstream by @hiyouga in #113
[misc] clean some code by @hiyouga in #114
[example] add examples by @hiyouga in #118
[checkpoint] fix load checkpoint by @hiyouga in #119
[trainer] gather metrics by @hiyouga in #120
[misc] add doc string by @hiyouga in #121
Add seg zero to README by @LiuRicky in #122
Update README.md by @PzySeere in #124
fix readme by @hiyouga in #127
[core] remove entropy loss by @hiyouga in #132
[trainer] support val sampling by @hiyouga in #133
misc: save at the last step by @dirtyDan0 in #138
feat: swanlab add easyr1 and verl config by @Zeyi-Lin in #140
[version] upgrade vllm to 0.8 by @hiyouga in #143
[readme] update docker file by @hiyouga in #146
[readme] update wechat by @hiyouga in #147
[readme] update dockerfile by @hiyouga in #148
Update requirements.txt for multinode by @chenllliang in #154
[trainer] support channel-wise reward by @hiyouga in #155
Update README.md by @PzySeere in #157
[trainer] support save limit & fix oom issue by @hiyouga in #158
[misc] update docker files by @hiyouga in #162
[trainer] support 32b by @hiyouga in #164
[data] use hf-native template by @hiyouga in #165
[misc] fix dataset by @hiyouga in #166
[readme] update tutorial by @hiyouga in #167
[tracking] add tensorboard by @hiyouga in #170
[misc] support adamw bf16 by @hiyouga in #171
[misc] fix config by @hiyouga in #172
[misc] fix metrics by @hiyouga in #173
[misc] refactor val gen log by @hiyouga in #174
update Awesome Work using EasyR1 by @appletea233 in #179
[misc] fix masked mean by @hiyouga in #181
[misc] algo improvement by @hiyouga in #184
[misc] minor update by @hiyouga in #188
[fix] arg check by @hiyouga in #189
[bugfix] fix vllm 0.8.3 rollout by @hiyouga in #197
[deps] upgrade to vllm 0.8.3 by @hiyouga in #202
[core] separate score fn & vllm logit bias by @hiyouga in #204
Supports loading format prompt from a file by @Wangbiao2 in #208
[data] update data configs by @hiyouga in #214
fix: enable user to filter overlong examples in RLHFDataset by @0x404 in #210
[data] fix rl dataset by @hiyouga in #215
[misc] lint by @hiyouga in #216
[data] add multi image dataset by @hiyouga in #217
[readme] add multi node script by @hiyouga in #218
[torch] fix saving bf16 optimizer by @hiyouga in #221
[version] release 0.3.0 by @hiyouga in #222

New Contributors

@Zeyi-Lin made their first contribution in #13
@AL-377 made their first contribution in #18
@yueyang130 made their first contribution in #26
@wzq016 made their first contribution in #31
@BUAADreamer made their first contribution in #38
@chenllliang made their first contribution in #86
@LengSicong made their first contribution in #101
@Osilly made their first contribution in #102
@dirtyDan0 made their first contribution in #111
@LiuRicky made their first contribution in #122
@PzySeere made their first contribution in #124
@appletea233 made their first contribution in #179
@Wangbiao2 made their first contribution in #208
@0x404 made their first contribution in #210

Full Changelog: https://github.com/hiyouga/EasyR1/commits/v0.3.0

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

v0.3.0: Initial release

Choose a tag to compare

Sorry, something went wrong.

Sorry, something went wrong.

Uh oh!

No results found

What's Changed

New Contributors

Contributors

Uh oh!