Skip to content

Conversation

@rasdani
Copy link

@rasdani rasdani commented Apr 16, 2025

adds verifier for reasoning-gym's procedurally generated reasoning datasets.
see their GALLERY.md for a run down of the different tasks.

tested with https://huggingface.co/datasets/rasdani/reasoning-gym-dataset-debug-pi-format

can be made a bit tidier, when open-thought/reasoning-gym#422 gets merged.

@rasdani
Copy link
Author

rasdani commented Apr 22, 2025

can be merged as is.

I'm waiting for first fine-tune / ablation results in their discord channel.
We would have an idea then what tasks and difficulty to put in the final dataset.

@rasdani
Copy link
Author

rasdani commented Jun 2, 2025

Nvidia's and RG's paper with infos on e.g. dataset composition.
https://arxiv.org/abs/2505.24864
https://arxiv.org/abs/2505.24760

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant