Skip to content

[Algorithm] GRPO scripts #2970

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 106 commits into
base: gh/vmoens/142/base
Choose a base branch
from
Open

[Algorithm] GRPO scripts #2970

wants to merge 106 commits into from

Conversation

vmoens
Copy link
Collaborator

@vmoens vmoens commented May 22, 2025

Stack from ghstack (oldest at bottom):

[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request May 22, 2025
ghstack-source-id: 22a66ef
Pull-Request-resolved: #2970
Copy link

pytorch-bot bot commented May 22, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/2970

Note: Links to docs will display an error until the docs builds have been completed.

❌ 11 New Failures, 1 Pending, 4 Unrelated Failures

As of commit fa89fc0 with merge base 023c965 (image):

NEW FAILURES - The following jobs have failed:

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label May 22, 2025
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request May 22, 2025
ghstack-source-id: c1fad8d
Pull-Request-resolved: #2970
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request May 22, 2025
ghstack-source-id: 09cda67
Pull-Request-resolved: #2970
@vmoens vmoens added the new algo New algorithm request or PR label May 22, 2025
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request May 22, 2025
ghstack-source-id: a711348
Pull-Request-resolved: #2970
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request May 22, 2025
ghstack-source-id: 160d734
Pull-Request-resolved: #2970
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request May 22, 2025
ghstack-source-id: b20e61e
Pull-Request-resolved: #2970
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request May 22, 2025
ghstack-source-id: 879f74a
Pull-Request-resolved: #2970
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request May 22, 2025
ghstack-source-id: 0741c8f
Pull-Request-resolved: #2970
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request May 22, 2025
ghstack-source-id: 6a8fa1e
Pull-Request-resolved: #2970
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request May 22, 2025
ghstack-source-id: 6768b25
Pull-Request-resolved: #2970
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request May 23, 2025
ghstack-source-id: 9308ae6
Pull-Request-resolved: #2970
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request May 23, 2025
ghstack-source-id: b3c20dd
Pull-Request-resolved: #2970
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request May 23, 2025
ghstack-source-id: 5bd176f
Pull-Request-resolved: #2970
[ghstack-poisoned]
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Jun 7, 2025
ghstack-source-id: f4f38b5
Pull-Request-resolved: #2970
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Jun 7, 2025
ghstack-source-id: c27fc2e
Pull-Request-resolved: #2970
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Jun 7, 2025
ghstack-source-id: ac6a656
Pull-Request-resolved: #2970
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Jun 7, 2025
ghstack-source-id: 431313c
Pull-Request-resolved: #2970
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Jun 7, 2025
ghstack-source-id: caceecd
Pull-Request-resolved: #2970
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Jun 7, 2025
ghstack-source-id: 1e33fa6
Pull-Request-resolved: #2970
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Jun 7, 2025
ghstack-source-id: 05b32bf
Pull-Request-resolved: #2970
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Jun 7, 2025
ghstack-source-id: 6654235
Pull-Request-resolved: #2970
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Jun 7, 2025
ghstack-source-id: df09d5f
Pull-Request-resolved: #2970
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Jun 7, 2025
ghstack-source-id: c47691d
Pull-Request-resolved: #2970
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Jun 7, 2025
ghstack-source-id: 37e42fa
Pull-Request-resolved: #2970
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Jun 7, 2025
ghstack-source-id: ef81464
Pull-Request-resolved: #2970
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Jun 7, 2025
ghstack-source-id: 03b33ea
Pull-Request-resolved: #2970
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Jun 7, 2025
ghstack-source-id: 4b80be9
Pull-Request-resolved: #2970
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Jun 7, 2025
ghstack-source-id: a5aa8e1
Pull-Request-resolved: #2970
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. new algo New algorithm request or PR
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants