Skip to content

Conversation

@yeshsurya
Copy link
Contributor

This pull request introduces significant enhancements to the reward scoring and grading functionality within the ACPT-RFT environment. The main updates include the addition of new modular grader scripts for Azure-based and Python-based grading, an extensible reward scoring entry point, and several dependency updates and Dockerfile improvements to support these features.

Reward scoring and grading enhancements:

  • Added a new default_compute_score function in __init__.py to serve as a unified entry point for reward scoring across multiple data sources, supporting both legacy and new grader modules.
  • Introduced azure_grader.py, a flexible grader supporting both string matching and text similarity metrics (BLEU, ROUGE, METEOR, etc.), with fallback logic if the HuggingFace evaluate library is unavailable.
  • Added azure_python_grader.py, a customizable Python code grader that validates syntax using AST parsing, intended for user extension and custom grading logic.

Dockerfile and dependency updates:

  • Upgraded verl from version 0.6.0 to 0.6.1, and updated vllm from 0.11.1 to 0.13.0; added new dependencies openai and DeepGEMM, and copied new grader scripts into the appropriate locations in the Docker image. [1] [2]
  • Ensured new grader modules (azure_grader.py, azure_python_grader.py, and updated __init__.py) are available in the verl.utils.reward_score package by copying them in the Docker build.

These changes collectively provide a more modular, extensible, and robust reward scoring framework, enabling easier customization and improved evaluation capabilities for different data sources and grading requirements.

Test links

@yeshsurya yeshsurya requested review from a team as code owners December 30, 2025 08:11
@github-actions
Copy link

github-actions bot commented Dec 30, 2025

Test Results for assets-test

0 tests   0 ✅  0s ⏱️
0 suites  0 💤
0 files    0 ❌

Results for commit 3388c1e.

♻️ This comment has been updated with latest results.

@yeshsurya yeshsurya merged commit 676179c into main Dec 31, 2025
36 checks passed
@yeshsurya yeshsurya deleted the yeshwanth/rft_env_update_for_grader_extension branch December 31, 2025 04:47
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants