[feat]: Adding azure NLP graders and python grader #4706

yeshsurya · 2025-12-30T08:11:01Z

This pull request introduces significant enhancements to the reward scoring and grading functionality within the ACPT-RFT environment. The main updates include the addition of new modular grader scripts for Azure-based and Python-based grading, an extensible reward scoring entry point, and several dependency updates and Dockerfile improvements to support these features.

Reward scoring and grading enhancements:

Added a new default_compute_score function in __init__.py to serve as a unified entry point for reward scoring across multiple data sources, supporting both legacy and new grader modules.
Introduced azure_grader.py, a flexible grader supporting both string matching and text similarity metrics (BLEU, ROUGE, METEOR, etc.), with fallback logic if the HuggingFace evaluate library is unavailable.
Added azure_python_grader.py, a customizable Python code grader that validates syntax using AST parsing, intended for user extension and custom grading logic.

Dockerfile and dependency updates:

Upgraded verl from version 0.6.0 to 0.6.1, and updated vllm from 0.11.1 to 0.13.0; added new dependencies openai and DeepGEMM, and copied new grader scripts into the appropriate locations in the Docker image. [1] [2]
Ensured new grader modules (azure_grader.py, azure_python_grader.py, and updated __init__.py) are available in the verl.utils.reward_score package by copying them in the Docker build.

These changes collectively provide a more modular, extensible, and robust reward scoring framework, enabling easier customization and improved evaluation capabilities for different data sources and grading requirements.

Test links

github-actions · 2025-12-30T08:11:43Z

Test Results for assets-test

0 tests 0 ✅ 0s ⏱️
0 suites 0 💤
0 files 0 ❌

Results for commit 3388c1e.

♻️ This comment has been updated with latest results.

yeshsurya requested review from a team as code owners December 30, 2025 08:11

yeshsurya added 2 commits December 30, 2025 13:41

[feat]: Adding azure NLP graders and python grader

69addf2

[grader]: Adding sacrebleu package for one of inbuilt grader

3388c1e

yeshsurya force-pushed the yeshwanth/rft_env_update_for_grader_extension branch from b866113 to 3388c1e Compare December 30, 2025 08:11

yeshsurya temporarily deployed to Testing December 30, 2025 08:11 — with GitHub Actions Inactive

novaturient95 approved these changes Dec 30, 2025

View reviewed changes

iamrk04 approved these changes Dec 30, 2025

View reviewed changes

yeshsurya merged commit 676179c into main Dec 31, 2025
36 checks passed

yeshsurya deleted the yeshwanth/rft_env_update_for_grader_extension branch December 31, 2025 04:47

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[feat]: Adding azure NLP graders and python grader #4706

[feat]: Adding azure NLP graders and python grader #4706

Uh oh!

yeshsurya commented Dec 30, 2025

Uh oh!

github-actions bot commented Dec 30, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[feat]: Adding azure NLP graders and python grader #4706

[feat]: Adding azure NLP graders and python grader #4706

Uh oh!

Conversation

yeshsurya commented Dec 30, 2025

Uh oh!

github-actions bot commented Dec 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Test Results for assets-test

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

github-actions bot commented Dec 30, 2025 •

edited

Loading