
environments acft hf nlp grpo

github-actions[bot] edited this page Jul 11, 2025 · 1 revision

acft-hf-nlp-grpo

Overview

Environment used by the Group Relative Policy Optimization (GRPO) component.

Version: 1

Tags: Preview

View in Studio: https://ml.azure.com/registries/azureml/environments/acft-hf-nlp-grpo/version/1

Docker image: mcr.microsoft.com/azureml/curated/acft-hf-nlp-grpo:1
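To refer to this environment from a job definition, the registry name, environment name, and version shown above are usually combined into an `azureml://` asset URI. The snippet below is a minimal sketch of that string; the helper `registry_environment_uri` is a hypothetical name introduced only for illustration and is not part of any SDK.

```python
def registry_environment_uri(registry: str, name: str, version: str) -> str:
    """Build an azureml:// asset URI for an environment stored in a registry.

    Hypothetical helper for illustration; it only formats a string.
    """
    return f"azureml://registries/{registry}/environments/{name}/versions/{version}"


# Values taken from the "View in Studio" link above.
ENV_URI = registry_environment_uri("azureml", "acft-hf-nlp-grpo", "1")
print(ENV_URI)
```

A string of this form is typically what goes in the `environment` field of a job or component definition.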

Docker build context

Dockerfile

#PTCA image
FROM mcr.microsoft.com/aifx/acpt/stable-ubuntu2204-cu126-py310-torch271:biweekly.202506.3

USER root

RUN pip install --upgrade pip

COPY requirements.txt .
RUN pip install -r requirements.txt --no-cache-dir

RUN pip install azureml-acft-common-components==0.0.76
RUN pip install numpy==2.2.5
RUN pip install azureml-evaluate-mlflow==0.0.76
RUN pip install mlflow==3.1.0
RUN pip install transformers==4.52.1

# clean pip cache
RUN rm -rf ~/.cache/pip
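
Because each pinned package is installed by a separate `pip install` step, a later step can in principle change a version installed by an earlier one. One way to confirm the pins inside a running container is to compare them with the installed package metadata. The following is a sketch under that assumption, not part of the published image; `PINNED`, `installed_version`, and `find_mismatches` are names introduced here for illustration.

```python
from importlib.metadata import PackageNotFoundError, version

# Pins copied from the Dockerfile above.
PINNED = {
    "azureml-acft-common-components": "0.0.76",
    "numpy": "2.2.5",
    "azureml-evaluate-mlflow": "0.0.76",
    "mlflow": "3.1.0",
    "transformers": "4.52.1",
}


def installed_version(package):
    """Return the installed version of a package, or None if it is absent."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


def find_mismatches(pinned, lookup=installed_version):
    """Map each package whose installed version differs from its pin
    to an (expected, actual) tuple. `lookup` is injectable for testing."""
    mismatches = {}
    for package, expected in pinned.items():
        actual = lookup(package)
        if actual != expected:
            mismatches[package] = (expected, actual)
    return mismatches


if __name__ == "__main__":
    for package, (expected, actual) in find_mismatches(PINNED).items():
        print(f"{package}: expected {expected}, found {actual or 'not installed'}")
```

Running the script inside the container prints one line per package that does not match its pin, and prints nothing when every pin is satisfied.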