Important
This lab is designed for both instructor-led (75 minutes, in-venue) and self-guided (self-paced, at home) sessions.
- Instructor-led session attendees. Listen to the in-venue instructions and follow directions in the Skillable VM panel.
- Self-guided session attendees. You will need to take a few extra steps for setup. Start with the Self-Guided: Quickstart below.
You’ve built a custom AI application grounded in your enterprise data. How do you ensure response quality and safety? Join us as we explore AI-assisted evaluation workflows with built-in and custom evaluators on Azure AI. Learn what each metric represents and how to interpret the scores for your specific application. Learn why observability is key, and how the generated telemetry can be used both locally and in the cloud to help you assess and debug your application's performance.
- Level: Intermediate/Advanced
- Duration: 75 minutes
By the end of this workshop you should be able to:
- Explain key evaluation metrics and workflows for GenAIOps
- Create and run code-first evaluations with your Azure AI project
- Understand and use built-in evaluators for quality and safety
- Understand and use custom evaluators for your application needs
- Understand and use simulators to create datasets for evaluations
- Run manual evaluations and view results in the Azure AI Foundry portal
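To make the idea of a custom evaluator concrete, here is a minimal sketch. It assumes a code-based evaluator can be modeled as a plain Python callable that takes keyword arguments for one dataset row and returns a dictionary of scores. The class name `AnswerLengthEvaluator`, the word budget, and the sample rows are hypothetical and are not taken from the workshop notebooks. No Azure SDK calls are made.

```python
class AnswerLengthEvaluator:
    """Hypothetical custom evaluator: checks a response against a word budget."""

    def __init__(self, max_words: int = 50):
        self.max_words = max_words

    def __call__(self, *, response: str, **kwargs) -> dict:
        # Extra row fields (such as the query) arrive in kwargs and are ignored.
        word_count = len(response.split())
        return {
            "word_count": word_count,
            "within_budget": word_count <= self.max_words,
        }


# Hypothetical evaluation rows; a real dataset would come from your application.
rows = [
    {
        "query": "What is the return policy?",
        "response": "You can return items within 30 days.",
    },
    {
        "query": "What is the return policy?",
        "response": "Our return policy is flexible and depends on many "
                    "different factors and conditions.",
    },
]

evaluator = AnswerLengthEvaluator(max_words=8)

# Score each row, then aggregate the per-row results into one summary metric.
results = [evaluator(**row) for row in rows]
pass_rate = sum(r["within_budget"] for r in results) / len(results)

print(results)
print(f"pass rate: {pass_rate:.2f}")
```

The same shape (one call per row, one dictionary of named scores per call) makes it straightforward to aggregate results or compare them across runs.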
To complete this lab you need:
- A personal GitHub account → create one for free if needed
- An Azure subscription → with quota for the required models
- Familiarity with Python → and usage of Jupyter notebooks
- Familiarity with Generative AI → basic tools and concepts
An Azure subscription pre-provisioned with the required models and application infrastructure will be made available to Microsoft Build Lab 334 attendees in-venue, for the duration of the session.
To get started on this lab at home, follow these steps:
- Fork this repo to your personal profile
- Launch GitHub Codespaces and wait until it is ready
- Open the VS Code terminal in Codespaces
- Run `mkdocs serve > /dev/null 2>&1 &`
- Select the browser option in the pop-up dialog
You will see a preview of the instruction guide in a new tab. Click the "Workshop" menu item to get started.
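The `mkdocs serve` command in the steps above starts the preview server in the background (`&`) and discards both its standard output and standard error (`> /dev/null 2>&1`). For readers who prefer to start it from Python, for example from a notebook cell, the following is a rough equivalent. The helper name `start_in_background` is hypothetical, and the sketch assumes `mkdocs` is installed and on your `PATH`.

```python
import subprocess


def start_in_background(command):
    """Start a command without waiting for it and discard its output.

    Roughly mirrors the shell form `command > /dev/null 2>&1 &`.
    """
    return subprocess.Popen(
        command,
        stdout=subprocess.DEVNULL,  # like `> /dev/null`
        stderr=subprocess.DEVNULL,  # like `2>&1` after the redirect above
    )


# Example usage (assumes mkdocs is installed and this is the repo root):
# server = start_in_background(["mkdocs", "serve"])
# print(f"Preview server started with PID {server.pid}")
# server.terminate()  # stop the preview server when you are done
```

`Popen` returns immediately, so the calling session stays free for other work, which is the same reason the shell command ends with `&`.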
We welcome feedback to help us improve the learning experience.
- File an issue. Tell us how we can improve the workshop for future learners.
- Join the Azure AI Foundry Discord. Meet Azure AI community members and share insights.
- Visit the Azure AI Foundry Developer Forum. Get the latest updates on Azure AI Foundry.
The material in this repo was presented at Microsoft Build 2025 under the session title "Evaluate Reasoning Models For Your Generative AI Solutions". Use the links below to access the session materials, recording, and Learn resources.
| Resource | Link | Description |
|---|---|---|
| Build session page | https://build.microsoft.com/sessions/LAB334 | Event session page with downloadable recording, slides, resources, and speaker bio |
| Microsoft Learn | Develop, replatform, and improve AI apps via advanced Azure AI services | Official Collection or Plan with skilling resources to learn at your own pace |