
This repository hosts the instructions and workshop materials for Lab 334: Evaluate and improve the quality and safety of your AI applications


LAB 334: Evaluate and improve the quality and safety of your AI applications

Important

This lab is designed for both instructor-led (75-minute, in-venue) and self-guided (self-paced, at home) sessions.

  1. Instructor-led session attendees. Listen to the in-venue instructions and follow directions in the Skillable VM panel.
  2. Self-guided session attendees. You will need to take a few extra steps for setup. Start with the Self-Guided: Quickstart below.

Workshop Description

You’ve built a custom AI application grounded in your enterprise data. How do you ensure response quality and safety? Join us as we explore AI-assisted evaluation workflows with built-in and custom evaluators on Azure AI. Learn what each metric represents, then understand how to analyze the scores for your specific application. Learn why observability is key, and how the generated telemetry can be used both locally and in the cloud to help you assess and debug your application's performance.

  • Level: Intermediate/Advanced
  • Duration: 75 minutes

Learning Objectives

By the end of this workshop you should be able to:

  1. Explain key evaluation metrics and workflows for GenAIOps
  2. Create and run code-first evaluations with your Azure AI project
  3. Understand and use built-in evaluators for quality and safety
  4. Understand and use custom evaluators for your application needs
  5. Understand and use simulators to create datasets for evaluations
  6. Run manual evaluations and view results in the Azure AI Foundry portal

Prerequisites

To complete this lab, you need:

  1. A personal GitHub account → create one for free if needed
  2. An Azure subscription → with quota for the required models
  3. Familiarity with Python → and the use of Jupyter notebooks
  4. Familiarity with Generative AI → basic tools and concepts

An Azure subscription pre-provisioned with the required models and application infrastructure will be made available to Microsoft Build Lab 334 attendees in-venue, for the duration of the session.

Self-Guided: Quickstart

To get started on this lab at home, follow these steps:

  1. Fork this repo to your personal profile
  2. Launch GitHub Codespaces and wait until it is ready
  3. Open the VS Code terminal in the codespace
  4. Run `mkdocs serve > /dev/null 2>&1 &`
  5. Select the browser option in the pop-up dialog

You will see a preview of the instruction guide in a new tab. Click the "Workshop" menu item to get started.

Questions & Feedback

We welcome feedback to help us improve the learning experience.

  1. File an issue. We welcome feedback on ways to improve the workshop for future learners.
  2. Join the Azure AI Foundry Discord. Meet Azure AI community members and share insights.
  3. Visit the Azure AI Foundry Developer Forum. Get the latest updates on Azure AI Foundry.

Session Resources

The material in this repo was presented at Microsoft Build 2025 under the session title "Evaluate Reasoning Models For Your Generative AI Solutions". Please access these links for the session materials, recording, and Learn resources.

| Resource | Link | Description |
| --- | --- | --- |
| Build session page | https://build.microsoft.com/sessions/LAB334 | Event session page with downloadable recording, slides, resources, and speaker bio |
| Microsoft Learn | Develop, replatform, and improve AI apps via advanced Azure AI services | Official Collection or Plan with skilling resources to learn at your own pace |
