feat: update knowledge distillation tutorial for using vllm with Qwen model #2960

RexBearIU · 2026-01-16T10:50:01Z

Description

This pull request significantly updates and modernizes the knowledge distillation tutorial for MaxText, aligning it with current best practices and tooling. The guide now uses Qwen3-32B as the teacher model (via vLLM) and Llama-3.1-8B as the student, streamlines the setup with Hyperdisk storage, and provides new scripts and commands for dataset generation and fine-tuning. The instructions have been clarified, unnecessary conversion steps removed for the teacher, and the fine-tuning process updated for the latest MaxText and vLLM workflows.

Tests

Manually triggered the distillation pipeline and monitored the execution flow step-by-step. Confirmed that the training loop finished and resources were released.

Checklist

Before submitting this PR, please make sure (put X in square brackets):

I have performed a self-review of my code. For an optional AI review, add the gemini-review label.
I have necessary comments in my code, particularly in hard-to-understand areas.
I have run end-to-end tests tests and provided workload links above if applicable.
I have made or will make corresponding changes to the doc if needed, including adding new documentation pages to the relevant Table of Contents (toctree directive) as explained in our documentation.

… model

feat: update knowledge distillation tutorial for using vllm with Qwen…

2fb1059

… model

RexBearIU force-pushed the jackyf/docs/distillation branch from 14091ae to 2fb1059 Compare January 19, 2026 14:24

RexBearIU marked this pull request as ready for review January 19, 2026 14:29

RexBearIU requested review from A9isha, RissyRan, bvandermoon, gagika, gobbleturk, jacoguzo, jiangjy1982, richjames0, shralex and vipannalla as code owners January 19, 2026 14:29

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: update knowledge distillation tutorial for using vllm with Qwen model #2960

feat: update knowledge distillation tutorial for using vllm with Qwen model #2960

Uh oh!

RexBearIU commented Jan 16, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

feat: update knowledge distillation tutorial for using vllm with Qwen model #2960

Are you sure you want to change the base?

feat: update knowledge distillation tutorial for using vllm with Qwen model #2960

Uh oh!

Conversation

RexBearIU commented Jan 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Tests

Checklist

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

RexBearIU commented Jan 16, 2026 •

edited

Loading