Add Qwen3 from_pretrained, generation example, and DeepseekV3 model support by danikhan632 · Pull Request #92 · google/tunix

danikhan632 · 2025-07-03T15:13:19Z

New from_pretrained() method in tunix.models.qwen3.params:
- Supports loading Qwen3 models directly from the Hugging Face Hub.
- Automatically maps Hugging Face config (AutoConfig) to Tunix's ModelConfig.
- Loads weights using snapshot_download and create_model_from_safe_tensors.

Adds a standalone example script demonstrating:
- Loading Qwen/Qwen3-0.6B from Hugging Face.
- Using AutoTokenizer with chat templating (enable_thinking=True).
- Running inference using Sampler with KV cache and token generation.

wang2yn84 · 2025-10-23T21:52:47Z

Thank you for your PR. The new model you have seems not a deepseek impl. Can you double check?

Adding Qwen3 Flax and Deepseek

04572ed

wang2yn84 self-assigned this Oct 23, 2025

wang2yn84 self-requested a review October 23, 2025 21:52

Provide feedback