Add GLM4_MOE model support #952

vvvdwbvvv · 2025-11-25T06:07:56Z

Summary

This PR adds support for GLM4.5 (GLM-4 MOE) models to the Liger Kernel #951
https://huggingface.co/zai-org/GLM-4.5 which share the same structure as GLM 4.6

Testing Done

For the convergence test on fp32, model size can easily leads to OOM, initially I was using 4090 to run the tests, however only fp32 encounters OOM, so I move forward to L40S to finish all the tests.

Hardware Type:
run make test to ensure correctness
run make checkstyle to ensure code style
run make test-convergence to ensure convergence

… mapping

…ed parameters

…nce_for_glm4_moe

kashif · 2025-11-25T10:46:00Z

src/liger_kernel/transformers/model/glm4_moe.py

+        skip_logits = self.training and (labels is not None or shift_labels is not None)
+
+    if skip_logits:
+        loss = LigerForCausalLMLoss(


kindly have a look at the other model examples and adapt to new API that returns the metric

vvvdwbvvv · 2025-11-25T12:56:01Z

Fixed in 5af9d16

vvvdwbvvv added 6 commits November 25, 2025 10:54

[GLM4MOE] Add support for Liger kernel patches in GLM-4MOE models

26487c2

[GLM4MOE] Formatting functions

dac15e9

Rename function for GLM-4MOE kernel application and update model type…

14cfb90

… mapping

Refactor lce_forward function: update return type and remove deprecat…

973e418

…ed parameters

Fix import path for Glm4MoeConfig in test_apply_liger_kernel_to_insta…

39e7d18

…nce_for_glm4_moe

fix tests

ca27242

kashif reviewed Nov 25, 2025

View reviewed changes

modify to adapt to new API

5af9d16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add GLM4_MOE model support #952

Add GLM4_MOE model support #952

Uh oh!

vvvdwbvvv commented Nov 25, 2025

Uh oh!

kashif Nov 25, 2025

Uh oh!

vvvdwbvvv commented Nov 25, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Add GLM4_MOE model support #952

Are you sure you want to change the base?

Add GLM4_MOE model support #952

Uh oh!

Conversation

vvvdwbvvv commented Nov 25, 2025

Summary

Testing Done

Uh oh!

kashif Nov 25, 2025

Choose a reason for hiding this comment

Uh oh!

vvvdwbvvv commented Nov 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

vvvdwbvvv commented Nov 25, 2025 •

edited

Loading