NVIDIA / Model-Optimizer Public

Notifications You must be signed in to change notification settings
Fork 282
Star 2k

Code
Issues 67
Pull requests 96
Actions
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Security
Insights

Pull requests: NVIDIA/Model-Optimizer

Labels 27 Milestones 0

New pull request New

96 Open 507 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

Fix: quant config error on quantized offline eagle

#925 opened Feb 24, 2026 by h-guo18

Loading…

Support decoder block-level sequential calibration

#924 opened Feb 24, 2026 by sugunav14

Loading…

Fix skip softmax defaults

#923 opened Feb 24, 2026 by rohansjoshi

Loading…

Enable multinode training for HF speculative decoding

#922 opened Feb 23, 2026 by yeyu-nvidia

Loading…

Diffusers 2:4 Sparse Attention

#921 opened Feb 23, 2026 by jingyu-ml • Draft

example demonstrating how to train CosmosReason2 Eagle3

#920 opened Feb 23, 2026 by skierat

Loading…

Add Qwen3-VL support to Minitron pruning

#919 opened Feb 23, 2026 by eagle705 • Draft

disable rope scaling for training, add yarn during export

#917 opened Feb 23, 2026 by h-guo18 • Draft

Add 2:4 Sparse Attention

#916 opened Feb 22, 2026 by kaix-nv • Draft

Feat: Speculatice Decoding export with quantization support

#913 opened Feb 21, 2026 by h-guo18

Loading…

Add support for export ComfyUI compatible checkpoint for diffusion model(e.g., LTX-2)

#911 opened Feb 20, 2026 by ynankani

Loading…

Support force tokens to % of total experts during calibration

#910 opened Feb 20, 2026 by cjluo-nv

Loading…

fix

#908 opened Feb 19, 2026 by h-guo18 • Draft

PTQ and QAD with Qwen Image

#905 opened Feb 18, 2026 by AliesTaha

Loading…

Support mbridge distillation for any_model

#904 opened Feb 18, 2026 by danielkorzekwa

Loading…

Enable Qwen3.5-MoE PTQ

#897 opened Feb 16, 2026 by Edwardf0t1 • Draft

Fix DeepSpeed import crash on runtime-only CUDA and improve NVFP4 uncalibrated weight error

#896 opened Feb 16, 2026 by debo3

Loading…

Add Qwen3VL

#895 opened Feb 16, 2026 by hychiang-git

Loading…

gpt-oss 20b support

#889 opened Feb 13, 2026 by chochowski

Loading…

Implicit Gemm NVFP4 on Conv3D

#886 opened Feb 13, 2026 by jingyu-ml

Loading…

Add support for offline speculative decoding model PTQ

#883 opened Feb 12, 2026 by yeyu-nvidia • Draft

update qwen quant

#880 opened Feb 11, 2026 by zhewenl • Draft

Update README.md for DMS (fix cd experimental/DMS to cd Model-Optimizer/experimental/DMS)

#879 opened Feb 10, 2026 by faridlazuarda

Loading…

[OMNIML-2914] Fix export of fused layernorm weights for TE spec

#876 opened Feb 10, 2026 by yueshen2016

Loading…

SpecDec Bench: February Update

#875 opened Feb 10, 2026 by IzzyPutterman

Loading…

Previous 1 2 3 4 Next

Previous Next

ProTip! Follow long discussions with comments:>50.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!