Pull requests: NVIDIA/Model-Optimizer

Pull requests list

Fix: quant config error on quantized offline eagle
#925 opened Feb 24, 2026 by h-guo18
Fix skip softmax defaults
#923 opened Feb 24, 2026 by rohansjoshi
Diffusers 2:4 Sparse Attention
#921 opened Feb 23, 2026 by jingyu-ml Draft
Add 2:4 Sparse Attention
#916 opened Feb 22, 2026 by kaix-nv Draft
fix
#908 opened Feb 19, 2026 by h-guo18 Draft
PTQ and QAD with Qwen Image
#905 opened Feb 18, 2026 by AliesTaha
Support mbridge distillation for any_model
#904 opened Feb 18, 2026 by danielkorzekwa
Enable Qwen3.5-MoE PTQ
#897 opened Feb 16, 2026 by Edwardf0t1 Draft
Add Qwen3VL
#895 opened Feb 16, 2026 by hychiang-git
gpt-oss 20b support
#889 opened Feb 13, 2026 by chochowski
Implicit Gemm NVFP4 on Conv3D
#886 opened Feb 13, 2026 by jingyu-ml
update qwen quant
#880 opened Feb 11, 2026 by zhewenl Draft
SpecDec Bench: February Update
#875 opened Feb 10, 2026 by IzzyPutterman