Skip to content

Issues: NVIDIA/TensorRT-Model-Optimizer

Beta
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

Cannot serve modelopt quantized nvfp4 model on TensorRT LLM bug Something isn't working
#187 opened Apr 27, 2025 by enisaras
Support more Quantization methods for "onnx_ptq"? feature request New feature or request
#184 opened Apr 24, 2025 by s101010tw
[BUG] Issue processing NF4 double quantization bug Something isn't working
#183 opened Apr 22, 2025 by ishan-modi
QAT weight load error bug Something isn't working
#180 opened Apr 18, 2025 by white-wolf-tech
int4 quantization output onnx does not load bug Something isn't working
#156 opened Mar 13, 2025 by thejaswi01
pi0 support?
#151 opened Mar 9, 2025 by johnnynunez
Not support torch.compile() ?
#145 opened Mar 5, 2025 by Vieeo
ProTip! Exclude everything labeled bug with -label:bug.