NVIDIA / TensorRT-LLM Public

Fork 2k
Star 12.7k

Code
Issues 518
Pull requests 483
Discussions
Actions
Projects 2
Security
Insights

Pull requests: NVIDIA/TensorRT-LLM

Labels 59 Milestones 1

483 Open 6,790 Closed

Pull requests list

[https://nvbugs/5816267][fix] Remove weight tensor holder to release memory earlier

#10876 opened Jan 21, 2026 by dongxuy04

1 task done

[https://nvbugs/5674665][fix] Fix accuracy drop in VSWA with KV cache block reuse

#10875 opened Jan 21, 2026 by SimengLiu-nv

1 task done

[None][fix] default disable gemm+allreduce fusion (#10656)

#10874 opened Jan 21, 2026 by benzh-2025

[https://nvbugs/5769425][fix] add syncthreads for tinygemm to resolve intermittent accuracy problem

#10873 opened Jan 21, 2026 by dc3671

1 task

[https://nvbugs/5741304][chore] Update flashinfer-python to 0.6.1

#10872 opened Jan 21, 2026 by yihwang-nv

[https://nvbugs/5741304][chore] Update flashinfer-python to 0.6.1

#10871 opened Jan 21, 2026 by yihwang-nv

[https://nvbugs/5740377][fix] Prevent out-of-bounds read

#10868 opened Jan 21, 2026 by HuiGao-NV

1 task done

[None][feat] AutoDeploy: Flashinfer kernels bringup

#10867 opened Jan 21, 2026 by nvchenghaoz

1 task

[https://nvbugs/5821433][fix] fix test_auto_scaling for 2 GPUs

#10866 opened Jan 21, 2026 by reasonsolo

1 task done

[None][fix] Fix PD disaggregation for VLMs that use mrope

#10865 opened Jan 21, 2026 by 2ez4bz

1 task done

[None][chore] Measure total time of AutoDeploy transforms

#10864 opened Jan 20, 2026 by taylor-yb-lee • Draft

1 task

[None][fix] Enable offline mode for HF models

#10863 opened Jan 20, 2026 by FrankD412

1 task done

[None][feat] Replace KV cache search structure with separate radix tree

#10862 opened Jan 20, 2026 by thorjohnsen • Draft

1 task

[TRTLLM-10319][feat] Dynamic draft length on spec decode one-model path

#10860 opened Jan 20, 2026 by zheyuf • Draft

1 task

[https://nvbugs/5791242][fix] add changes from PR 10713 (WAR for flashinfer.sampling.sampling_from_logits)

#10859 opened Jan 20, 2026 by v-shobhit • Draft

1 task

[https://nvbugs/5779536][fix] Unwaive Llama 3.3 related multi GPU tests

#10855 opened Jan 20, 2026 by pengbowang-nv • Draft

1 task

[https://nvbugs/5769815][fix] Fix offset calculation in _are_stop_words when using speculative decoding

#10854 opened Jan 20, 2026 by stnie • Draft

1 task

[https://nvbugs/5688721][fix] unwaive NemotronH accuracy test

#10852 opened Jan 20, 2026 by lucaslie

1 task done

[https://nvbugs/5769712][fix] fix timeout in AutoDeploy llama accuracy test (#10461)

#10851 opened Jan 20, 2026 by lucaslie

1 task done

[https://nvbugs/5814247][fix] AutoDeploy: skip mxfp4_moe test unless on Hopper (#10729)

#10850 opened Jan 20, 2026 by lucaslie

1 task done

[None][chore] added AutoDeploy nano_v3_scale.yaml

#10845 opened Jan 20, 2026 by MrGeva

1 task done

[None][fix] Update RMSNorm custom op plumbing

#10843 opened Jan 20, 2026 by JintaoPengCS

1 task done

[https://nvbugs/5800646][fix] Fix hang issue by avoid exposing UB buf…

#10842 opened Jan 20, 2026 by liji-nv

1 task done

[https://nvbugs/5636916][fix] Cherry-pick #10654: Fix accuracy issue of TWO-SHOT AllReduce kernel

#10841 opened Jan 20, 2026 by hyukn

1 task done

[None][fix] Proper conditional compilation of sm10x cubins

#10839 opened Jan 20, 2026 by tongyuantongyu • Draft

1 task
