Skip to content

Pull requests: NVIDIA/TensorRT-LLM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[#11312][chore] Make TRT/NCCL configurable in CMake find modules Community want to contribute PRs initiated from Community
#11528 opened Feb 15, 2026 by xuantengh Loading…
1 task done
[None][chore] Remove closed bugs
#11527 opened Feb 15, 2026 by xinhe-nv Draft
[#11398][feat] AutoDeploy: flashinfer rope for GLM4.7-Flash AutoDeploy <NV> AutoDeploy Backend
#11524 opened Feb 14, 2026 by taylor-yb-lee Loading…
1 task done
Glm5 nvfp4 serving
#11522 opened Feb 14, 2026 by pst2154 Loading…
1 task done
[None][fix] Fix test prefix generation for per-sm waives
#11519 opened Feb 13, 2026 by tburt-nv Loading…
1 task
[None][feat] Support spec dec for KV cache manager v2
#11513 opened Feb 13, 2026 by mikeiovine Loading…
1 task done
[Draft] Add support for expert_number<=2048 and K<=32
#11510 opened Feb 13, 2026 by ChristinaZ Loading…
1 task
Fix pp+disagg
#11509 opened Feb 13, 2026 by Tabrizian Draft
1 task
[None][feat] Add Helix CP support for DSV3.2
#11507 opened Feb 13, 2026 by brb-nv Loading…
1 task done
[#2912][feat] Support Cohere Command A model Community want to contribute PRs initiated from Community
#11505 opened Feb 13, 2026 by torotoki Loading…
1 task done
[None][feat] Optimize 6KD fp8 blockscale gemm Community want to contribute PRs initiated from Community
#11502 opened Feb 13, 2026 by CarstyYou Loading…
1 task done
[None][feat] TRT-LLM Gen MoE finalize kernel optimization
#11501 opened Feb 13, 2026 by nekorobov Loading…
1 task done
[None][feat] Fuse shared to sparse experts MoE
#11499 opened Feb 13, 2026 by nekorobov Loading…
1 task done
ProTip! Filter pull requests by the default branch with base:main.