Skip to content

Pull requests: NVIDIA/TensorRT-LLM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[None][perf] Remove redundant allreduce
#14974 opened Jun 4, 2026 by mikeiovine Collaborator Loading…
1 task done
[None][infra] Reduce Docker image layer count in release stage
#14972 opened Jun 4, 2026 by tburt-nv Collaborator Loading…
1 task done
[None][chore] Unwaive AutoDeploy accuracy tests
#14971 opened Jun 4, 2026 by bmarimuthu-nv Collaborator Loading…
1 task done
[None][feat] Add PyTorch reset_prefix_cache API api-compatible Accepted LLM API contract change that is backwards-compatible
#14970 opened Jun 4, 2026 by milesial Collaborator Loading…
1 task done
[TRTLLM-13177][doc] Add Nemotron 3 Ultra doc
#14964 opened Jun 4, 2026 by nv-guomingz Collaborator Loading…
1 task done
[None][feat] enable GQA and cross-attention for attn2d
#14961 opened Jun 4, 2026 by NVShreyas Collaborator Loading…
1 task done
[None][test] Add GLM-5 into CI Perf Test
#14960 opened Jun 4, 2026 by chenfeiz0326 Collaborator Loading…
1 task done
[None][fix] add use_remote_kv_events option in kvaware router
#14959 opened Jun 4, 2026 by reasonsolo Collaborator Loading…
1 task done
[None][refactor] split VisualGen pipeline and model configs
#14956 opened Jun 4, 2026 by bobboli Collaborator Loading…
[https://nvbugs/6160629][fix] AutoDeploy: increase rtol for bf16 HF vs FI rope test
#14954 opened Jun 4, 2026 by galagam Collaborator Loading…
1 task done
[None][feat] Sparse-attention behavior-layer framework + V2-migrated RocketKV with chunked prefill api-compatible Accepted LLM API contract change that is backwards-compatible
#14953 opened Jun 4, 2026 by Hudayday Collaborator Loading…
[https://nvbugs/5859886][fix] Remove the waiver
#14948 opened Jun 4, 2026 by ziyixiong-nv Collaborator Loading…
1 task
[None][opt] attn kernel epilogue fuse RopeQuant
#14947 opened Jun 4, 2026 by yunruis Contributor Loading…
1 task done
[None][feat] Support beam search in KV cache manager v2
#14945 opened Jun 4, 2026 by yizhang-nv Member Loading…
1 task done
[None][feat] AutoDeploy: Fix hardcoded configs
#14943 opened Jun 4, 2026 by taylor-yb-lee Collaborator Loading…
1 task done
[TRTLLM-10184][chore] Remove legacy XQA precompiled path
#14941 opened Jun 4, 2026 by pengbowang-nv Collaborator Draft
1 task done
ProTip! Follow long discussions with comments:>50.