NVIDIA / TensorRT-LLM Public

Pull requests: NVIDIA/TensorRT-LLM

910 Open 11,537 Closed

Pull requests list

[https://nvbugs/6451032][fix] Reserve extra dflash KV slot for dummy requests

#16548 opened Jul 17, 2026 by amukkara Collaborator

1 task done

[None][chore] update allowlist 2026-07-17

#16547 opened Jul 17, 2026 by tburt-nv Collaborator

1 task done

[None][feat] Raise the CuTE-DSL top-k decode limit to 16384 and support odd top_k

#16546 opened Jul 17, 2026 by Hudayday Collaborator

[https://nvbugs/6438658][fix] Fix KV cache estimation capacity

#16545 opened Jul 17, 2026 by jiaganc Collaborator • Draft

1 task done

[None][feat] Support rejection sampling under attention DP (incl. LM-head TP)

Label: api-compatible (accepted LLM API contract change that is backwards-compatible)

#16544 opened Jul 17, 2026 by zhaoyangwang-nvidia Collaborator

1 task done

[None][feat] Import agent-flow and modeling bringup agent into TensorRT-LLM

#16543 opened Jul 17, 2026 by WeiHaocheng Collaborator

[None][test] remove MiniMax-M2 tp16 multinode eval test case

#16542 opened Jul 17, 2026 by jieli-matrix Collaborator

1 task done

[None][test] Add DeepSeek-V4-Pro perf sanity cases on GB300

#16540 opened Jul 17, 2026 by chenfeiz0326 Collaborator

6 tasks done

[https://nvbugs/6395830][fix] Qwen-VL mRoPE: move seq-slot delta cach…

#16537 opened Jul 17, 2026 by nv-guomingz Collaborator

1 task

[None][perf] Update deepseek integration test configs to use transceiver V2

#16536 opened Jul 17, 2026 by Shixiaowei02 Collaborator • Draft

1 task done

[https://nvbugs/6316983][chore] Unwaive TestQwen2_5_VL_7B::test_auto_dtype

#16534 opened Jul 17, 2026 by yihwang-nv Collaborator

1 task

[https://nvbugs/6467675][fix] Bump ETCD_VER from v3.6.9 to v3.7.0 in docker/common/install_etcd.sh; binary…

#16533 opened Jul 17, 2026 by trtllm-agent Collaborator

2 tasks done

[None][perf] Fuse DeepSeek-V4 Indexer Q projection with CuTe DSL

#16532 opened Jul 17, 2026 by mingyangHao Collaborator

1 task done

[https://nvbugs/6467684][fix] Bump the golang image tag to 1.23 and the license_checker pin to v0.3.1…

#16531 opened Jul 17, 2026 by trtllm-agent Collaborator

2 tasks done

[None][chore] Update flashinfer-python from 0.6.14 to 0.6.15

#16530 opened Jul 17, 2026 by yihwang-nv Collaborator

4 tasks

[https://nvbugs/6467688][fix] Bump the floor to mistune>=3.3.0 and update the WAR comment to reference the…

#16527 opened Jul 17, 2026 by trtllm-agent Collaborator

2 tasks done

[https://nvbugs/6467687][fix] Bump the jupyter_server minimum to >=2.20.0 and update the WAR comment to…

#16526 opened Jul 17, 2026 by trtllm-agent Collaborator

2 tasks done

[https://nvbugs/6467696][fix] Change license_checker@v0.3.0 → license_checker@v0.3.1 on…

#16525 opened Jul 17, 2026 by trtllm-agent Collaborator

2 tasks done

[None][feat] Default GLM-5 to the Python KV-cache transceiver

#16524 opened Jul 17, 2026 by chuangz0 Collaborator

1 task done

[None][feat] serve: multi-process HTTP frontends on the classic IPC executor path

Label: api-compatible (accepted LLM API contract change that is backwards-compatible)

#16523 opened Jul 17, 2026 by lancelly Collaborator

[None][test] Assert MTP acceptance length in ADP + LM-head-TP accuracy tests

#16521 opened Jul 17, 2026 by qiaoxj07 Collaborator

[https://nvbugs/6448152][perf] diagnose 2x admission with PP consensus

#16518 opened Jul 17, 2026 by chienchunhung Collaborator • Draft

[https://nvbugs/6448152][perf] diagnose 2x admission without PP consensus

#16517 opened Jul 17, 2026 by chienchunhung Collaborator • Draft

[https://nvbugs/6163690][fix] Upgrade transformers to 5.10.1

#16516 opened Jul 17, 2026 by pamelap-nvidia Collaborator

1 task done

[https://nvbugs/6120535][fix] Re-enable DeepSeek V3.2 disaggregated serving test

#16515 opened Jul 16, 2026 by peihu-nv Collaborator • Draft

1 task done
