-
Notifications
You must be signed in to change notification settings - Fork 2.4k
Pull requests: NVIDIA/TensorRT-LLM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[TRTLLM-12430][tests] Add video E2E test for nano v3 omni
#13883
opened May 8, 2026 by
2ez4bz
Collaborator
Loading…
1 task done
[None][test] Add DSR1 B200 DISAGG to CI Perf Test
#13882
opened May 8, 2026 by
chenfeiz0326
Collaborator
Loading…
1 task done
[https://nvbugs/6157131][ci] Waive flaky AutoDeploy accuracy test
#13881
opened May 8, 2026 by
2ez4bz
Collaborator
Loading…
1 task done
[https://nvbugs/6108995][fix] Fix workspace size calculation for fmha_bmm1_scale_size with FP8ContextMLA
#13880
opened May 8, 2026 by
pengbowang-nv
Collaborator
Loading…
1 task done
[None][fix] Skip calibration scalars in initialize_dummy_weights
#13879
opened May 8, 2026 by
shikicloud
Loading…
[https://nvbugs/6095421][chore] Unwaive 1 failed test
#13877
opened May 8, 2026 by
heyuhhh
Collaborator
Loading…
1 task
[None][test] Add DeepSeek-V4 dis-agg CI coverage (b200/b300)
#13874
opened May 8, 2026 by
Shixiaowei02
Collaborator
•
Draft
1 task
[TRTLLM-12503][feat] Parallel VAE independent scaling and fix arg passing
#13873
opened May 8, 2026 by
NVShreyas
Collaborator
Loading…
1 task done
[None][feat] add /start_profile and /stop_profile endpoints to trtllm…
#13872
opened May 8, 2026 by
JunyiXu-nv
Collaborator
•
Draft
1 task
[TRTLLM-12339][feat] Support T5 encoder-decoder models in the PyTorch backend
#13870
opened May 7, 2026 by
cascade812
Collaborator
Loading…
1 task done
[TRTLLM-12529][feat] Graceful exit when lora unsupported
#13869
opened May 7, 2026 by
brb-nv
Collaborator
Loading…
1 task done
[https://nvbugs/3140325][fix] 6140325 test time out
#13868
opened May 7, 2026 by
chienchunhung
Collaborator
•
Draft
1 task done
[#13816][feat] AutoDeploy: Optimize GPT-OSS-120b perf
#13867
opened May 7, 2026 by
taylor-yb-lee
Collaborator
•
Draft
1 task
[None][feat] support EPD video item runs
#13864
opened May 7, 2026 by
venkywonka
Collaborator
•
Draft
[TRTLLMINF-54][infra] Migrate typed-exception classifier to shared library
#13863
opened May 7, 2026 by
dpitman-nvda
Collaborator
Loading…
1 task done
[None][chore] Unwaive stale autodeploy waives
#13862
opened May 7, 2026 by
galagam
Collaborator
Loading…
1 task done
[None][chore] AutoDeploy: Simplify user facing model configs and auto-generated transforms documentation
#13860
opened May 7, 2026 by
bmarimuthu-nv
Collaborator
Loading…
1 task done
[None][feat] AutoDeploy push the rope buffer to later stage
#13859
opened May 7, 2026 by
nvchenghaoz
Collaborator
Loading…
[TRTLLM-12399][fix] Fix KV cache adaptive ratio sampling
#13857
opened May 7, 2026 by
lowsfer
Member
Loading…
1 task
[https://nvbugs/6130334][fix] Fix index error of shared expert when loading weights
#13856
opened May 7, 2026 by
shuyixiong
Collaborator
Loading…
1 task
[https://nvbugs/6058251][fix] Resolve top-level model_type for composite HF configs
#13855
opened May 7, 2026 by
tensorrt-cicd
Collaborator
Loading…
2 tasks done
[https://nvbugs/6143787][fix] Add
kv_cache_config = KvCacheConfig(free_gpu_memory_fraction=0.6) to TestQwen3
#13852
opened May 7, 2026 by
tensorrt-cicd
Collaborator
Loading…
2 tasks done
Previous Next
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.