-
Notifications
You must be signed in to change notification settings - Fork 2.1k
Pull requests: NVIDIA/TensorRT-LLM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[https://nvbugs/5814504][fix] Add skip_pre_hopper flag on NVILA & Nano V2 VLMs
#11275
opened Feb 4, 2026 by
yechank-nvidia
Loading…
[https://nvbugs/5845769][fix] B300 support on VLMs
#11274
opened Feb 4, 2026 by
yechank-nvidia
Loading…
[None][feat] Optimize super-v3 nvfp4 for better perf
#11273
opened Feb 4, 2026 by
Wanli-Jiang
•
Draft
1 task done
[https://nvbugs/5612438][fix] add timeout 14400 for SeedOSS
#11269
opened Feb 4, 2026 by
zhhuang-nv
Loading…
1 task done
AutoDeploy trtllm attention backend with trtllm's kv cache manager direct operation
#11268
opened Feb 4, 2026 by
MrGeva
Loading…
1 task
[TRTLLM-9771][feat] Make refit compatible with CUDA Graph
#11267
opened Feb 4, 2026 by
shuyixiong
•
Draft
1 task
[https://nvbugs/5848377][fix] fix deepeplowlatency with trtllm moe backend running fp8 DS_R1
#11266
opened Feb 4, 2026 by
leslie-fang25
Loading…
1 task done
[TRTLLM-10858][feat] Multi-image support for EPD disagg
#11264
opened Feb 4, 2026 by
2ez4bz
Loading…
1 task done
[None][fix] Fix possible arithmetic overflow in FMHAv2 launcher
#11263
opened Feb 4, 2026 by
tongyuantongyu
Loading…
1 task done
[refactor] Cache CUDA_LAUNCH_BLOCKING with lazy init
#11261
opened Feb 4, 2026 by
hnover-nv
Loading…
[#11234][test] Move test_ad_export_onnx to integration examples
#11260
opened Feb 4, 2026 by
nvyocox
Loading…
7 tasks done
[https://nvbugs/5820874][fix] Adjust deepgemm tuning buckets to cover larger num_tokens's scope
#11259
opened Feb 4, 2026 by
chenfeiz0326
Loading…
1 task done
[None][fix] Fix amax to avoid NaN issue in fp8_blockscale_gemm_kernel.
#11256
opened Feb 4, 2026 by
yuxianq
Loading…
1 task done
[None][chore] Resolve a conflict in the md file
#11255
opened Feb 4, 2026 by
ziyixiong-nv
Loading…
1 task
[https://nvbugs/5863464][fix] Fix port binding conflict in disagg benchmarking script
#11253
opened Feb 4, 2026 by
brb-nv
Loading…
1 task done
[https://nvbugs/5863443][fix] Fix message truncation in Helix CP cache transmission
#11252
opened Feb 4, 2026 by
brb-nv
Loading…
1 task done
[None][infra] Use frontend dgx-h100 and b200 slurm platforms
#11251
opened Feb 4, 2026 by
mlefeb01
Loading…
1 task
[DRAFT][#11203][feat] Optimize weight shape extraction
#11250
opened Feb 4, 2026 by
taylor-yb-lee
•
Draft
1 task
[https://nvbugs/5800679][fix] Re-enable test after bug fixed
#11249
opened Feb 4, 2026 by
dongfengy
Loading…
1 task done
[https://nvbugs/5849697][fix] Refine QA Test List for SM120
#11248
opened Feb 4, 2026 by
dongfengy
Loading…
1 task done
Previous Next
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.