Skip to content

Pull requests: NVIDIA/Fuser

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Optimize TMA inner-reduction and add TMA serial-split
#5867 opened Jan 23, 2026 by tbqh Loading…
tests: update exceptions include
#5859 opened Jan 21, 2026 by wujingyue Loading…
Remove Ampere-Turing matmul scheduler
#5836 opened Jan 16, 2026 by naoyam Loading…
avoid warp diverge in warp specialized kernel
#5830 opened Jan 15, 2026 by liqiangxl Loading…
[WIP] Use TensorIndexer by default
#5828 opened Jan 15, 2026 by naoyam Draft
[Build Speed] PCH Test
#5795 opened Jan 11, 2026 by csarofeen Draft
Migrate from pybind11 to nanobind Direct Bindings Python extension with direct mapping to NvFuser CPP objects.
#5780 opened Jan 8, 2026 by rdspring1 Draft
Add a toy multi-GPU benchmark
#5753 opened Jan 3, 2026 by wujingyue Draft
Benchmark for nvfp4 scaled mm
#5737 opened Dec 23, 2025 by protonu Loading…
ProTip! Mix and match filters to narrow down what you’re looking for.