Skip to content

Pull requests: NVIDIA/cutlass

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Fix example imports and pytest imports
#3230 opened May 12, 2026 by depaulmillz Contributor Loading…
W4a8 speedup v2
#3226 opened May 11, 2026 by mak-corp Loading…
Avoid unordered_map for runtime datatype mapping
#3223 opened May 11, 2026 by LwhJesse Loading…
FMHA examples: use cute::min in device functions
#3222 opened May 11, 2026 by LwhJesse Loading…
[examples][CuTeDSL] add MoE dispatch+combine example with NVSHMEM
#3221 opened May 11, 2026 by shubaoyu2 Contributor Loading…
Add Hopper FP8 grouped blockwise GEMM (sparse-groups) CuTeDSL example
#3195 opened Apr 29, 2026 by Johnsonms Contributor Draft
7 tasks done
Add Hopper FP8 grouped blockwise GEMM CuTeDSL example
#3194 opened Apr 29, 2026 by Johnsonms Contributor Draft
5 tasks done
Add Hopper FP8 groupwise GEMM CuTeDSL example
#3193 opened Apr 29, 2026 by Johnsonms Contributor Draft
5 tasks done
Add Hopper FP8 blockwise GEMM CuTeDSL example
#3192 opened Apr 29, 2026 by Johnsonms Contributor Draft
5 tasks done
Update CuTe DSL JAX tutorial
#3188 opened Apr 28, 2026 by katjasrz Contributor Loading…
Fix Thor MLA decode arch dispatch
#3173 opened Apr 17, 2026 by iloveai8086 Loading…
WIP: OSS CI Testing for v4.5
#3171 opened Apr 16, 2026 by zekunf-nv Collaborator Loading…
[CuTeDSL][fix]: 1d bias epilogue fix inactive-30d
#3157 opened Apr 9, 2026 by leevan Loading…
[CuTeDSL] Fix dynamic shape args not passed to JIT kernel
#3148 opened Apr 5, 2026 by Flink-ddd Contributor Loading…
ProTip! What’s not been updated in a month: updated:<2026-04-13.