-
Notifications
You must be signed in to change notification settings - Fork 1.8k
Pull requests: NVIDIA/cutlass
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Fix example imports and pytest imports
#3230
opened May 12, 2026 by
depaulmillz
Contributor
Loading…
Fix MSVC CUDA build: is_unsigned_v not available in cutlass::platform
#3229
opened May 12, 2026 by
TxsharDev
Loading…
[example Cute C++]Add CuTe C++ tutorial for Blackwell MXFP8 block-scaled GEMM.
#3225
opened May 11, 2026 by
haowen-han
Loading…
[examples][CuTeDSL] add MoE dispatch+combine example with NVSHMEM
#3221
opened May 11, 2026 by
shubaoyu2
Contributor
Loading…
[CuTe][Fix]: Add missing template specialization for F8F6F4 MMA Op (#3207)
#3209
opened May 7, 2026 by
infinitron
Loading…
[CuTeDSL] Make editable installs use exact runtime companion wheels
#3204
opened May 5, 2026 by
alecco
Loading…
[CuTe][SM70] Add comment clarifying signed cast requirement for blockIdx coords
#3203
opened May 2, 2026 by
Flink-ddd
Contributor
Loading…
[CuTe] [Fix] MSVC's inability to deduce a non-type parameter pack from a dependent template alias
#3198
opened Apr 30, 2026 by
SystemPanic
Contributor
Loading…
Add high-performance FP16 GEMM kernel with TMA for Blackwell(SM120a)
#3187
opened Apr 26, 2026 by
FY-26
Loading…
doc: fix stride typo in layout algebra composition example
#3186
opened Apr 26, 2026 by
leonardHONG
Loading…
Fix CuTe composition stride-divisibility check (#3177)
#3181
opened Apr 21, 2026 by
jduprat
Loading…
[CuTeDSL] Fix random_ and normal_ ops to support torch.compile fullgraph
#3175
opened Apr 19, 2026 by
Flink-ddd
Contributor
Loading…
Fix incorrect example paths in CuTeDSL docstrings
inactive-30d
#3151
opened Apr 6, 2026 by
Weili-0234
Loading…
[CuTeDSL] Fix dynamic shape args not passed to JIT kernel
#3148
opened Apr 5, 2026 by
Flink-ddd
Contributor
Loading…
Previous Next
ProTip!
What’s not been updated in a month: updated:<2026-04-13.