Skip to content

Pull requests: NVIDIA/cutlass

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[CLI] Update FMHA & improve perf
#3251 opened May 20, 2026 by keithzzzzz Contributor Loading…
Filter SM120 mixed 8-bit tiles for FP6 ElementD
#3247 opened May 19, 2026 by zhils Loading…
fix an intermittent accuracy isse
#3233 opened May 15, 2026 by dishengbin Loading…
Fix example imports and pytest imports
#3230 opened May 12, 2026 by depaulmillz Contributor Loading…
W4a8 speedup v2
#3226 opened May 11, 2026 by mak-corp Loading…
Avoid unordered_map for runtime datatype mapping
#3223 opened May 11, 2026 by LwhJesse Loading…
FMHA examples: use cute::min in device functions
#3222 opened May 11, 2026 by LwhJesse Loading…
[examples][CuTeDSL] add MoE dispatch+combine example with NVSHMEM
#3221 opened May 11, 2026 by shubaoyu2 Contributor Loading…
Add Hopper FP8 grouped blockwise GEMM (sparse-groups) CuTeDSL example
#3195 opened Apr 29, 2026 by Johnsonms Contributor Draft
7 tasks done
Add Hopper FP8 grouped blockwise GEMM CuTeDSL example
#3194 opened Apr 29, 2026 by Johnsonms Contributor Draft
5 tasks done
Add Hopper FP8 groupwise GEMM CuTeDSL example
#3193 opened Apr 29, 2026 by Johnsonms Contributor Draft
5 tasks done
Add Hopper FP8 blockwise GEMM CuTeDSL example
#3192 opened Apr 29, 2026 by Johnsonms Contributor Draft
5 tasks done
Update CuTe DSL JAX tutorial
#3188 opened Apr 28, 2026 by katjasrz Contributor Loading…
ProTip! What’s not been updated in a month: updated:<2026-04-20.