forked from NVIDIA/cutlass
-
Notifications
You must be signed in to change notification settings - Fork 32
Pull requests: codeplaysoftware/cutlass-sycl
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Simplify Flash Attention Decode benchmarks generation
#437
opened Jun 19, 2025 by
muhammad-tanvir-1211
Loading…
Unify interface for Flash Attention Decode
#423
opened Jun 11, 2025 by
muhammad-tanvir-1211
Loading…
support different scale/zero data types (int8, bf16, fp16) for mixed input mma
release
#420
opened Jun 11, 2025 by
taozha2
Loading…
Adding Fp8 input support for flash attention prefill
release
#419
opened Jun 11, 2025 by
mehdi-goli
Loading…
Add tests and benchmark configurations for BF16 | FP16 output for Flash Decode
#408
opened Jun 5, 2025 by
muhammad-tanvir-1211
Loading…
RFC: test out new syntax for launch with type deduction
#305
opened Apr 12, 2025 by
rolandschulz
Loading…
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.