Add local attention mask support to CUTLASS Blackwell decode op #58
Summary:
Extends the xformers CUTLASS Blackwell FwOpDecode operator to support BlockDiagonalLocalAttentionPaddedKeysMask, enabling sliding-window attention for decode workloads. This builds on the gen kernel's sliding-window implementation to provide end-to-end local attention support through the xformers API (see the usage sketch below).
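A minimal usage sketch of the decode path, assuming the new mask's `from_seqlens` signature mirrors the other padded-keys masks (`q_seqlen` / `kv_padding` / `kv_seqlen`) and that the window width is passed as `window_size`; both the parameter name and tensor layouts are assumptions to check against the mask definition in this diff:

```python
# Hedged sketch (not taken from the diff): from_seqlens arguments and
# window_size are assumed; verify against the mask's actual definition.
import torch
from xformers.ops import fmha

B, H, D = 3, 8, 128        # sequences, heads, head dim
kv_padding = 256           # padded KV length per sequence
kv_seqlen = [17, 42, 128]  # actual KV lengths per sequence

# Decode: one query token per sequence; keys/values live in padded buffers.
q = torch.randn(1, B, H, D, device="cuda", dtype=torch.bfloat16)
k = torch.randn(1, B * kv_padding, H, D, device="cuda", dtype=torch.bfloat16)
v = torch.randn(1, B * kv_padding, H, D, device="cuda", dtype=torch.bfloat16)

attn_bias = fmha.attn_bias.BlockDiagonalLocalAttentionPaddedKeysMask.from_seqlens(
    q_seqlen=[1] * B,
    kv_padding=kv_padding,
    kv_seqlen=kv_seqlen,
    window_size=64,  # assumed name for the sliding-window width
)

# The dispatcher should select the Blackwell decode op for this mask; it can
# also be pinned explicitly via op=(FwOpDecode, None) using this PR's import path.
out = fmha.memory_efficient_attention(q, k, v, attn_bias=attn_bias)
```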
Also, for consistent output shapes, this changes the default split_k_size in the interface from 1024 to 0, which disables split-K.
TODO: Add a merge_attentions step to the CUTLASS op when the number of splits is > 1 (a merge sketch follows below).
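For reference, the standard split-K merge combines per-split partial outputs using their log-sum-exps: with per-split LSE l_i and partial output o_i, the merged result is sum_i o_i * exp(l_i - L) where L = log sum_i exp(l_i). A minimal PyTorch sketch of that reduction follows; the tensor layouts are assumptions, and xformers' existing fmha.merge_attentions helper covers the same step for other ops:

```python
import torch

def merge_partial_attention(outs: torch.Tensor, lses: torch.Tensor) -> torch.Tensor:
    """Merge split-K partial attention outputs (layouts are assumed).

    outs: [num_splits, B, M, H, K] partial attention outputs
    lses: [num_splits, B, H, M] per-split log-sum-exp of the softmax logits
    """
    # Global LSE across splits, computed stably via the running max.
    lse_max = lses.max(dim=0, keepdim=True).values
    lse = lse_max + (lses - lse_max).exp().sum(dim=0, keepdim=True).log()
    # Per-split renormalization weights: exp(l_i - L).
    weights = (lses - lse).exp()                   # [S, B, H, M]
    w = weights.permute(0, 1, 3, 2).unsqueeze(-1)  # [S, B, M, H, 1]
    return (w * outs).sum(dim=0)                   # [B, M, H, K]
```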
Differential Revision: D89192917