Pull requests: flash-algo/flash-sparse-attention
- [BUG FIX] Correct causal mask handling for longer KV pairs (#213 by LoserCheems, merged Dec 2, 2025)
- [BUG FIX] Unify masking utilities and improve performance (#209 by LoserCheems, merged Nov 13, 2025)
- Fix documentation and references for Flash Sparse Attention (#207 by LoserCheems, merged Nov 9, 2025)
- [FEATURE SUPPORT] Triton special compact dynamic-mask attention: 1.6× faster fwd+bwd, numerically equivalent (#206 by LoserCheems, merged Nov 7, 2025)
- Refactor attention block smoothing for consistency (#205 by LoserCheems, merged Nov 6, 2025)
- [FEATURE SUPPORT] Move scaling out of streaming loops, bias-initialized acc_s, and fix dQ double-scaling (#203 by LoserCheems, merged Nov 4, 2025)
- Optimize Triton version: GQA, mask/bias broadcasting, skip inactive tiles, and stability fixes (#200 by LoserCheems, merged Nov 7, 2025)
- Fix attention bias calculation and dbias handling (#199 by LoserCheems, merged Oct 27, 2025)
- Update documentation to use mask utility in examples (#198 by LoserCheems, merged Oct 23, 2025)
- [FEATURE SUPPORT] Centralize dynamic mask creation for FDMA (#197 by LoserCheems, merged Oct 23, 2025)
- [FEATURE SUPPORT] Robust dBias accumulation for seqlen_q_bias == 1 (#194 by LoserCheems, merged Oct 22, 2025)
- Enhance bias gradient accumulation in backward pass (#193 by LoserCheems, merged Oct 16, 2025)
- Fix attention_mask and attention_bias shape descriptions and remove redundant checks (#192 by LoserCheems, merged Oct 13, 2025)
- Refactor bias initialization and enhance bias computation in FlashDMAttnFunc (#191 by LoserCheems, merged Oct 12, 2025)
- [FEATURE SUPPORT] Broadcastable 4D mask/bias, 128-rounded key length, stride-0 broadcasting, and dbias reductions (#190 by LoserCheems, merged Oct 12, 2025)
- [FEATURE SUPPORT] Variable-Length Attention with Padding-Free Execution (#188 by LoserCheems, merged Oct 11, 2025)
- Implement variable-length attention with mask and bias support (#185 by LoserCheems, merged Oct 9, 2025)
- [BUG FIX] Fix mask/bias memory access and vectorization issues in kernels (#182 by LoserCheems, merged Oct 1, 2025)
- [BUG FIX] SM80 NaN in bias.grad when both mask and bias are enabled (#179 by LoserCheems, merged Sep 22, 2025)
- Refactor attention mask and bias handling for efficiency (#177 by LoserCheems, merged Sep 21, 2025)