Pull requests: flashinfer-ai/flashinfer
- chore: export compile commands for better IDE integration (#2253, opened Dec 20, 2025 by yzh119)
- chore: add __all__ exports to Python modules and document missing APIs (#2251, opened Dec 20, 2025 by yzh119)
- bugfix: skip CUTLASS kernel generation when AOT cache exists (#2248, opened Dec 19, 2025 by yongwww)
- feat: Support numLocalTokens=0 for moe All-to-all (#2247, opened Dec 19, 2025 by trevor-m)
- misc: Add runtime validation for plan/run consistency in BatchMLAPagedAttentionWrapper (#2246, opened Dec 19, 2025 by bkryu)
- refactor: pull trtllm-gen batch-gemm/gemm headers from artifactory; update tma descriptor shape init (#2235, opened Dec 17, 2025 by jimmyzho)
- cicd/testing: Add xfails tracker script (#2227, opened Dec 16, 2025 by kahyunnam)
- Fix: Add mask_indptr conversion in BatchPrefillWithPagedKVCacheWrapper.plan() (#2201, opened Dec 11, 2025 by Dutch-voyage)
- Add CUDA graph buffers for persistent attention (#2185, opened Dec 7, 2025 by Edenzzzz)
- [Flashinfer-Bench integration] HF end-to-end inference (#2151, opened Nov 30, 2025 by sfc-gh-goliaro, draft)
- Enable Hopper FA3 FP8 attention in decode.py (#2148, opened Nov 28, 2025 by nvpohanh)
- feat: BF16 GEMM using CUTLASS backend for SM100 (#2070, opened Nov 10, 2025 by raayandhar)
- Refactor flashinfer/__init__.py so that applications can selectively pack submodules without modifying __init__.py (#2027, opened Nov 3, 2025 by bangshengtang)
- chore: agentic workflow for automatic version bump (#1947, opened Oct 19, 2025 by yzh119)