Highlights
- Pro
Pinned Loading
- 1-50B_MoEMLALatentLLM
1-50B_MoEMLALatentLLM Publicehhh scaling, MoE, CUDA Graphs, gamma all broken. Fixing is a pita
Python
- FlashMLA_Windows_Linux_sm120
FlashMLA_Windows_Linux_sm120 PublicDense Decode, Dense Prefill Fwd, Dense Prefill Bwd **sparse too hard (no TMEM)** (will clean it up, but its working nevertheless) Working source for FlashMLA that works on Windows, more specificall…
-
- windows_faiss_whl
windows_faiss_whl Publicwhl for windows faiss, for those struggling in windows since facebook is lazy creating "AGI"
- biLSTM-bert-based-attention-NLP
biLSTM-bert-based-attention-NLP Publicattention based bidirectional LSTM NLP for next word and summarization.
Python 1
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.



