IISuperluminaLII


Pinned

  1. 1-50B_MoEMLALatentLLM (Public)

    ehhh scaling, MoE, CUDA Graphs, gamma all broken. Fixing is a pita

    Python

  2. FlashMLA_Windows_Linux_sm120 (Public)

    Dense Decode, Dense Prefill Fwd, Dense Prefill Bwd; **sparse too hard (no TMEM)** (will clean it up, but it's working nevertheless). Working source for FlashMLA that works on Windows, more specificall…

    C++ 6 1

  3. quantileppo (Public)

    Distributional PPO with Quantile Regression

    Python

  4. windows_faiss_whl (Public)

    whl for FAISS on Windows, for those struggling on Windows since Facebook is lazy creating "AGI"

    2

  5. biLSTM-bert-based-attention-NLP (Public)

    Attention-based bidirectional LSTM for next-word prediction and summarization.

    Python 1

  6. SCGAN-GP (Public)

    Skip-connection GAN with gradient penalty

    Python 1