Pinned Loading
- Awesome-LLM-Inference
Awesome-LLM-Inference PublicForked from xlite-dev/Awesome-LLM-Inference
📖A curated list of Awesome LLM/VLM Inference Papers with codes, such as FlashAttention, PagedAttention, Parallelism, etc. 🎉🎉
-
- whisper.cpp
whisper.cpp PublicForked from ggml-org/whisper.cpp
Port of OpenAI's Whisper model in C/C++
C++
-
- open-thoughts
open-thoughts PublicForked from open-thoughts/open-thoughts
Open Thoughts: Fully Open Data Curation for Thinking Models
Python
- sherpa-onnx
sherpa-onnx PublicForked from k2-fsa/sherpa-onnx
Speech-to-text, text-to-speech, and speaker recongition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 serve…
C++
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.



