Skip to content
View DefTruth's full-sized avatar
🎯
#pragma unroll
🎯
#pragma unroll

Organizations

@vipshop @PaddlePaddle @xlite-dev

Block or report DefTruth

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. xlite-dev/LeetCUDA xlite-dev/LeetCUDA Public

    📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

    Cuda 8.4k 834

  2. xlite-dev/lite.ai.toolkit xlite-dev/lite.ai.toolkit Public

    🛠A lite C++ AI toolkit: 100+ models with MNN, ORT and TRT, including Det, Seg, Stable-Diffusion, Face-Fusion, etc.🎉

    C++ 4.3k 764

  3. xlite-dev/Awesome-LLM-Inference xlite-dev/Awesome-LLM-Inference Public

    📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉

    Python 4.7k 322

  4. vllm-project/vllm vllm-project/vllm Public

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python 62.9k 11.2k

  5. huggingface/diffusers huggingface/diffusers Public

    🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

    Python 31.6k 6.5k

  6. vipshop/cache-dit vipshop/cache-dit Public

    A Unified and Flexible Inference Engine with Hybrid Cache Acceleration and Parallelism for 🤗Diffusers.

    Python 544 20