kernel-tuning

Here are 2 public repositories matching this topic...

massif-01 / vllm_benchmark_block_fp8

Automated Triton w8a8 block FP8 kernel tuning tool for vLLM. Auto-detects model architecture, supports Qwen3-Coder-30B-A3B-Instruct-FP8/DeepSeek-V3/custom models, multi-GPU parallel tuning, and generates optimized kernel configs for quantization.

triton performance-tuning kernel-tuning fp8 vllm

Updated Oct 31, 2025
Python

NikitaZelenskis / LLM-Kernel-Tuner

Star

A package for automated kernel tuning with LLMs.

python framework gpu cuda auto-tuning kernel-tuning llm llm-tools llm-agents

Updated Oct 24, 2025
Python

Improve this page

Add a description, image, and links to the kernel-tuning topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the kernel-tuning topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly