kubeai-project / kubeai Star 1.1k Code Issues Pull requests Discussions AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports VLMs, LLMs, embeddings, and speech-to-text. kubernetes ai k8s whisper autoscaler openai-api llm vllm faster-whisper ollama vllm-operator ollama-operator inference-operator Updated Dec 15, 2025 Go
arcxteam / kuzco-inference Star 3 Code Issues Pull requests Comprehensive Guide Running Kuzco Inference Training LLM Models AI w/ CPU & Docker - Commit Deployment inference openai api-proxy train-model llm-training llm-inference ollama-client ollama-api kuzco kuzco-cli llama3 inference-operator solana-agent-kit vikey-ai kuzco-inference Updated Dec 5, 2025 HTML