Run large language models like Qwen and LLaMA locally on Android for offline, private, real-time question answering and chat, powered by ONNX Runtime.
android chatbot android-app on-device-ai mobile-ai onnx-runtime huggingface-tokenizers local-llm qwen llama3 local-llm-integration offline-inference
Updated Sep 9, 2025 - Kotlin