On-device voice assistant platform powered by deep learning
- Updated
Apr 11, 2025 - Python
On-device voice assistant platform powered by deep learning
S.T.A.R.K. - Speech And Text Algorithmic Recognition Kit
S.T.A.R.K. Platform Library and Community Extensions
Control Spot legged robot with audio, build semantic navigation maps and support visual question answering
Voice Interface Driver for Google Assistant
A modular Python library for voice interactions with AI systems, featuring high-quality TTS, STT with Whisper, and memory persistence.
Saarthi is an AI-powered, voice-first assistant that helps citizens discover and understand government schemes with face authentication, secure PII handling, and a Streamlit UI powered by LangGraph and local LLMs.
Voice-driven ontology builder. Say “command …” then a sentence (e.g., “the car has four wheels”). It transcribes and parses to OWL (RDF/XML): classes, has-relations with cardinalities, and part_of inverse. View and download the OWL.
This project is a Python-based conversational AI chatbot that allows voice-based interactions using speech recognition (for input) and text-to-speech (for output). It uses a pre-trained model (DialoGPT) and can be fine-tuned on custom datasets using training.py.
🤖 AI Chatbot with Voice Interface - A Flask web app featuring Groq-powered chat, voice input/output, and theme support. Combines natural language processing with speech synthesis for an interactive chat experience. #Python #Flask #AI #VoiceInterface
AI-Powered Sales Pipeline Analytics with Multi-Agent Architecture and Voice Interface
🛠️ Site Reporter MVP turns live French chantier voice notes into structured reports by pairing Azure GPT‑4o-mini-transcribe speech-to-text with a Mistral LLM extractor. The FastAPI backend orchestrates transcription → template inference → report drafting, while a Streamlit UI gives supervisors either a human-in-loop or fully automatic workflow.
Advanced AI Agent for Windows & Web – Modular, Voice-Enabled, Multi-LLM Orchestration Sistem agen AI otonom yang dirancang untuk menjalankan tugas kompleks di desktop dan web, dengan dukungan suara, kontrol GUI, dan integrasi LLM multi-provider. Mendukung automasi Office, voice interface, dan pengambilan keputusan berbasis refleksi mandiri.
Collaborative AI research‑fabrication OS: a swarm‑based platform where specialized agents (routed across local and cloud models) cooperate to decompose, research and synthesize complex tasks while orchestrating 3D‑printing, CNC and workshop ops. Features dynamic model routing, hierarchical research synthesis, agent registries and slot management.
🎤 Transform spoken phrases into OWL ontologies, making it easy to create structured data from voice. Ideal for developers and researchers alike.
Add a description, image, and links to the voice-interface topic page so that developers can more easily learn about it.
To associate your repository with the voice-interface topic, visit your repo's landing page and select "manage topics."