Open-source, accurate and easy-to-use video speech recognition & clipping tool, with LLM-based AI clipping integrated.
Updated Jul 11, 2025 - Python
A multimodal chat interface with many tools.
VLM-driven tool that processes surveillance videos, extracts frames, and generates annotations using a fine-tuned Florence-2 Vision-Language Model. Includes a Gradio-based interface for querying and analyzing video footage.
A chatbot UI for RAG, multimodal, and text completion. (Supports Transformers, llama.cpp, MLX, vLLM.)
AUTOMATIC1111: Software for tensor operations, saving tensor data in .safetensors format. ComfyUI: UI library, possibly managing tensor data safely with *.safetensors. InvokeAI: ML platform using *.safetensors for secure tensor storage.
This project is a multi-agent customer service chatbot designed for an e-commerce platform. The chatbot employs specialized agents that handle distinct tasks to ensure efficient and accurate interactions. It aims to enhance user experience by streamlining order processing, answering FAQs, and providing personalized recommendations.
Play with Orpheus TTS, a Llama-based Speech-LLM designed for high-quality, empathetic text-to-speech generation. This model has been fine-tuned to deliver human-level speech synthesis 🔥🗣️
Gradio app using Gemini to transcribe and summarize audio into Thai governmental format
A journalist that knows lots of news about AI!📰💻
Converse effortlessly in more than 50 languages!
Modular AI tool to convert web content into markdown brochures using OpenRouter LLMs. Fully customizable and deployable via Gradio on Hugging Face Spaces.
Sydney – Multimodal Offline AI for medical guidance. CPU-friendly, powered by Gemma 3 1B, Whisper, Glow-TTS & Granite 47M R2 embeddings. RAG + memory = rapid, accurate, context-aware responses.
Chatbot - Your Personal Culinary Advisor: Discover What to Cook Next!
EduAI is a Python chatbot that enhances programming learning with abundant resources, text-to-speech accessibility, and interactive Q&A, fostering universal programming knowledge access.
Build amazing AI and RAG-powered applications, plain and simple🪂
SAT-Landforms-Classifier is an image classification vision-language encoder model fine-tuned from google/siglip2-base-patch16-224 for a single-label classification task. It is designed to classify satellite images into different landform categories using the SiglipForImageClassification architecture.
A Dify plugin for semantic search across 110 million academic publications, powered by abstracts-search.
Let this AI automatically create a brochure for the company you want. Just give the company website and it'll create the brochure.