vlm

NxtGenLegend / TreeHacks-ZoneOut

#3 Winner of Best Use of Zoom API at Stanford TreeHacks 2025! An AI-powered meeting assistant that captures video, audio, and textual context from Zoom calls using multimodal RAG.

Updated Feb 16, 2025
JavaScript

safouaneelg / Talk2FastVLM

Flask app to interact with FastVLM using CPU only

apple cpu chatbot captioning-images vlm onnx talkbot fastvlm

Updated Sep 28, 2025
JavaScript

hamedR96 / User-VLM

Personalized Vision Language Models for Social Human-Robot Interactions

robotics vlm llm

Updated Nov 10, 2025
JavaScript

elloza / slides2video-pinokio-script

Pinokio script for installing the app slides2video

video translation slides tts ia vlm pinokio

Updated Jun 19, 2025
JavaScript

6Morpheus6 / bagel

[NVIDIA ONLY] [RTX 50 Support] Image generation, image editing and free-form manipulation with a VLM (Minimum Requirements 12GB VRAM / 32GB RAM, Recommended Requirements 24GB VRAM / 48GB RAM)

image-editing image-manipulation vlm pinokio

Updated Nov 5, 2025
JavaScript

alessioborgi / RealTime-VLM

RealTime-VLM brings real-time VLM inference to the browser. It continuously captures webcam frames, sends image+text to an OpenAI-compatible API, and displays responses with sub-second latency. Works with local or hosted VLMs.

computer-vision vlm vision-language-model visual-language-models real-time-vlm

Updated Aug 11, 2025
JavaScript

DestroyerDarkNess / fastvlm-webgpu

Real-time video captioning powered by FastVLM

transformers vanilla-js webgpu vlm vision-language-model fastvlm

Updated Nov 26, 2025
JavaScript

turningpoint-ai / MOSSBench

This is the official implementation (code, data) of the paper "MOSSBench: Is Your Multimodal Language Model Oversensitive to Safe Queries?"

vlm mllm oversensitivity safety-alignment turningpoint-ai

Updated Nov 16, 2024
JavaScript

6Morpheus6 / BAGEL-DFloat11

[NVIDIA ONLY] Image generation, image editing and free-form manipulation with a VLM (Minimum Requirements 12GB VRAM / 32GB RAM, Recommended Requirements 24GB VRAM / 48GB RAM)

image-editing image-manipulation vlm pinokio

Updated Nov 5, 2025
JavaScript

Parijat-Ghosh / AI

AI-powered personal assistant with OpenAI & MERN stack. Chat, analyze images, secure subscriptions, 50% faster responses.

react nodejs javascript ai mongodb rest-api chatbot mern cloudinary jwt-authentication razorpay vlm openai-api

Updated Oct 25, 2025
JavaScript

6Morpheus6 / Flux-Kontext-Dfloat11

[NVIDIA ONLY] Gradio demo for Flux Kontext based on Diffusers with single and multiple images. (Minimum Requirements 12GB VRAM / 48GB RAM, Recommended Requirements 24GB VRAM / 48GB RAM)

image-editing image-manipulation image-creation vlm pinokio

Updated Jul 13, 2025
JavaScript

ArunachalamM101202 / AI-scribble

ScribblAI turns your chaotic doodles into photorealistic images using advanced AI — and your friends must guess what you were trying to draw. It's fast, fun, and full of AI magic!

ai image-generation vlm

Updated May 18, 2025
JavaScript

vulab-AI / YESBUT-v2

We introduce YesBut-v2, a benchmark for assessing AI's ability to interpret juxtaposed comic panels with contradictory narratives. Unlike existing benchmarks, it emphasizes visual understanding, comparative reasoning, and social knowledge.

benchmark vlm mllm-evaluation mllm-reasoning yesbut-v2 yesbut

Updated Apr 7, 2025
JavaScript

vulab-AI / YESBUT_Homepage

YesBut Benchmark: project page of the paper "Cracking the Code of Juxtaposition: Can AI Models Understand the Humorous Contradictions", accepted at NeurIPS 2024 (Oral).

benchmark vlm mllm-evaluation yesbut

Updated Nov 13, 2025
JavaScript

6Morpheus6 / OmniSVG

[NVIDIA ONLY] End-to-end multimodal SVG generator capable of generating complex and detailed SVGs, from simple icons to intricate anime characters. (Minimum Requirements 12GB VRAM / 32GB RAM, Recommended Requirements 24GB VRAM / 24GB RAM)

svg vlm pinokio

Updated Jul 25, 2025
JavaScript

Here are 24 public repositories matching this topic...

MigoXLab / dingo

opendatalab-raiser / Envision

manycore-research / CAD2Program

xirui-li / MOSSBench

EvilFreelancer / img2md-vlm-ocr

NxtGenLegend / TreeHacks-ZoneOut

safouaneelg / Talk2FastVLM

hamedR96 / User-VLM

elloza / slides2video-pinokio-script

6Morpheus6 / bagel

alessioborgi / RealTime-VLM

DestroyerDarkNess / fastvlm-webgpu

turningpoint-ai / MOSSBench

6Morpheus6 / BAGEL-DFloat11

Parijat-Ghosh / AI

6Morpheus6 / Flux-Kontext-Dfloat11

ArunachalamM101202 / AI-scribble

vulab-AI / YESBUT-v2

vulab-AI / YESBUT_Homepage

6Morpheus6 / OmniSVG
