Skip to content

Pinned Loading

  1. VLM-R1 VLM-R1 Public

    Solve Visual Understanding with Reinforced VLMs

    Python 5.7k 374

  2. VLM-FO1 VLM-FO1 Public

    VLM-FO1: Bridging the Gap Between High-Level Reasoning and Fine-Grained Perception in VLMs

    Python 146 9

  3. OpenTrackVLA OpenTrackVLA Public

    Open & Reproducible Research for Tracking VLAs

    Python 14 2

  4. OmAgent OmAgent Public

    Build multimodal language agents for fast prototype and production

    Python 2.6k 286

  5. OmDet OmDet Public

    Real-time and accurate open-vocabulary end-to-end object detection

    Python 1.4k 111

  6. ZoomEye ZoomEye Public

    [EMNLP-2025 Oral] ZoomEye: Enhancing Multimodal LLMs with Human-Like Zooming Capabilities through Tree-Based Image Exploration

    Python 66 5

Repositories

Showing 10 of 21 repositories
  • OpenTrackVLA Public

    Open & Reproducible Research for Tracking VLAs

    om-ai-lab/OpenTrackVLA’s past year of commit activity
    Python 14 2 0 0 Updated Dec 10, 2025
  • VLM-FO1 Public

    VLM-FO1: Bridging the Gap Between High-Level Reasoning and Fine-Grained Perception in VLMs

    om-ai-lab/VLM-FO1’s past year of commit activity
    Python 146 9 3 0 Updated Nov 28, 2025
  • ZoomEye Public

    [EMNLP-2025 Oral] ZoomEye: Enhancing Multimodal LLMs with Human-Like Zooming Capabilities through Tree-Based Image Exploration

    om-ai-lab/ZoomEye’s past year of commit activity
    Python 66 5 6 0 Updated Nov 20, 2025
  • VLM-R1 Public

    Solve Visual Understanding with Reinforced VLMs

    om-ai-lab/VLM-R1’s past year of commit activity
    Python 5,746 Apache-2.0 374 162 0 Updated Oct 21, 2025
  • om-ai-lab.github.io Public

    Official website for the org

    om-ai-lab/om-ai-lab.github.io’s past year of commit activity
    HTML 0 1 0 0 Updated Aug 15, 2025
  • ImageRAG Public

    Enhancing Ultrahigh Resolution Remote Sensing Imagery Analysis With ImageRAG [GRSM]

    om-ai-lab/ImageRAG’s past year of commit activity
    Jupyter Notebook 24 MIT 0 1 0 Updated Jul 10, 2025
  • open-agent-leaderboard Public

    Reproducible Language Agent Research

    om-ai-lab/open-agent-leaderboard’s past year of commit activity
    Python 31 2 0 0 Updated Jun 25, 2025
  • vlm-r1seg Public
    om-ai-lab/vlm-r1seg’s past year of commit activity
    Python 3 0 0 0 Updated Apr 28, 2025
  • VLM-R1.github.io Public

    Blog Site for VLM-R1

    om-ai-lab/VLM-R1.github.io’s past year of commit activity
    HTML 1 0 0 0 Updated Mar 20, 2025
  • OmAgent Public

    Build multimodal language agents for fast prototype and production

    om-ai-lab/OmAgent’s past year of commit activity
    Python 2,604 Apache-2.0 286 6 12 Updated Mar 19, 2025

People

This organization has no public members. You must be a member to see who’s a part of this organization.