🚀 ReVisual-R1 is a 7B open-source multimodal language model trained with a three-stage curriculum (cold-start pre-training, multimodal reinforcement learning, then text-only reinforcement learning) that produces faithful, concise, and self-reflective reasoning and reaches state-of-the-art performance on visual and textual reasoning tasks.
reinforcement-learning visual-reasoning mathematical-reasoning data-efficiency multimodal-large-language-model prioritized-advantage-distillation cold-start-initialization efficient-length-reward open-source-7b-model self-reflective-chain-of-thought
Updated Dec 10, 2025 - Python