cold-start-initialization

Here is 1 public repository matching this topic...

CSfufu / Revisual-R1

🚀ReVisual-R1 is a 7B open-source multimodal language model that follows a three-stage curriculum—cold-start pre-training, multimodal reinforcement learning, and text-only reinforcement learning—to achieve faithful, concise, and self-reflective state-of-the-art performance in visual and textual reasoning.

reinforcement-learning visual-reasoning mathematical-reasoning data-efficiency multimodal-large-language-model prioritized-advantage-distillation cold-start-initialization efficient-length-reward open-source-7b-model self-reflective-chain-of-thought

Updated Dec 10, 2025
Python

Improve this page

Add a description, image, and links to the cold-start-initialization topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the cold-start-initialization topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cold-start-initialization

Here is 1 public repository matching this topic...

CSfufu / Revisual-R1

Improve this page

Add this topic to your repo