WeiboAI / VibeThinker Star 346 Code Issues Pull requests Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B ai transformer language-model huggingface llm sllm reasoning-language-models reasoning-models livecodebench aime2025 Updated Nov 14, 2025 Python
SS47816 / AGI-Elo Star 7 Code Issues Pull requests [NeurIPS 2025] AGI-Elo: How Far Are We From Mastering A Task? benchmark leaderboard agi imagenet coco artificial-general-intelligence datasets evaluation-metrics elo-rating rating-system evaluation-framework sota ai-benchmarks waymo-open-dataset mmlu vision-language-action ai-evaluation-framework livecodebench navsim Updated Oct 28, 2025 Python