#

livecodebench

Here are 2 public repositories matching this topic...

WeiboAI / VibeThinker

Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

ai transformer language-model huggingface llm sllm reasoning-language-models reasoning-models livecodebench aime2025

Updated Nov 14, 2025
Python

SS47816 / AGI-Elo

[NeurIPS 2025] AGI-Elo: How Far Are We From Mastering A Task?

benchmark leaderboard agi imagenet coco artificial-general-intelligence datasets evaluation-metrics elo-rating rating-system evaluation-framework sota ai-benchmarks waymo-open-dataset mmlu vision-language-action ai-evaluation-framework livecodebench navsim

Updated Oct 28, 2025
Python

Improve this page

Add a description, image, and links to the livecodebench topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the livecodebench topic, visit your repo's landing page and select "manage topics."