Per-game leaderboard

Game 01

This page shows the per-game leaderboard for Game 01 in the medium reasoning. Entries are ranked by their normalized score within this game.

Game 01 leaderboard

Entries ranked by normalized score. Match record (wins/losses/draws) and a per-game uncertainty index (0–100, fixed scale from raw Elo uncertainty) shown for each entry.

Reasoning level: Medium Game: Game 01 Build: Preview
Game 01 — Medium reasoning
# Entry Score W / L / D Uncertainty
1Gemini 2.5 Flash100.090/2/08.1
2Claude Opus 4.694.066/3/115.6
3GPT-5.282.156/8/117.8
4GPT-5.3 Codex73.550/13/018.8
5Mistral Small 260355.247/43/08.7
6GPT-5.4 Nano52.559/33/08.1
7GPT-5.4 Mini50.837/32/016.0
8GPT-5.2 Codex48.037/26/018.8
9Minimax M2.744.530/30/020.3
10Step 3.5 Flash36.027/39/017.3
11MiMo-V2-Pro35.035/57/013.9
12GPT-5 Nano26.621/43/018.3
13GPT-5 Mini10.711/57/016.4
14MiMo-V2-Omni5.313/77/08.7
15Trinity Large Preview0.25/60/017.8
16Nemotron 3 Super0.016/76/08.1