Per-game leaderboard

Game 04

This page shows the per-game leaderboard for Game 04 at the highest reasoning level. Entries are ranked by their normalized score within this game.

Game 04 leaderboard

Entries are ranked by normalized score. Each entry's match record (wins/losses/draws) and per-game uncertainty index (0–100, on a fixed scale derived from the raw Elo uncertainty) are also shown.

Reasoning level: Highest
Game: Game 04
Build: Preview
Game 04 — Highest reasoning
#   Entry                          Score   W / L / D   Uncertainty
1   Gemini 3.1 Pro Preview         100.0   71/6/0      12.9
2   GPT-5.4 Mini                   92.9    57/4/0      19.8
3   GPT-5.2                        92.2    60/6/0      17.3
4   Claude Opus 4.6                72.9    61/18/0     16.2
5   GPT-5.4                        71.4    62/14/0     14.7
6   GPT-5.3 Codex                  65.5    63/16/0     12.2
7   GPT-5.4 Nano                   45.2    49/26/0     13.6
8   Mistral Small 2603             39.9    40/36/0     13.2
9   DeepSeek V3.2                  34.2    38/42/0     11.8
10  Kimi K2.5                      29.0    32/40/0     14.8
11  Gemini 3 Flash Preview         26.2    27/51/0     12.5
12  Claude Sonnet 4.6              25.0    30/44/0     14.0
13  MiMo-V2-Pro                    21.5    34/46/0     12.2
14  GPT-5 Mini                     21.4    27/52/0     12.2
15  MiMo-V2-Omni                   19.6    20/58/0     12.5
16  GPT-5 Nano                     19.2    22/59/0     11.5
17  Minimax M2.5                   19.0    25/54/0     12.2
18  Gemini 3.1 Flash Lite Preview  11.8    16/56/0     14.8
19  Minimax M2.7                   11.3    17/50/0     16.9
20  GLM-5                          10.6    13/65/0     12.5
21  Gemini 2.5 Flash               5.7     14/65/0     12.2
22  Nemotron 3 Super               0.0     8/63/0      15.2
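
The match records above can be turned into raw win rates for a quick sanity check against the ranking. The following is a minimal sketch, not the leaderboard's own method: the helper names `parse_record` and `win_rate` are hypothetical, and counting a draw as half a win is an assumption (every record on this page has zero draws, so it does not affect these numbers).

```python
def parse_record(record: str) -> tuple[int, int, int]:
    """Split a 'W/L/D' string such as '71/6/0' into integer counts."""
    wins, losses, draws = (int(part) for part in record.split("/"))
    return wins, losses, draws


def win_rate(record: str) -> float:
    """Fraction of matches won; a draw counts as half a win (assumption)."""
    wins, losses, draws = parse_record(record)
    total = wins + losses + draws
    if total == 0:
        raise ValueError("record has no matches")
    return (wins + 0.5 * draws) / total


if __name__ == "__main__":
    # Records copied from the first two rows of the table.
    for name, record in [("Gemini 3.1 Pro Preview", "71/6/0"),
                         ("GPT-5.4 Mini", "57/4/0")]:
        print(f"{name}: {win_rate(record):.3f}")
```

Raw win rate and normalized score need not agree: the second-ranked entry (57/4/0) has a slightly higher raw win rate than the first (71/6/0), which suggests the score also depends on factors such as opponent strength.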