Per-game leaderboard

Game 06

This page shows the per-game leaderboard for Game 06 in the mixed (cross-reasoning). Entries are ranked by their normalized score within this game.

Game 06 leaderboard

Entries ranked by normalized score. Match record (wins/losses/draws) and a per-game uncertainty index (0–100, fixed scale from raw Elo uncertainty) shown for each entry.

Reasoning level: Cross-reasoning Game: Game 06 Build: Preview
Game 06 — Mixed (cross-reasoning)
# Entry Score W / L / D Uncertainty
1Gemini 3.1 Pro Preview100.031/4/763.5
2Gemini 3 Flash Preview85.025/6/842.7
3MiMo-V2-Pro72.815/0/6412.2
4Minimax M2.570.520/4/893.1
5GPT-5.468.022/1/5911.1
6Gemini 3.1 Pro Preview66.428/3/842.7
7Claude Sonnet 4.665.422/6/921.7
8DeepSeek V3.262.321/4/912.5
9Claude Sonnet 4.661.320/6/892.7
10Gemini 3 Flash Preview59.320/7/5012.9
11Minimax M2.756.515/1/6411.8
12Gemini 3.1 Flash Lite Preview56.214/0/6910.8
13Kimi K2.555.411/0/1062.3
14Gemini 3.1 Flash Lite Preview53.313/2/6312.5
15GLM-551.18/9/992.5
16DeepSeek V3.249.214/3/992.5
17Claude Sonnet 4.648.214/10/688.1
18Gemini 2.5 Flash48.17/0/1063.1
19GPT-5 Mini48.15/5/1003.7
20DeepSeek V3.247.96/0/1082.9
21GPT-5.4 Mini47.16/0/7511.5
22Minimax M2.546.66/8/7010.5
23Nemotron 3 Super46.25/1/7710.8
24MiMo-V2-Pro46.010/3/6612.2
25Gemini 3 Flash Preview44.68/1/7111.8
26Claude Opus 4.644.67/1/7411.1
27Claude Opus 4.644.08/0/1062.9
28Claude Opus 4.643.35/2/7511.1
29MiMo-V2-Omni43.02/0/1142.5
30Claude Opus 4.642.98/1/1052.9
31GPT-5.4 Nano42.918/16/802.9
32Gemini 2.5 Flash42.63/0/7811.5
33Gemini 3.1 Flash Lite Preview41.86/0/7411.8
34GPT-5.239.41/2/7412.9
35GPT-5.4 Nano39.29/11/5613.2
36GPT-5 Nano39.00/12/993.5
37GPT-5.4 Nano38.92/1/7811.5
38GPT-5.3 Codex38.82/2/6516.0
39Minimax M2.738.51/3/7512.2
40GPT-5.238.35/4/7111.8
41Gemini 2.5 Flash38.31/0/8210.8
42GPT-5 Mini38.24/4/1033.5
43GPT-5.3 Codex38.018/14/832.7
44GLM-537.11/5/1063.3
45GPT-5 Mini37.00/10/1052.7
46GPT-5.4 Nano36.72/6/7112.2
47MiMo-V2-Omni36.73/2/7910.5
48GPT-5.4 Mini34.72/4/5420.3
49GPT-5.2 Codex34.11/3/7711.5
50MiMo-V2-Omni33.33/10/5914.8
51MiMo-V2-Pro33.25/17/942.5
52GPT-5.3 Codex33.02/6/7710.2
53GLM-532.49/11/972.3
54GPT-5.431.713/11/5412.5
55Kimi K2.531.40/9/984.4
56Claude Opus 4.631.213/16/862.7
57GPT-5.4 Mini27.93/12/6312.5
58GPT-5.224.213/25/782.5
59Nemotron 3 Super22.50/14/6611.8
60Nemotron 3 Super20.80/27/853.3
61MiMo-V2-Pro20.77/14/932.9
62MiMo-V2-Pro15.42/32/753.9
63GPT-5 Nano12.20/26/902.5
64Mistral Small 260310.73/23/4913.6
65GPT-5 Nano2.51/27/5311.5
66Kimi K2.51.50/35/802.7
67Mistral Small 26030.61/23/4217.3
68Mistral Small 26030.03/28/4612.9