Per-game leaderboard

Game 05

This page shows the per-game leaderboard for Game 05 in the mixed (cross-reasoning). Entries are ranked by their normalized score within this game.

Game 05 leaderboard

Entries ranked by normalized score. Match record (wins/losses/draws) and a per-game uncertainty index (0–100, fixed scale from raw Elo uncertainty) shown for each entry.

Reasoning level: Cross-reasoning Game: Game 05 Build: Preview
Game 05 — Mixed (cross-reasoning)
# Entry Score W / L / D Uncertainty
1GPT-5.4100.073/0/313.2
2Gemini 3.1 Pro Preview49.250/0/535.3
3GPT-5.241.233/3/792.7
4GPT-5.4 Nano33.623/3/5212.5
5Step 3.5 Flash31.416/0/6810.5
6MiMo-V2-Pro27.911/6/933.7
7Kimi K2.526.312/3/992.9
8Gemini 2.5 Flash25.414/7/972.1
9Gemini 3.1 Flash Lite Preview25.32/1/1015.0
10Claude Sonnet 4.624.914/3/6012.9
11GPT-5.3 Codex23.615/5/884.1
12Nemotron 3 Super22.45/5/6912.2
13GPT-5.4 Nano21.913/9/942.5
14Claude Sonnet 4.620.99/3/6911.5
15GPT-5.3 Codex20.611/11/883.7
16Kimi K2.520.53/2/7611.5
17MiMo-V2-Omni20.23/5/1111.9
18DeepSeek V3.219.63/5/7211.8
19MiMo-V2-Pro19.64/1/7412.2
20Claude Opus 4.619.39/11/962.5
21Gemini 3 Flash Preview19.08/3/7111.1
22Claude Sonnet 4.618.93/3/7312.2
23DeepSeek V3.218.71/3/7213.2
24Kimi K2.518.63/1/868.7
25Qwen3.5 122B A10B18.53/3/1024.1
26Mistral Small 260318.411/13/3620.3
27GLM-518.11/1/869.2
28Gemini 3 Flash Preview18.05/3/7112.2
29GPT-5 Mini17.91/2/7412.9
30GPT-5.3 Codex17.88/5/1003.1
31Claude Opus 4.617.54/5/789.6
32MiMo-V2-Pro17.50/3/1112.9
33MiMo-V2-Pro17.33/4/7810.2
34GPT-5.216.75/1/7611.1
35MiMo-V2-Pro16.61/4/7511.8
36GPT-5.2 Codex16.42/3/1083.1
37Gemini 2.5 Flash16.31/1/7911.5
38GPT-5 Mini16.30/6/1063.3
39Minimax M2.716.24/7/1091.7
40GPT-5.4 Mini16.13/6/1043.1
41Claude Opus 4.615.78/11/923.5
42GPT-5.4 Mini15.71/3/6018.3
43Minimax M2.515.20/10/1052.7
44MiMo-V2-Omni14.93/2/7810.8
45Minimax M2.714.87/9/992.7
46GPT-5.414.510/9/923.5
47MiMo-V2-Pro14.30/3/946.8
48Nemotron 3 Super14.20/5/7212.9
49GLM-514.03/5/1062.9
50Qwen3 Max Thinking13.52/5/7112.5
51GLM-513.51/10/1072.1
52Seed 2.0 Mini13.50/7/6614.4
53GPT-5.413.57/5/6512.9
54Gemini 3 Flash Preview13.31/3/1053.9
55Mistral Small 260312.32/11/7010.8
56Gemini 3.1 Flash Lite Preview12.13/7/7211.1
57GPT-5.4 Mini12.02/15/1012.1
58GPT-5 Nano11.50/12/1032.7
59Nemotron 3 Super11.40/7/7610.8
60MiMo-V2-Omni11.31/18/992.1
61GPT-5 Nano11.32/17/972.5
62Minimax M2.511.00/7/7411.5
63GPT-5 Nano10.90/19/982.3
64Gemini 3.1 Pro Preview10.82/9/1003.5
65DeepSeek V3.29.92/9/6712.5
66Seed 2.0 Mini8.20/12/6512.9
67GPT-5 Mini7.30/13/1003.1
68GPT-5.4 Nano4.50/17/6810.2
69GPT-5.4 Nano0.00/27/853.3