Per-game leaderboard

Game 06

This page shows the per-game leaderboard for Game 06 in the mixed (cross-reasoning). Entrants are ranked by their relative per-game score within this game.

Game 06 leaderboard

Entrants are ranked by relative per-game score (0–100). Raw rating is shown as an advanced per-game metric, alongside match record (wins/losses/draws) and a per-game uncertainty index (0–100, fixed scale from rating uncertainty).

Reasoning level: Cross-reasoning Game: Game 06
Game 06 — Mixed (cross-reasoning)
Rank Model Reasoning Score Raw Elo W / L / D Uncertainty
1Gemini 3.1 Pro PreviewHighest100.01644.697/6/1560.0
2MiMo-V2.5Highest81.41594.065/34/1710.0
3GPT-5.4 NanoHighest77.61583.678/21/1740.0
4Kimi K2.5Medium77.01584.443/7/1780.0
5Gemma 4 31BNone76.01579.169/14/1950.0
6GPT-5.2Highest75.41578.494/38/1300.0
7GPT-5.3 CodexHighest72.11571.153/20/1530.0
8MiMo-V2.5Highest71.01565.445/17/2240.0
9Gemini 3 Flash PreviewMedium70.11564.744/13/1930.0
10Nemotron 3 SuperHighest68.81562.724/11/1870.0
11Gemini 3 Flash PreviewHighest68.81559.244/25/2200.0
12MiMo-V2-ProNone68.71561.525/7/2040.0
13GPT-5.4 NanoHighest68.61560.058/27/1780.0
14Minimax M2.7Highest68.21557.929/12/2430.0
15Gemini 2.5 FlashNone66.81557.626/11/1780.0
16GLM-5.1Highest66.31554.827/8/2050.0
17GPT-5.4 NanoHighest66.11552.852/24/1950.0
18Claude Opus 4.6Highest65.81553.754/22/1590.0
19Gemini 3.1 Flash Lite PreviewNone65.51551.029/15/2290.0
20GPT-5.2Highest65.21552.270/16/1460.0
21Hy3 PreviewNone65.01550.741/15/1960.0
22MiMo-V2.5-ProMedium64.61550.427/17/1940.0
23GPT-5.5Medium63.11545.441/30/1870.0
24MiMo-V2.5Medium62.21543.531/17/1980.0
25Gemma 4 31BMedium62.21542.452/12/2030.0
26Deepseek V4 ProHighest60.41538.339/23/1900.0
27Claude Opus 4.6Medium59.21534.031/17/2250.0
28GPT-5.2None59.21535.618/10/2120.0
29GPT-5.3 CodexMedium59.01534.011/6/2460.0
30Qwen3.6 Max PreviewNone58.41532.918/19/2150.0
31Gemini 3.1 Pro PreviewHighest57.91531.652/19/1800.0
32GPT-5.4None57.21529.444/11/2060.0
33Claude Opus 4.6None57.01529.727/5/2130.0
34Step 3.5 FlashHighest56.71527.511/12/2460.0
35Qwen3.6 FlashMedium56.51527.029/11/2320.0
36GPT-5.5None56.21527.131/24/1970.0
37Gemma 4 31BNone56.11525.955/13/2020.0
38Ling-2.6-1TMedium55.61526.911/6/2090.0
39Qwen3 Max ThinkingHighest55.31527.328/6/1720.0
40Kimi K2.5Medium55.01522.779/19/1800.0
41Claude Opus 4.7None54.31522.945/13/1770.0
42Grok 4.20None54.11521.59/22/2170.0
43Owl AlphaHighest53.91528.617/4/1260.0
44Qwen3 Max ThinkingMedium53.61519.726/14/2170.0
45GPT-5.4 NanoNone53.51519.716/21/2160.0
46GLM-5.1Medium53.11520.928/23/1630.0
47Gemini 3.1 Flash Lite PreviewMedium52.91532.73/4/936.0
48Qwen3.6 PlusNone52.81521.217/7/1720.0
49Grok 4.20Highest52.61518.45/12/2160.0
50Qwen3.6 PlusNone52.31518.39/8/2030.0
51MiMo-V2-OmniMedium51.91517.412/7/2010.0
52MiMo-V2-ProMedium51.81514.148/38/1880.0
53DeepSeek V3.2Highest51.41513.523/20/2200.0
54GPT-5.2Medium51.31513.59/8/2420.0
55Qwen3.5 122B A10BMedium51.11512.943/27/1900.0
56Claude Opus 4.6None50.81513.223/5/2090.0
57Ring 2.6 1TMedium50.81512.518/10/2220.0
58MiMo-V2-ProMedium50.81512.322/36/1950.0
59Claude Opus 4.6Highest50.61512.422/5/2170.0
60Gemini 3.1 Pro PreviewMedium50.41510.735/11/2200.0
61GPT-5 MiniMedium50.31511.18/17/2310.0
62Claude Opus 4.7None50.31509.648/14/2250.0
63Gemma 4 31BHighest50.31514.24/2/1940.0
64Claude Sonnet 4.6None50.11511.232/12/1980.0
65Gemini 3 Flash PreviewNone50.01508.414/27/2580.0
66Minimax M2.5Medium50.01508.829/17/2380.0
67Deepseek V4 ProNone49.91510.519/19/2080.0
68GPT-5.5Highest49.91510.941/28/1660.0
69Qwen3.6 Plus PreviewMedium49.41507.816/22/2360.0
70GPT-5 MiniHighest49.21507.48/13/2460.0
71Claude Sonnet 4.6Medium49.01508.239/11/1930.0
72Claude Opus 4.6Medium49.01508.722/15/1960.0
73Gemini 2.5 FlashMedium49.01510.62/6/1930.0
74Claude Opus 4.7Highest48.81507.053/25/1770.0
75Owl AlphaHighest48.71509.87/9/1850.0
76MiMo-V2.5-ProNone48.61509.54/10/1890.0
77Qwen3.6 Plus PreviewHighest48.51506.539/6/2000.0
78Gemma 4 26B A4BNone48.41506.67/11/2230.0
79MiMo-V2-ProHighest48.21504.026/15/2410.0
80Gemini 2.5 FlashHighest47.61506.314/7/1870.0
81Claude Opus 4.6Highest47.61504.726/14/1950.0
82GPT-5.2 CodexMedium47.31501.85/14/2580.0
83DeepSeek V3.2None47.11504.28/8/2040.0
84Claude Opus 4.7None47.01501.319/26/2300.0
85Gemma 4 31BMedium46.81501.910/8/2300.0
86Qwen3.5 122B A10BHighest46.61502.54/5/2200.0
87Ling-2.6-1TNone46.41502.212/10/2030.0
88Claude Opus 4.7Medium46.01498.112/18/2540.0
89GPT-5 MiniNone46.01499.05/10/2460.0
90GPT-5.4 NanoNone45.11499.410/3/2000.0
91MiMo-V2-OmniNone45.11498.19/13/2110.0
92Hy3 PreviewMedium44.71496.34/17/2280.0
93Kimi K2.6Medium44.71498.710/3/1920.0
94Qwen3.6 Max PreviewMedium44.61494.822/25/2260.0
95Minimax M2.7Medium44.61495.911/16/2220.0
96Claude Sonnet 4.6Highest44.51495.744/16/1870.0
97GPT-5.5Medium44.41495.337/28/1870.0
98Gemma 4 31BMedium44.41495.94/20/2150.0
99DeepSeek V3.2Medium44.31493.531/17/2370.0
100GPT-5.4 MiniNone44.01495.010/5/2210.0
101Kimi K2.6Highest43.91493.239/20/2070.0
102GLM-5None43.81494.66/8/2210.0
103MiMo-V2.5None43.71493.612/12/2210.0
104Grok 4.20Medium43.61491.522/27/2380.0
105Kimi K2.5None43.31492.611/12/2220.0
106Deepseek V4 FlashNone43.21491.714/16/2280.0
107GPT-5.2 CodexNone42.31491.13/11/2120.0
108Deepseek V4 FlashMedium42.11491.110/11/1960.0
109Mistral Small 2603Medium40.61484.751/74/1350.0
110Seed 2.0 MiniMedium40.31484.15/24/2270.0
111MiMo-V2.5-ProHighest40.11482.99/23/2370.0
112Qwen3.6 PlusMedium40.01525.01/0/3439.6
113Kimi K2.5Highest39.71484.614/35/1670.0
114Qwen3.6 Max PreviewHighest38.71479.120/25/2260.0
115Kimi K2.5Highest38.41501.40/1/6915.6
116Minimax M2.5Highest38.31479.65/36/1970.0
117Gemma 4 31BNone38.21479.86/12/2120.0
118Deepseek V4 FlashHighest38.01481.138/15/1480.0
119Qwen3.6 PlusNone37.21476.022/42/1870.0
120Gemini 3.1 Flash Lite PreviewHighest37.11474.427/19/2340.0
121Hy3 PreviewMedium36.91474.39/16/2440.0
122GLM-5Medium36.61476.124/24/1680.0
123MiMo-V2.5-ProMedium36.31474.518/61/1520.0
124Qwen3.6 35B A3BHighest35.41471.513/22/2070.0
125GPT-5.5Highest35.31471.435/26/1800.0
126Step 3.5 FlashMedium35.11471.27/30/1990.0
127Claude Opus 4.7Medium33.81466.98/14/2290.0
128GPT-5.4 MiniHighest33.41465.98/33/2090.0
129Grok 4.20Highest32.61516.71/0/2454.3
130Nemotron 3 SuperMedium31.91462.70/40/1930.0
131Qwen3.6 FlashHighest31.41459.318/47/2070.0
132Deepseek V4 ProMedium31.01457.419/28/2420.0
133GPT-5.4Medium30.41458.442/50/1420.0
134GPT-5.4 MiniMedium30.31457.314/21/2190.0
135Gemma 4 26B A4BMedium30.01457.50/21/2120.0
136GPT-5.3 CodexNone29.51454.010/20/2470.0
137Nemotron 3 SuperNone29.21455.31/43/1900.0
138Hy3 PreviewHighest29.21453.55/23/2400.0
139Ling-2.6-FlashMedium28.71454.91/36/1800.0
140GPT-5.4 NanoMedium28.41452.14/31/2190.0
141Gemma 4 31BHighest28.31512.00/0/2162.9
142Gemma 4 31BHighest28.21505.01/1/2354.3
143Claude Opus 4.7Medium28.01451.313/39/1980.0
144Seed 2.0 MiniNone27.91509.10/0/2260.5
145GLM-5.1None27.61503.30/0/2554.3
146MiMo-V2.5Medium27.31501.10/0/2652.5
147Ring 2.6 1THighest27.21449.418/47/1760.0
148GPT-5 NanoNone26.41446.51/38/2190.0
149Gemini 3.1 Pro PreviewMedium25.61501.10/1/2258.3
150MiMo-V2-OmniHighest25.41443.718/55/1860.0
151Kimi K2.6None25.31502.20/0/2260.5
152MiMo-V2.5-ProNone25.31504.10/0/2162.9
153Qwen3.6 PlusHighest25.31503.90/0/2162.9
154Grok 4.20Medium24.51503.70/0/2065.4
155Qwen3.6 35B A3BHighest23.01437.615/51/1880.0
156Ling-2.6-FlashHighest22.91438.85/31/1890.0
157GPT-5.5None20.31429.327/65/1850.0
158MiMo-V2-ProNone20.31430.513/61/1760.0
159GLM-5Highest20.21483.30/1/2454.3
160Grok 4.20None17.21422.20/61/1870.0
161GPT-5 NanoHighest15.11458.00/3/3239.6
162MiMo-V2.5None14.41438.12/8/5617.3
163CobuddyHighest14.21415.50/53/1700.0
164Mistral Small 2603None13.91411.611/68/2060.0
165Ling-2.6-1THighest9.91402.60/57/1910.0
166Qwen3.6 FlashNone9.51442.90/3/3239.6
167Hy3 PreviewNone8.61398.50/66/1960.0
168Mistral Small 2603Highest8.51397.234/80/1650.0
169GPT-5 NanoMedium8.11395.711/89/1910.0
170Nemotron 3 Nano Omni 30B A3B ReasoningHighest7.11395.82/47/1840.0
171MiMo-V2.5-ProHighest6.31391.46/76/1940.0
172GLM-5.1None2.81384.10/75/1610.0
173Kimi K2.5None0.01429.00/4/2154.3