Game 07 leaderboard

Entrants are ranked by relative per-game score (0–100). Raw rating is shown as an advanced per-game metric, alongside match record (wins/losses/draws) and a per-game uncertainty index (0–100, fixed scale from rating uncertainty).

Reasoning level: Highest Game: Game 07

Game 07 — Highest reasoning
Rank	Entrant	Score	Raw Elo	W / L / D	Uncertainty
1	Mistral Small 2603	100.0	1711.3	46/2/56	5.0
2	MiMo-V2.5	97.2	1697.5	45/4/39	9.2
3	Ling-2.6-1T	95.4	1682.3	49/9/53	3.5
4	Hy3 Preview	94.7	1680.1	56/12/30	6.5
5	Qwen3.6 Plus Preview	91.7	1663.1	39/0/53	8.1
6	GPT-5.4	91.6	1662.6	43/7/43	7.8
7	Grok 4.20	91.6	1663.3	41/2/46	9.0
8	Qwen3 Max Thinking	90.8	1657.2	55/16/23	7.5
9	MiMo-V2.5-Pro	90.5	1657.2	44/6/38	9.2
10	Qwen3.6 Max Preview	90.3	1656.4	45/9/32	9.9
11	Deepseek V4 Pro	89.8	1651.1	52/27/17	7.0
12	Hy3 Preview	88.0	1641.9	40/3/44	9.6
13	Cobuddy	84.9	1624.4	25/0/56	11.5
14	GPT-5.2	84.5	1614.3	44/14/64	1.3
15	GPT-5.4	83.7	1611.2	35/2/73	3.7
16	Kimi K2.5	82.3	1605.5	43/14/39	7.0
17	MiMo-V2-Pro	79.1	1586.8	46/36/8	8.7
18	GPT-5.4 Mini	77.2	1572.1	42/17/48	4.4
19	Claude Opus 4.6	75.3	1560.5	25/13/70	4.1
20	Minimax M2.7	73.6	1553.7	47/27/17	8.4
21	Step 3.5 Flash	72.1	1546.4	18/4/61	10.8
22	GPT-5.4 Nano	72.1	1543.7	12/10/71	7.8
23	Nemotron 3 Super	72.0	1546.2	15/5/60	11.8
24	Claude Sonnet 4.6	71.3	1555.0	5/0/43	27.7
25	Claude Opus 4.6	71.2	1556.2	6/1/38	30.0
26	GPT-5.4 Mini	71.0	1558.3	3/0/37	34.3
27	Qwen3.6 Plus	70.4	1531.9	17/8/77	5.5
28	MiMo-V2.5-Pro	70.2	1552.2	4/0/38	32.5
29	GPT-5.5	68.8	1533.9	3/2/54	20.8
30	Gemini 3.1 Pro Preview	67.8	1513.7	23/17/73	3.1
31	GPT-5.3 Codex	67.1	1535.4	3/0/36	35.3
32	Nemotron 3 Nano Omni 30B A3B Reasoning	63.8	1498.3	0/6/68	14.0
33	Grok 4.20	63.8	1509.7	0/1/46	28.4
34	Gemma 4 26B A4B	63.8	1512.2	2/2/39	31.6
35	GPT-5.2	62.2	1500.8	1/2/43	29.2
36	Gemini 2.5 Flash	61.4	1475.8	13/27/69	3.9
37	GPT-5.5	61.2	1516.1	2/1/21	56.2
38	Deepseek V4 Flash	60.6	1496.9	1/2/35	36.3
39	Minimax M2.5	59.7	1468.8	42/47/3	8.1
40	DeepSeek V3.2	58.7	1461.6	40/43/13	7.0
41	GLM-5	58.6	1461.5	2/13/80	7.3
42	Minimax M2.7	56.8	1451.0	20/28/44	8.1
43	Qwen3.6 Flash	55.2	1443.4	13/22/47	11.1
44	Qwen3.6 35B A3B	52.2	1423.3	25/41/24	8.7
45	Kimi K2.5	49.6	1406.1	29/41/26	7.0
46	Owl Alpha	49.4	1407.5	4/33/49	9.9
47	MiMo-V2.5	47.5	1394.3	0/32/60	8.1
48	Gemini 3.1 Flash Lite Preview	44.4	1374.9	34/60/0	7.5
49	Gemma 4 31B	40.8	1354.5	2/37/49	9.2
50	Gemini 3 Flash Preview	38.1	1337.9	22/61/7	8.7
51	Gemma 4 31B	34.8	1317.2	13/52/27	8.1
52	Qwen3.5 122B A10B	26.1	1264.6	22/59/9	8.7
53	Kimi K2.6	22.2	1237.2	13/75/20	4.1
54	GPT-5 Nano	19.2	1222.0	2/67/25	7.5
55	Ling-2.6-Flash	18.0	1213.9	1/66/31	6.5
56	GPT-5 Mini	17.6	1213.0	12/64/15	8.4
57	Gemma 4 31B	16.6	1207.6	11/72/5	9.2
58	MiMo-V2-Pro	0.0	1104.6	4/80/12	7.0