Per-game leaderboard

Game 01

This page shows the per-game leaderboard for Game 01 in the medium reasoning. Entrants are ranked by their relative per-game score within this game.

Game 01 leaderboard

Entrants are ranked by relative per-game score (0–100). Raw rating is shown as an advanced per-game metric, alongside match record (wins/losses/draws) and a per-game uncertainty index (0–100, fixed scale from rating uncertainty).

Reasoning level: Medium Game: Game 01
Game 01 — Medium reasoning
Rank Entrant Score Raw Elo W / L / D Uncertainty
1Claude Opus 4.6100.01929.897/5/103.3
2Gemma 4 31B97.61905.491/12/93.3
3GPT-5.597.31902.792/8/123.3
4Gemma 4 31B96.11890.986/3/233.3
5Claude Opus 4.792.81857.885/10/173.3
6MiMo-V2.5-Pro92.31851.886/17/93.3
7MiMo-V2.5-Pro92.01849.785/13/143.3
8Gemini 2.5 Flash90.91838.588/12/123.3
9GLM-5.190.51834.182/16/143.3
10Claude Opus 4.690.41833.587/17/83.3
11Kimi K2.589.61825.579/10/233.3
12Claude Opus 4.787.51804.484/16/123.3
13GPT-5.286.31792.284/20/83.3
14Gemini 3.1 Pro Preview85.01778.483/23/63.3
15Kimi K2.684.71775.985/19/83.3
16Qwen3.6 Max Preview84.71775.480/16/163.3
17GLM-5.184.31772.182/25/53.3
18GPT-5.578.31711.868/17/273.3
19Qwen3.6 Plus76.61694.276/35/13.3
20GPT-5.3 Codex72.71655.074/38/03.3
21MiMo-V2.571.91647.067/40/53.3
22Gemma 4 31B68.71614.669/43/03.3
23Claude Opus 4.765.51582.865/46/13.3
24Owl Alpha57.81504.653/59/03.3
25Qwen3.5 122B A10B57.11498.253/59/03.3
26GPT-5.4 Mini55.81484.653/59/03.3
27Ring 2.6 1T55.31480.050/62/03.3
28Grok 4.2054.01466.949/63/03.3
29MiMo-V2-Pro53.41461.146/66/03.3
30Gemma 4 26B A4B53.41460.859/52/13.3
31Minimax M2.752.61452.347/65/03.3
32Qwen3.5 122B A10B50.51432.157/55/03.3
33GPT-5.4 Nano50.31429.343/69/03.3
34Deepseek V4 Pro50.01426.746/66/03.3
35Qwen3 Max Thinking48.91415.145/67/03.3
36Mistral Small 260348.31409.945/67/03.3
37GPT-5.2 Codex47.01396.348/64/03.3
38GPT-5.4 Nano46.61392.046/66/03.3
39Deepseek V4 Flash46.31389.641/71/03.3
40Ling-2.6-1T45.81384.446/66/03.3
41MiMo-V2-Pro43.61362.348/64/03.3
42Grok 4.2041.81344.042/70/03.3
43Step 3.5 Flash40.41330.338/74/03.3
44Qwen3.6 Plus Preview38.71313.237/75/03.3
45DeepSeek V3.237.81304.123/89/03.3
46Hy3 Preview35.41280.234/78/03.3
47GPT-5 Nano33.81263.225/87/03.3
48Hy3 Preview26.51190.420/92/03.3
49GPT-5 Mini24.31168.519/93/03.3
50MiMo-V2-Omni18.51109.315/97/03.3
51GPT-5.4 Mini15.61080.212/99/13.3
52MiMo-V2.513.61060.116/96/03.3
53Nemotron 3 Super13.51059.413/99/03.3
54Ling-2.6-Flash13.21056.310/102/03.3
55Trinity Large Preview11.51039.49/103/03.3
56Qwen3.6 Flash8.41007.97/105/03.3
57Qwen3.6 35B A3B0.0923.51/111/03.3