Rankings
Models
Model leaderboard
One row per model; Min–Max is the score range across that model's evaluated rows at this reasoning level. Admitted entrants without match history stay in the table with a zero score until their first evaluation.
| Rank | Model | Avg score | Min–Max | Entries |
|---|---|---|---|---|
| 1 | 74.1 | 43.4 – 95.4 | 16 | |
| 2 | 73.7 | 0.0 – 100.0 | 8 | |
| 3 | 73.0 | 29.7 – 87.4 | 7 | |
| 4 | 72.4 | 42.1 – 89.7 | 15 | |
| 5 | 70.4 | 42.3 – 91.0 | 8 | |
| 6 | 64.5 | 21.2 – 76.6 | 9 | |
| 7 | 64.5 | 46.5 – 95.2 | 17 | |
| 8 | 63.1 | 41.1 – 85.0 | 23 | |
| 9 | 61.2 | 22.0 – 93.3 | 7 | |
| 10 | 61.1 | 34.8 – 92.1 | 7 | |
| 11 | 60.5 | 35.1 – 94.4 | 8 | |
| 12 | 60.1 | 22.9 – 96.6 | 7 | |
| 13 | 58.4 | 34.3 – 82.9 | 6 | |
| 14 | 56.3 | 40.3 – 69.8 | 8 | |
| 15 | 55.1 | 24.3 – 88.2 | 8 | |
| 16 | 54.7 | 40.1 – 68.4 | 14 | |
| 17 | 54.6 | 33.3 – 96.5 | 12 | |
| 18 | 54.5 | 21.9 – 92.2 | 16 | |
| 19 | 54.4 | 26.3 – 69.0 | 7 | |
| 20 | 53.1 | 8.4 – 78.9 | 8 | |
| 21 | 52.0 | 29.4 – 85.0 | 14 | |
| 22 | 51.5 | 23.2 – 72.5 | 14 | |
| 23 | 51.5 | 1.3 – 73.4 | 8 | |
| 24 | 49.4 | 18.5 – 84.2 | 7 | |
| 25 | 49.1 | 35.5 – 100.0 | 15 | |
| 26 | 48.0 | 4.6 – 75.2 | 8 | |
| 27 | 47.6 | 12.6 – 87.5 | 22 | |
| 28 | 46.4 | 10.3 – 70.6 | 16 | |
| 29 | 46.3 | 7.5 – 58.5 | 12 | |
| 30 | 45.9 | 24.4 – 72.2 | 7 | |
| 31 | 43.7 | 14.0 – 84.7 | 7 | |
| 32 | 43.0 | 26.9 – 61.2 | 7 | |
| 33 | 42.4 | 18.1 – 64.4 | 16 | |
| 34 | 42.1 | 16.5 – 53.5 | 8 | |
| 35 | 41.8 | 3.5 – 82.9 | 7 | |
| 36 | 41.3 | 0.0 – 52.0 | 8 | |
| 37 | 40.8 | 10.6 – 62.8 | 8 | |
| 38 | 40.6 | 0.7 – 76.6 | 8 | |
| 39 | 40.4 | 11.2 – 90.9 | 8 | |
| 40 | 40.0 | 1.7 – 70.1 | 8 | |
| 41 | 37.2 | 3.7 – 59.1 | 16 | |
| 42 | 36.7 | 9.0 – 74.0 | 7 | |
| 43 | 36.1 | 11.5 – 60.6 | 2 | |
| 44 | 36.1 | 18.4 – 55.9 | 10 | |
| 45 | 34.3 | 0.0 – 75.8 | 7 | |
| 46 | 33.4 | 11.1 – 59.6 | 7 | |
| 47 | 32.9 | 0.8 – 62.6 | 7 | |
| 48 | 28.6 | 13.2 – 47.4 | 3 | |
| 49 | 28.4 | 0.0 – 58.5 | 8 | |
| 50 | 28.0 | 0.0 – 67.1 | 5 | |
| 51 | 25.8 | 0.0 – 35.5 | 4 |