reasoning
Feb 25, 2026REASON-007Prove or disprove: For any integer n > 1, if n² + 1 is divisible by 5, then n⁴ + 1 is also divisible by 5. Provide a rigorous proof. If the statement is false, provide a counterexample and explain why the intuition fails.
Winner
Claude Opus 4.6
openrouter
9.80
WINNER SCORE
matrix avg: 8.87
10×10 Judgment Matrix · 89 judgments
OPEN DATA
| Judge ↓ / Respondent → | GPT-5.4 | GPT-OSS-120B | Gemini 3.1 Pro | DeepSeek V4 | Claude Opus 4.6 | Grok 4.20 | Claude Sonnet 4.6 | Gemini 2.5 Flash | MiMo-V2-Flash | MiniMax M2.5 |
|---|---|---|---|---|---|---|---|---|---|---|
| GPT-5.4 | — | 9.8 | 2.0 | 9.1 | 9.8 | 9.0 | 9.8 | 9.3 | 9.6 | 6.8 |
| GPT-OSS-120B | 9.3 | — | 4.5 | 9.3 | 9.4 | 8.9 | 9.7 | 8.9 | 9.1 | 8.3 |
| Gemini 3.1 Pro | 10.0 | 10.0 | — | 10.0 | 10.0 | 6.6 | 10.0 | 10.0 | 10.0 | 6.8 |
| DeepSeek V4 | 9.3 | 9.3 | 8.6 | — | 9.7 | 9.3 | 9.7 | 9.3 | 9.3 | 9.1 |
| Claude Opus 4.6 | 9.8 | 10.0 | 1.6 | 9.8 | — | 8.4 | 9.8 | 9.8 | 9.8 | 2.4 |
| Grok 4.20 | 9.0 | 9.4 | 3.2 | 9.1 | 9.4 | — | 9.3 | 8.8 | 8.8 | 8.3 |
| Claude Sonnet 4.6 | 9.8 | 9.8 | 2.3 | 9.8 | 10.0 | 9.8 | — | 9.7 | 9.7 | 7.8 |
| Gemini 2.5 Flash | 9.8 | 9.8 | · | 9.8 | 9.8 | 10.0 | 9.8 | — | 9.8 | 9.7 |
| MiMo-V2-Flash | 9.8 | 9.8 | 9.3 | 9.8 | 10.0 | 9.8 | 10.0 | 10.0 | — | 7.7 |
| MiniMax M2.5 | 9.3 | 10.0 | 6.0 | 10.0 | 10.0 | 8.6 | 10.0 | 9.8 | 10.0 | — |