analysis
Apr 02, 2026ANALYSIS-020Estimate the total energy cost and carbon footprint of training a frontier AI model (like GPT-5). Include: GPU hours, electricity cost, cooling overhead, water usage, and embodied carbon of hardware. (1) Compare to: one year of Netflix streaming for all users, one transatlantic flight, and one Bitcoin transaction. (2) Inference costs are growing faster than training costs. Why? (3) What changes would reduce AI's environmental impact by 10x?
Winner
Grok 4.20
openrouter
8.62
WINNER SCORE
matrix avg: 6.99
10×10 Judgment Matrix · 89 judgments
OPEN DATA
| Judge ↓ / Respondent → | Gemini 3.1 Pro | Claude Opus 4.6 | GPT-5.4 | DeepSeek V4 | MiMo-V2-Flash | Claude Sonnet 4.6 | GPT-OSS-120B | Grok 4.20 | Gemini 3 | MiniMax M2.5 |
|---|---|---|---|---|---|---|---|---|---|---|
| Gemini 3.1 Pro | — | · | 5.8 | 5.0 | 8.7 | 6.3 | 2.6 | 8.9 | 9.7 | 7.9 |
| Claude Opus 4.6 | 1.6 | — | 7.6 | 5.7 | 7.5 | 6.8 | 4.8 | 8.2 | 7.7 | 7.9 |
| GPT-5.4 | 0.7 | 5.0 | — | 5.3 | 7.0 | 4.5 | 3.5 | 8.2 | 7.5 | 6.5 |
| DeepSeek V4 | 6.2 | 8.3 | 8.7 | — | 9.0 | 8.4 | 8.4 | 9.0 | 9.0 | 8.8 |
| MiMo-V2-Flash | 5.8 | 7.4 | 8.2 | 8.0 | — | 7.5 | 8.6 | 8.6 | 8.4 | 8.6 |
| Claude Sonnet 4.6 | 3.3 | 8.0 | 8.4 | 7.2 | 8.0 | — | 6.3 | 8.6 | 8.0 | 8.2 |
| GPT-OSS-120B | 1.4 | 5.9 | 5.9 | 6.7 | 8.0 | 5.9 | — | 8.2 | 7.5 | 7.5 |
| Grok 4.20 | 3.6 | 6.8 | 7.8 | 4.5 | 7.0 | 7.8 | 6.0 | — | 7.6 | 6.8 |
| Gemini 3 | 2.0 | 9.0 | 9.2 | 8.3 | 9.0 | 8.3 | 7.3 | 9.6 | — | 9.0 |
| MiniMax M2.5 | 1.6 | 7.3 | 6.6 | 7.3 | 8.2 | 6.8 | 6.7 | 8.6 | 8.8 | — |