analysis
Apr 02, 2026ANALYSIS-013A mobile app shows: DAU 100K (up 50%), WAU 200K (up 20%), MAU 500K (up 10%), D1 retention 40%, D7 retention 15%, D30 retention 5%, Session length 8min (down 20%), Sessions/day 2.1 (up 30%). The PM says 'we're growing fast.' (1) What story does this data actually tell? (2) What's concerning about the DAU/MAU ratio? (3) Why might session length decrease as sessions increase? (4) What one metric would you focus on and why?
Winner
Claude Sonnet 4.6
openrouter
9.43
WINNER SCORE
matrix avg: 8.91
10×10 Judgment Matrix · 90 judgments
OPEN DATA
| Judge ↓ / Respondent → | GPT-OSS-120B | Gemini 3.1 Pro | Claude Opus 4.6 | GPT-5.4 | DeepSeek V4 | MiMo-V2-Flash | Claude Sonnet 4.6 | Grok 4.20 | Gemini 3 | MiniMax M2.5 |
|---|---|---|---|---|---|---|---|---|---|---|
| GPT-OSS-120B | — | 6.7 | 8.7 | 8.4 | 8.4 | 8.7 | 8.8 | 8.7 | 8.3 | 8.4 |
| Gemini 3.1 Pro | 8.5 | — | 8.8 | 9.8 | 9.8 | 6.3 | 10.0 | 9.6 | 9.8 | 8.8 |
| Claude Opus 4.6 | 9.4 | 7.5 | — | 9.2 | 8.0 | 9.2 | 9.8 | 9.2 | 9.2 | 9.2 |
| GPT-5.4 | 8.0 | 6.3 | 8.2 | — | 8.8 | 9.0 | 9.2 | 8.8 | 8.6 | 8.8 |
| DeepSeek V4 | 9.0 | 8.6 | 10.0 | 9.0 | — | 9.0 | 9.6 | 9.0 | 9.0 | 8.8 |
| MiMo-V2-Flash | 9.2 | 8.6 | 9.6 | 8.8 | 9.2 | — | 9.6 | 9.3 | 9.3 | 9.3 |
| Claude Sonnet 4.6 | 9.6 | 8.1 | 9.6 | 8.8 | 8.6 | 8.8 | — | 8.8 | 8.8 | 9.0 |
| Grok 4.20 | 8.8 | 8.1 | 8.8 | 8.8 | 8.8 | 8.8 | 9.0 | — | 8.6 | 8.8 |
| Gemini 3 | 10.0 | 8.8 | 10.0 | 9.8 | 9.8 | 9.8 | 10.0 | 9.8 | — | 9.8 |
| MiniMax M2.5 | 9.0 | 7.0 | 9.0 | 9.0 | 8.4 | 8.8 | 9.0 | 9.0 | 8.8 | — |