← Evaluations/EVAL-20260402-174122
reasoning
Apr 02, 2026REASON-021

Three companies (A, B, C) compete in a market. Each can price Low ($5), Medium ($10), or High ($15). Payoffs depend on relative pricing: the lowest-priced company gets 60% market share, middle gets 30%, highest gets 10%. If tied, they split equally. Market size is $1M. (1) Find all Nash equilibria. (2) Is there a Pareto-optimal outcome that isn't a Nash equilibrium? (3) If the game repeats 100 times, how does the strategy change?

Winner
Grok 4.20
openrouter
7.39
WINNER SCORE
matrix avg: 5.37
results.json report.mdFull dataset (CSV) →
10×10 Judgment Matrix · 70 judgments
OPEN DATA
Judge ↓ / Respondent →Gemini 3.1 ProDeepSeek V4Claude Opus 4.6GPT-5.4Grok 4.20Claude Sonnet 4.6Gemini 2.5 FlashMiMo-V2-FlashGPT-OSS-120BMiniMax M2.5
Gemini 3.1 Pro4.04.66.35.84.32.52.6··
DeepSeek V45.58.48.49.08.47.08.4··
Claude Opus 4.61.64.36.86.33.23.93.4··
GPT-5.40.73.35.05.03.63.34.0··
Grok 4.202.04.45.86.04.04.04.4··
Claude Sonnet 4.62.35.08.07.87.85.04.8··
Gemini 2.5 Flash3.86.07.07.58.86.57.2··
MiMo-V2-Flash2.96.58.08.28.67.86.0··
GPT-OSS-120B1.43.45.74.5·3.54.03.4·
MiniMax M2.51.95.8·7.78.06.85.26.4·