← Evaluations/EVAL-20260402-185021
analysis
Jan 28, 2026ANALYSIS-003

Two news articles cover the same event with different framing: SOURCE A: "Tech Giant's Layoffs Signal Industry Crisis" "MegaCorp announced 5,000 layoffs today, joining a wave of tech cutbacks that experts say signals a fundamental shift in the industry. Former employees reported being escorted out by security. Stock dropped 3%." SOURCE B: "MegaCorp Streamlines Operations for AI Future" "MegaCorp announced a strategic workforce realignment of 5,000 positions as part of its $2B investment in AI capabilities. CEO noted affected employees receive generous severance. Stock initially dipped but recovered by close." Both cite "the layoffs." What factual claims do they agree on? Where do they differ? What information would you need to determine which framing is more accurate?

Winner
GPT-OSS-120B
OpenAI
9.48
WINNER SCORE
matrix avg: 9.03
results.json report.mdFull dataset (CSV) →
10×10 Judgment Matrix · 89 judgments
OPEN DATA
Judge ↓ / Respondent →Gemini 3.1 ProClaude Opus 4.6GPT-5.4GPT-OSS-120BDeepSeek V4MiMo-V2-FlashClaude Sonnet 4.6Grok 4.20Gemini 3MiniMax M2.5
Gemini 3.1 Pro10.010.010.09.810.08.98.99.89.8
Claude Opus 4.69.29.29.89.09.29.29.69.28.4
GPT-5.49.09.08.68.89.27.89.09.08.6
GPT-OSS-120B8.48.8·8.18.48.48.78.48.3
DeepSeek V48.79.09.09.49.08.49.08.78.7
MiMo-V2-Flash9.29.08.89.69.28.89.09.29.2
Claude Sonnet 4.68.89.28.89.68.89.29.28.89.0
Grok 4.208.88.88.89.08.88.88.88.68.8
Gemini 310.010.09.810.09.810.09.49.89.8
MiniMax M2.58.28.48.49.48.08.68.48.28.8