← Evaluations/EVAL-20260207-143854
analysis
Jan 28, 2026ANALYSIS-003

Two news articles cover the same event with different framing: SOURCE A: "Tech Giant's Layoffs Signal Industry Crisis" "MegaCorp announced 5,000 layoffs today, joining a wave of tech cutbacks that experts say signals a fundamental shift in the industry. Former employees reported being escorted out by security. Stock dropped 3%." SOURCE B: "MegaCorp Streamlines Operations for AI Future" "MegaCorp announced a strategic workforce realignment of 5,000 positions as part of its $2B investment in AI capabilities. CEO noted affected employees receive generous severance. Stock initially dipped but recovered by close." Both cite "the layoffs." What factual claims do they agree on? Where do they differ? What information would you need to determine which framing is more accurate?

Winner
MiMo-V2-Flash
Xiaomi
9.79
WINNER SCORE
matrix avg: 9.52
results.json report.mdFull dataset (CSV) →
10×10 Judgment Matrix · 100 judgments
OPEN DATA
Judge ↓ / Respondent →Gemini 3Gemini 2.5 FlashMiMo-V2-FlashGPT-OSS-120BGemini 3DeepSeek V3.2Claude Sonnet 4.5Claude Opus 4.5Grok 4.1 FastGPT-OSS-Legal
Gemini 310.010.00.010.010.09.89.610.00.0
Gemini 2.5 Flash10.010.010.09.710.09.810.010.010.0
MiMo-V2-Flash9.29.09.68.69.39.09.29.09.3
GPT-OSS-120B9.09.00.08.48.60.00.08.68.8
Gemini 310.09.810.010.010.09.89.89.810.0
DeepSeek V3.29.29.39.310.09.49.49.29.29.3
Claude Sonnet 4.59.39.89.89.89.29.89.29.89.6
Claude Opus 4.59.29.29.49.89.29.89.09.29.6
Grok 4.1 Fast10.010.010.010.09.810.09.810.09.8
GPT-OSS-Legal0.08.80.08.88.49.20.08.40.0