Evaluation EVAL-20260402-232138
communication
Apr 02, 2026 · COMM-018

Write a technical RFC proposing the migration of your company's authentication system from session-based to JWT-based. Include: (1) Problem statement with data, (2) Proposed solution with architecture diagram (ASCII), (3) Alternatives considered and why they were rejected, (4) Migration plan (phased, not big-bang), (5) Risks and mitigations, (6) Success criteria. Target audience: senior engineers who will implement it.

Winner: Grok 4.20 (openrouter)
Winner score: 9.25 · matrix avg: 8.30
10×10 Judgment Matrix · 89 judgments
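| Judge ↓ / Respondent → | Grok 4.20 | Claude Opus 4.6 | GPT-5.4 | Claude Sonnet 4.6 | Gemini 3.1 Pro | DeepSeek V4 | GPT-OSS-120B | MiMo-V2-Flash | Mistral Small | Seed 1.6 Flash |
|---|---|---|---|---|---|---|---|---|---|---|
| Grok 4.20 | | 8.8 | 8.8 | 8.8 | 8.8 | 8.8 | 8.8 | 8.6 | 9.2 | 8.1 |
| Claude Opus 4.6 | 9.6 | | 8.3 | 9.0 | 7.8 | 8.0 | 8.0 | 8.0 | 8.6 | 7.2 |
| GPT-5.4 | 8.6 | 4.8 | | 3.9 | 5.5 | 7.4 | 5.5 | 5.8 | 7.7 | 5.8 |
| Claude Sonnet 4.6 | 9.0 | 8.6 | 8.8 | | 8.3 | 8.4 | 8.8 | 8.2 | 9.3 | 8.6 |
| Gemini 3.1 Pro | 10.0 | 6.1 | 7.0 | 6.8 | | 8.2 | 6.8 | 6.4 | 9.6 | 7.7 |
| DeepSeek V4 | 9.2 | 9.0 | 9.0 | 8.6 | · | | 9.3 | 9.2 | 9.3 | 9.0 |
| GPT-OSS-120B | 8.8 | 6.8 | 7.0 | 6.3 | 6.3 | 9.0 | | 7.8 | 9.2 | 7.5 |
| MiMo-V2-Flash | 9.3 | 9.6 | 9.3 | 9.0 | 8.4 | 9.0 | 8.6 | | 9.6 | 8.6 |
| Mistral Small | 10.0 | 9.8 | 10.0 | 9.8 | 9.4 | 9.2 | 9.8 | 9.2 | | 9.4 |
| Seed 1.6 Flash | 8.8 | 7.6 | 8.4 | 8.0 | 7.8 | 8.8 | 8.8 | 8.2 | 9.0 | |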
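
The winner score and matrix average can be cross-checked against the judgment matrix. The sketch below is not the page's own scoring code. It assumes that a respondent's score is the unweighted mean of the scores it received from the other judges, that the matrix average is the mean over all scored cells, and that the diagonal (self-judgment) and the single "·" cell carry no score. The names `MODELS`, `SCORES`, `respondent_means`, and `matrix_mean` are illustrative.

```python
from statistics import mean

MODELS = [
    "Grok 4.20", "Claude Opus 4.6", "GPT-5.4", "Claude Sonnet 4.6",
    "Gemini 3.1 Pro", "DeepSeek V4", "GPT-OSS-120B", "MiMo-V2-Flash",
    "Mistral Small", "Seed 1.6 Flash",
]

# Rows are judges, columns are respondents, both in MODELS order.
# None marks a cell without a score: the diagonal (assumed: no
# self-judgment) and the single "·" cell in the DeepSeek V4 row.
N = None
SCORES = [
    [N, 8.8, 8.8, 8.8, 8.8, 8.8, 8.8, 8.6, 9.2, 8.1],
    [9.6, N, 8.3, 9.0, 7.8, 8.0, 8.0, 8.0, 8.6, 7.2],
    [8.6, 4.8, N, 3.9, 5.5, 7.4, 5.5, 5.8, 7.7, 5.8],
    [9.0, 8.6, 8.8, N, 8.3, 8.4, 8.8, 8.2, 9.3, 8.6],
    [10.0, 6.1, 7.0, 6.8, N, 8.2, 6.8, 6.4, 9.6, 7.7],
    [9.2, 9.0, 9.0, 8.6, N, N, 9.3, 9.2, 9.3, 9.0],
    [8.8, 6.8, 7.0, 6.3, 6.3, 9.0, N, 7.8, 9.2, 7.5],
    [9.3, 9.6, 9.3, 9.0, 8.4, 9.0, 8.6, N, 9.6, 8.6],
    [10.0, 9.8, 10.0, 9.8, 9.4, 9.2, 9.8, 9.2, N, 9.4],
    [8.8, 7.6, 8.4, 8.0, 7.8, 8.8, 8.8, 8.2, 9.0, N],
]


def respondent_means(scores):
    """Mean score each respondent (column) received, skipping empty cells."""
    return [
        mean(row[col] for row in scores if row[col] is not None)
        for col in range(len(scores[0]))
    ]


def matrix_mean(scores):
    """Mean over every scored cell in the matrix."""
    return mean(s for row in scores for s in row if s is not None)


means = respondent_means(SCORES)
winner = MODELS[means.index(max(means))]
judgments = sum(s is not None for row in SCORES for s in row)
print(winner, round(max(means), 2), round(matrix_mean(SCORES), 2), judgments)
```

Under these assumptions the script counts 89 scored cells and ranks Grok 4.20 first. Its averages (roughly 9.26 and 8.31) sit within 0.01 of the reported 9.25 and 8.30. A gap that small would be consistent with the page averaging unrounded scores, though the source does not confirm this.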