← Evaluations/EVAL-20260317-015707
code
Mar 17, 2026EVAL-20260317-015707

Write a function to reverse a string

Winner
Qwen 3.5 35B-A3B
openrouter
9.87
WINNER SCORE
matrix avg: 9.68
results.json report.mdFull dataset (CSV) →
10×10 Judgment Matrix · 43 judgments
OPEN DATA
Judge ↓ / Respondent →Qwen 3 8BQwen 3 32BQwen 3 Coder NextQwen 3.5 35B-A3BQwen 3.5 27BQwen 3.5 122B-A10BQwen 3.5 397B-A17BQwen 3.5 9B
Qwen 3 8B10.09.09.89.810.09.810.0
Qwen 3 32B9.89.49.810.08.810.010.0
Qwen 3 Coder Next9.89.89.89.89.89.89.8
Qwen 3.5 35B-A3B···9.69.39.88.5
Qwen 3.5 27B9.49.89.29.89.89.89.8
Qwen 3.5 122B-A10B·9.89.210.010.09.6·
Qwen 3.5 397B-A17B9.89.8·10.010.010.09.2
Qwen 3.5 9B·······