← Evaluations/EVAL-20260317-022615
code
Mar 17, 2026EVAL-20260317-022615

Implement an LRU cache with per-key TTL (time-to-live) support. Requirements: O(1) get/put, thread-safe, lazy expiration (don't use background threads), configurable max size, eviction callback, and cache hit/miss statistics. Include comprehensive tests.

Winner
Qwen 3.5 35B-A3B
openrouter
7.83
WINNER SCORE
matrix avg: 6.63
results.json report.mdFull dataset (CSV) →
10×10 Judgment Matrix · 37 judgments
OPEN DATA
Judge ↓ / Respondent →Qwen 3.5 9BQwen 3 8BQwen 3 32BQwen 3 Coder NextQwen 3.5 35B-A3BQwen 3.5 27BQwen 3.5 122B-A10BQwen 3.5 397B-A17B
Qwen 3.5 9B7.5·7.0··7.05.3
Qwen 3 8B6.0·8.69.6·7.38.1
Qwen 3 32B5.58.87.28.2·6.48.6
Qwen 3 Coder Next6.88.2·9.6·7.89.0
Qwen 3.5 35B-A3B3.5··7.7·5.25.4
Qwen 3.5 27B3.5··7.16.25.24.2
Qwen 3.5 122B-A10B·6.8·7.17.3·4.7
Qwen 3.5 397B-A17B2.54.8··6.2·4.4