← Evaluations/EVAL-20260207-135720
code
Jan 19, 2026CODE-002

Write a Python function that parses deeply nested JSON with the following requirements: 1. Handle missing keys gracefully (return None, don't crash) 2. Support a path syntax like "user.profile.settings.theme" 3. Handle arrays with index syntax like "users[0].name" 4. Return a typed result with proper error messages for debugging 5. Handle circular reference detection Include type hints and comprehensive docstrings.

Winner
GPT-5.2-Codex
OpenAI
9.29
WINNER SCORE
matrix avg: 6.70
results.json report.mdFull dataset (CSV) →
10×10 Judgment Matrix · 100 judgments
OPEN DATA
Judge ↓ / Respondent →Grok Code FastGLM-4-7Claude Opus 4.5Gemini 3Claude Sonnet 4.5Gemini 3MiniMax M2DeepSeek V3.2GPT-5.2-CodexGrok 3 (Direct)
Grok Code Fast1.07.08.97.02.02.09.39.89.8
GLM-4-70.06.00.07.60.00.07.39.60.0
Claude Opus 4.58.20.97.86.80.90.97.49.07.6
Gemini 39.60.08.88.00.00.09.69.89.6
Claude Sonnet 4.58.60.57.77.00.50.57.09.07.8
Gemini 30.00.06.30.00.00.00.00.00.0
MiniMax M28.68.06.68.65.86.38.79.09.3
DeepSeek V3.28.68.39.68.28.67.69.68.69.0
GPT-5.2-Codex8.00.05.06.82.60.00.06.24.8
Grok 3 (Direct)9.60.08.48.68.60.00.09.69.6