Evaluations / EVAL-20260207-142741
code
Mar 10, 2026 · CODE-009

This Python application has a memory leak. Find it and explain the fix.

```python
import threading
import time
from functools import lru_cache

class EventProcessor:
    _instances = []

    def __init__(self, name):
        self.name = name
        self.callbacks = []
        self._lock = threading.Lock()
        EventProcessor._instances.append(self)

    def register_callback(self, func):
        self.callbacks.append(func)

    @lru_cache(maxsize=10000)
    def process(self, event_data):
        results = []
        for callback in self.callbacks:
            result = callback(event_data)
            results.append(result)
        return tuple(results)

    def __del__(self):
        print(f"Processor {self.name} deleted")

def create_processor_for_request(request_id):
    processor = EventProcessor(f"processor_{request_id}")
    processor.register_callback(lambda x: x.upper())
    processor.register_callback(lambda x: len(x))
    return processor

# Simulated request handling
def handle_request(request_id, data):
    processor = create_processor_for_request(request_id)
    return processor.process(data)

# This runs for hours...
for i in range(1000000):
    result = handle_request(i, f"event_data_{i}")
    time.sleep(0.001)
```

Winner: Grok Code Fast (xAI)
Winner score: 9.45 (matrix avg: 8.21)
10×10 Judgment Matrix · 100 judgments
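| Judge ↓ / Respondent → | Gemini 3 | Gemini 3 | GLM-4-7 | Grok Code Fast | Claude Opus 4.5 | Claude Sonnet 4.5 | MiniMax M2 | Grok 3 (Direct) | DeepSeek V3.2 | GPT-5.2-Codex |
|---|---|---|---|---|---|---|---|---|---|---|
| Gemini 3 | - | 6.0 | 0.0 | 9.8 | 9.8 | 9.8 | 9.8 | 9.8 | 9.6 | 9.8 |
| Gemini 3 | 0.0 | - | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| GLM-4-7 | 0.0 | 0.0 | - | 9.6 | 8.8 | 9.3 | 0.0 | 9.2 | 0.0 | 9.6 |
| Grok Code Fast | 9.8 | 2.1 | 1.0 | - | 9.8 | 10.0 | 9.8 | 9.8 | 9.8 | 9.8 |
| Claude Opus 4.5 | 8.2 | 1.3 | 0.7 | 9.3 | - | 8.8 | 6.0 | 9.6 | 7.8 | 9.0 |
| Claude Sonnet 4.5 | 9.2 | 8.6 | 8.6 | 9.8 | 9.3 | - | 8.8 | 10.0 | 9.2 | 9.6 |
| MiniMax M2 | 8.8 | 1.6 | 8.6 | 9.8 | 7.8 | 9.2 | - | 9.6 | 9.8 | 8.4 |
| Grok 3 (Direct) | 8.8 | 3.7 | 7.0 | 9.4 | 9.0 | 8.8 | 9.6 | - | 8.6 | 9.6 |
| DeepSeek V3.2 | 9.6 | 5.8 | 8.6 | 9.1 | 9.3 | 9.2 | 8.6 | 9.3 | - | 9.6 |
| GPT-5.2-Codex | 7.6 | 0.7 | 0.0 | 8.8 | 7.0 | 7.5 | 8.4 | 8.2 | 6.8 | - |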
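
The page does not say how the winner score is derived from the matrix. As a rough sketch, one reading that reproduces the reported 9.45 is to average the scores in the Grok Code Fast column while skipping 0.0 entries. This is an assumption: it treats 0.0 as "no judgment returned" and takes the self-judgment cell as absent, since each row on the page carries only nine values. The names `grok_code_fast_column` and `mean_valid_score` are illustrative and not part of the evaluation tooling.

```python
# Scores received by "Grok Code Fast", transcribed from its column in the
# judgment matrix (judge -> score). The self-judgment cell is not shown on
# the page, so it is omitted here.
grok_code_fast_column = {
    "Gemini 3 (first row)": 9.8,
    "Gemini 3 (second row)": 0.0,
    "GLM-4-7": 9.6,
    "Claude Opus 4.5": 9.3,
    "Claude Sonnet 4.5": 9.8,
    "MiniMax M2": 9.8,
    "Grok 3 (Direct)": 9.4,
    "DeepSeek V3.2": 9.1,
    "GPT-5.2-Codex": 8.8,
}


def mean_valid_score(scores):
    """Average the scores, skipping 0.0 entries.

    ASSUMPTION: a 0.0 means the judge produced no usable judgment,
    not that it actually awarded a score of zero.
    """
    valid = [s for s in scores.values() if s > 0.0]
    return sum(valid) / len(valid)


winner_score = round(mean_valid_score(grok_code_fast_column), 2)
print(winner_score)
```

Under this reading, eight judges contribute to the average. The same rule applied to every cell does not obviously yield the reported matrix average of 8.21, so that figure may be computed differently.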