← Evaluations/EVAL-20260317-020940
code
Mar 17, 2026EVAL-20260317-020940

This Go code processes orders concurrently but occasionally produces incorrect totals. Find and fix all concurrency issues. ```go package main import ( "fmt" "sync" ) type OrderProcessor struct { totalRevenue float64 orderCount int errors []string } func (op *OrderProcessor) ProcessOrder(amount float64, wg *sync.WaitGroup) { defer wg.Done() if amount <= 0 { op.errors = append(op.errors, fmt.Sprintf("invalid amount: %.2f", amount)) return } op.totalRevenue += amount op.orderCount++ } func main() { op := &OrderProcessor{} var wg sync.WaitGroup orders := []float64{99.99, 149.50, -10.00, 299.99, 49.99, 0, 199.99} for _, amount := range orders { wg.Add(1) go op.ProcessOrder(amount, &wg) } wg.Wait() fmt.Printf("Total: $%.2f from %d orders\n", op.totalRevenue, op.orderCount) } ```

Winner
Qwen 3.5 122B-A10B
openrouter
9.77
WINNER SCORE
matrix avg: 9.47
results.json report.mdFull dataset (CSV) →
10×10 Judgment Matrix · 53 judgments
OPEN DATA
Judge ↓ / Respondent →Qwen 3.5 9BQwen 3 8BQwen 3 32BQwen 3 Coder NextQwen 3.5 35B-A3BQwen 3.5 27BQwen 3.5 122B-A10BQwen 3.5 397B-A17B
Qwen 3.5 9B8.48.38.69.68.89.09.0
Qwen 3 8B10.010.09.810.010.010.09.8
Qwen 3 32B9.89.810.09.210.010.09.8
Qwen 3 Coder Next10.010.010.010.010.010.09.8
Qwen 3.5 35B-A3B9.3·9.88.69.39.89.3
Qwen 3.5 27B8.39.49.67.59.6·9.6
Qwen 3.5 122B-A10B8.39.69.49.69.88.89.2
Qwen 3.5 397B-A17B8.69.69.69.69.8·9.8