OTelBench: AI struggles with simple SRE tasks (Opus 4.5 scores only 29%) from Hacker News on 2026-01-29 15:37 (#736G3) Comments