Top model scores may be skewed by Git history leaks in SWE-bench from Hacker News on 2025-09-11 18:32 (#6ZZBS) Comments