AI LLM Has Trillion Dollar Use Case By Increasing Computer Programming Productivity
by Brian Wang from NextBigFuture.com on (#6KY5H)
Devin is an AI that was used on the SWE-bench, a challenging benchmark that asks agents to resolve real-world GitHub issues found in open source projects like Django and scikit-learn. Devin correctly resolves 13.86% of the issues end-to-end, far exceeding the previous state-of-the-art of 1.96%. Even when given the exact files to edit, the best ...