Replying to Shawn

Jetpacks for software engineers.

“We evaluated Devin on SWE-bench, a challenging benchmark that asks agents to resolve real-world GitHub issues found in open source projects like Django and scikit-learn.

Devin correctly resolves 13.86%* of the issues end-to-end, far exceeding the previous state-of-the-art of 1.96%. Even when given the exact files to edit, the best previous models can only resolve 4.80% of issues.”

https://www.cognition-labs.com/introducing-devin

Alan Siefert 1y ago

It won’t take software engineers’ jobs. It will enable them to build bigger things in less time. 🫡

Discussion

Alan Siefert 1y ago

Looking forward to the open source equivalent.
