Hmm, I guess it’s expensive, but I don’t see why agents in a Ralph loop, supervised by a comprehensive test suite, can’t deliver top-percentile results. You just iterate until you see the desired behavior. Where are the risks?
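For concreteness, here is a rough sketch of the loop I have in mind. The names `ralph_loop`, `agent_step`, and `tests_pass` are made up for illustration and don't come from any real tool: `agent_step` stands in for one agent run against the repo, and `tests_pass` for running the test suite.

```python
from typing import Callable, Optional


def ralph_loop(
    agent_step: Callable[[int], None],   # hypothetical: one agent run that edits the code
    tests_pass: Callable[[], bool],      # hypothetical: runs the suite, True if all green
    max_iterations: int = 50,            # budget cap, since each iteration costs money
) -> Optional[int]:
    """Repeat agent_step until tests_pass() is true.

    Returns the number of iterations used, or None if the budget ran out.
    """
    for iteration in range(1, max_iterations + 1):
        agent_step(iteration)
        if tests_pass():
            return iteration
    return None


# Toy usage: the "agent" just bumps a counter and the "suite" wants it to reach 3.
state = {"progress": 0}


def toy_step(iteration: int) -> None:
    state["progress"] += 1


iterations_used = ralph_loop(toy_step, lambda: state["progress"] >= 3)
```

The only stop condition is a green suite, so anything the tests don't measure goes unchecked by the loop.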

Discussion

Technical debt snowballing. The risk is lower for simpler projects. For example, one-off scripts that work carry zero risk.

Definitely, if humans aren’t in the loop. Code is cheap now, so it’s more engineering than developing.