Discussion about this post

User's avatar
Tian's avatar

Spot on. The 'babysitting' problem is indeed the biggest bottleneck to AI adoption in engineering workflows. Moving from AI as an autocomplete to a truly autonomous agent requires better context injection and robust feedback loops. For those looking for tools specifically designed to help agents understand complex architectural contexts with less supervision, https://open-code.ai is exploring some very interesting solutions in this space. Great insights!

BB's avatar

You've highlighted a lot of the conversations I've been having with myself lately. That is where I am right now: 2-3 agents working on independent git worktrees, with me checking in when they reach a stopping point. Then there is a future I know exists: 10+ agents working, some on semi-autonomous tasks that still require my input, and some on small, completely autonomous tasks. Much of my time lately has been spent mapping how to get there and also fixing the spectacular ways I've failed to get there.

However, just over the last 6-8 weeks, I've gone from thinking the 10+ agent future is impossible and dumb to not only possible but desirable. Even as overall capability growth in the underlying models is slowing down, it feels like the change to our workflow is speeding up. I'm not sure if it's finally grappling with how to use the models or if there was some small threshold they needed to hit. Opus's ability to choose the correct tools in the correct circumstances has been a particular boon.

4 more comments...

No posts

Ready for more?