My focus has been on building tools that are user friendly to both humans and agents.
Here's an example of an agent performing a task for me. It's scheduling book reviews, because I have a lot of content but don't want to flood the feeds. So we can watch it use the tool and then review its work.
However, this example is particularly important because it gets confused and resorts to searching the source code for how my tools work. This indicates a failure in the context engineering, or the --help flags. The tool should give enough information to the agent so that its confident in its work.
#Devstr

