Whenever I vibecode UIs it feels like the agent is flying blind because it can't see what's being rendered.

Have any of you tried integrating a browser-automation MCP like this one? Seems like this could really help the agent QA its work.

There are already some MCP servers that should do the job: take a screenshot of the browser and return it.

Yeah, the one I linked does that. I just wonder whether it actually helps.

Even if it takes a screenshot, wouldn't it only be getting the text ripped out of the image?

My understanding is that the inputs are always reduced to a string of tokens. But some feedback would be better than nothing.

You can feed images to the Cursor agent.
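
On the "text ripped out of the image" worry: as far as I know, an MCP tool result can carry an image content block (base64 data plus a MIME type), so a screenshot does not have to be turned into text before it goes back to the client. Whether it helps still depends on the client and model accepting image input. Here is a rough sketch of that packaging step, not any particular server's code. The helper name and the placeholder bytes are made up for illustration.

```python
import base64

# Every PNG file starts with this fixed 8-byte signature.
PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def screenshot_to_image_content(png_bytes: bytes) -> dict:
    """Wrap raw PNG bytes in an MCP-style image content block.

    Hypothetical helper: a real server would get png_bytes from its
    browser-automation layer.
    """
    if not png_bytes.startswith(PNG_SIGNATURE):
        raise ValueError("expected PNG data")
    return {
        "type": "image",
        "data": base64.b64encode(png_bytes).decode("ascii"),
        "mimeType": "image/png",
    }


# Placeholder bytes standing in for a real screenshot (assumption).
fake_png = PNG_SIGNATURE + b"placeholder-pixels"
content = screenshot_to_image_content(fake_png)
print(content["type"], content["mimeType"], len(content["data"]))
```

The point is just that the payload is the image itself, not OCR output, so the agent can in principle check layout and styling as well as text.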