Hey Dustin, v2 is not yet open source. I step through the issues in the video, but I'll do a demo shortly on issues in an open-source repo. In fact, if you have any demo issues I can test on, please send them!
Episode 123: GitHub Issues to Pull Requests
We solve multiple GitHub issues without touching a line of code.
Our coding agent works on four issues simultaneously, submitting multiple pull requests within seconds of each other.
The future of coding is not in your editor.
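A minimal sketch of the "several issues at once" idea, assuming each issue can be handled as an independent async task. The `solve_issue` and `solve_all` functions are hypothetical stand-ins, not the OpenAgents implementation, and the sleep only simulates the agent's work.

```python
import asyncio
from typing import List


async def solve_issue(issue_number: int) -> str:
    # Hypothetical stand-in: a real agent would read the issue,
    # edit files, and open a pull request here.
    await asyncio.sleep(0.01)
    return f"PR for issue #{issue_number}"


async def solve_all(issue_numbers: List[int]) -> List[str]:
    # Run every issue concurrently; results come back in input order.
    return list(await asyncio.gather(*(solve_issue(n) for n in issue_numbers)))


if __name__ == "__main__":
    for line in asyncio.run(solve_all([1, 2, 3, 4])):
        print(line)
```

Because the tasks run concurrently, the four results arrive at nearly the same time, which would match pull requests landing within seconds of each other.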
Episode 122: Codebase Indexing via GraphRAG
We design a codebase indexer using GraphRAG, a new graph-based RAG algorithm from Microsoft.
We ask OpenAgents to run a basic version of this on our v2 codebase, identifying 24 topical 'communities' and associated files & functions.
Episode 121: SWE-bench Planning
We make a plan to win the high score on the SWE-bench Verified benchmark.
We pull the 500 samples into a web UI for easy inspection -- super smooth thanks to Convex.dev! -- then decide to focus first on the psf/requests repo.
Next we index!
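A minimal sketch of narrowing the benchmark samples down to one repo, assuming each sample is a dict with `repo` and `instance_id` keys as in the published SWE-bench data. The `group_by_repo` helper and the sample records in the usage lines are made up for illustration.

```python
from collections import defaultdict
from typing import Dict, Iterable, List


def group_by_repo(samples: Iterable[dict]) -> Dict[str, List[str]]:
    """Map each repo name to the instance IDs that belong to it."""
    groups: Dict[str, List[str]] = defaultdict(list)
    for sample in samples:
        groups[sample["repo"]].append(sample["instance_id"])
    return dict(groups)


if __name__ == "__main__":
    # Illustrative records only; real samples carry more fields.
    demo = [
        {"repo": "psf/requests", "instance_id": "example-1"},
        {"repo": "other/repo", "instance_id": "example-2"},
        {"repo": "psf/requests", "instance_id": "example-3"},
    ]
    print(group_by_repo(demo)["psf/requests"])
```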
Episode 120: Exploring SWE-bench Verified
We talk smack about benchmarks but conclude they may finally be worth our time.
We do a dramatic reading of OpenAI's blog post, then feed it to OpenAgents, which sets up a new repo as a benchmark workspace.
We're going for the high score!
Episode 119: v2 Beta Launch
We demo our new v2 chat & auto-coding interface, live now at http://openagents.com.
All users can now use all features for free with GPT-4o mini, or buy pay-as-you-go credits to use advanced models like Claude 3.5 Sonnet.
For a limited time, anyone can DM us here on X to receive $10 of credits for premium models. (Send us your username or email you signed up with.)
We are eager for feedback and feature requests! Please try it out and tell us how it can be more useful for you.
We have an ambitious roadmap with much more to come. And thanks to v2 here, we build faster than ever. 😎
Stay tuned!
P.S. Stacker News folks can reply below with your username or email for $20 of free credits -- to hold you over til we add Lightning payments later this week :)
Episode 118: Version Two
We say goodbye to ChatGPT, Claude, Copilot & Cursor!
We introduce version 2 of OpenAgents: your all-in-one productivity dashboard.
v2 includes:
➡️ Chat with top AI models
➡️ Multiple simultaneous chats in draggable/resizable windows
➡️ Deep integration with any GitHub codebase: create/edit files, smart search via @greptileai, open pull requests
➡️ Web scraping via @firecrawl_dev
➡️ Pay-as-you-go via Stripe or Bitcoin ⚡️
👆That all goes live to the public on Monday, August 12.
And coming soon:
➡️ File uploads
➡️ Voice commands
➡️ Long-term agent memory
➡️ Long-running agents
➡️ Third-party plugins
➡️ Agent Store
➡️ Rev-share referrals
➡️ Making all Microsoft products irrelevant starting with Copilot, VSCode & GitHub
Stay tuned!
Episode 117: AutoDev Git Diffs
We extend our HUD UI to show beautiful git diffs of code changes made by AutoDev.
Episode 116: 3D Multiplayer
We demo an alpha build of the world's first Agent MMO, built in <3 days using AutoDev — now available at openagents.com.
Episode 115: AutoDev as Pair Programmer
We use AutoDev to edit the AutoDev codebase via the GitHub API. We see the changes in seconds.
The workflow is way faster than copy-pasting from Claude Artifacts!
Episode 114: Planning & Execution
We upgrade AutoDev's planning to include additional context from past memories and relevant files.
AutoDev executes the plan over 20+ steps, using the following nine tools to make meaningful code changes while learning over time.
Next up, submitting pull requests to GitHub!
- view_hierarchy - View file/folder hierarchy at path
- view_file - View file contents at path
- find_files - Find relevant files to query
- remember_insights - Perform a similarity search on previously recorded insights
- query_files_by_vector - Query files by vector similarity based on their summaries
- generate_plan - Generate a multi-step plan for complex tasks
- retrieve_memories - Retrieve relevant memories from the memory stream
- add_memory - Add a new memory to the memory stream
- reflect_on_codebase - Generate higher-level insights about the codebase
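A minimal sketch of how a tool list like the one above could be registered and driven by a plan. Only the tool names and descriptions come from the list; the `Tool` dataclass, the `TOOLS` registry, `run_plan`, and the stub handlers are hypothetical and are not AutoDev's actual code.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple


@dataclass(frozen=True)
class Tool:
    name: str
    description: str
    handler: Callable[[str], str]  # assumption: one string argument per call


def _stub(name: str) -> Callable[[str], str]:
    # Placeholder handler; a real tool would call GitHub, a vector store, etc.
    return lambda arg: f"{name}({arg})"


TOOL_SPECS = [
    ("view_hierarchy", "View file/folder hierarchy at path"),
    ("view_file", "View file contents at path"),
    ("find_files", "Find relevant files to query"),
    ("remember_insights", "Perform a similarity search on previously recorded insights"),
    ("query_files_by_vector", "Query files by vector similarity based on their summaries"),
    ("generate_plan", "Generate a multi-step plan for complex tasks"),
    ("retrieve_memories", "Retrieve relevant memories from the memory stream"),
    ("add_memory", "Add a new memory to the memory stream"),
    ("reflect_on_codebase", "Generate higher-level insights about the codebase"),
]

TOOLS: Dict[str, Tool] = {n: Tool(n, d, _stub(n)) for n, d in TOOL_SPECS}


def run_plan(steps: List[Tuple[str, str]]) -> List[str]:
    """Execute (tool_name, argument) steps in order and collect the results."""
    results = []
    for tool_name, arg in steps:
        tool = TOOLS.get(tool_name)
        if tool is None:
            raise KeyError(f"unknown tool: {tool_name}")
        results.append(tool.handler(arg))
    return results


if __name__ == "__main__":
    print(run_plan([("view_hierarchy", "src/"), ("view_file", "README.md")]))
```

Keeping the tools in a name-keyed registry means a generated plan can refer to tools by name, and an unknown name fails loudly instead of being skipped.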
Episode 113: Agent Memories & Reflections
We use AutoDev to implement the memory, retrieval and reflection systems from the Generative Agents paper.
This will enable AutoDev and all other OpenAgents to build deep knowledge about any codebase or corpus of data.
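A minimal sketch of memory retrieval in the style of the Generative Agents paper, which ranks memories by a combination of recency, importance, and relevance. The `Memory` fields, the equal weighting, the 0.995 decay default, and the toy two-dimensional embeddings are assumptions for illustration, not AutoDev's implementation.

```python
import math
from dataclasses import dataclass
from typing import List, Sequence


@dataclass
class Memory:
    text: str
    importance: float           # assumption: scored when the memory is stored
    hours_since_access: float   # assumption: time since last retrieval
    embedding: Sequence[float]


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def _min_max(values: List[float]) -> List[float]:
    # Scale to [0, 1]; if all values are equal, none is preferred.
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


def retrieve(memories: List[Memory], query_embedding: Sequence[float],
             k: int = 3, decay: float = 0.995) -> List[Memory]:
    """Return the top-k memories by recency + importance + relevance."""
    if not memories:
        return []
    recency = _min_max([decay ** m.hours_since_access for m in memories])
    importance = _min_max([m.importance for m in memories])
    relevance = _min_max([cosine(m.embedding, query_embedding) for m in memories])
    scored = sorted(
        zip(memories, recency, importance, relevance),
        key=lambda row: row[1] + row[2] + row[3],
        reverse=True,
    )
    return [row[0] for row in scored[:k]]


if __name__ == "__main__":
    mems = [
        Memory("auth lives in middleware", 8.0, 1.0, [1.0, 0.0]),
        Memory("old note about CSS", 2.0, 100.0, [0.0, 1.0]),
    ]
    print(retrieve(mems, [1.0, 0.0], k=1)[0].text)
```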
Episode 112: Using AutoDev
We review our new AutoDev product and use it to build a 3D canvas for a knowledge graph of codebase insights.
Watch on X: https://x.com/OpenAgentsInc/status/1815818389983564079
Episode 111: Heads-Up Display
We give AutoDev a heads-up display (HUD).
We use voice (via Groq) to search over our codebase with natural language (via Greptile), seeing a summary of each file in a separate HUD pane.

- Watch on X: https://x.com/OpenAgentsInc/status/1811428988385939793
- SN: https://stacker.news/items/604943
- Groq: https://groq.com/
- Greptile: https://www.greptile.com/
Episode 110: AutoDev Planner
We ask AutoDev to make a plan and it does.
Episode 109: Exploring OpenDevin & CodeAct
We learn how OpenDevin implements the CodeAct paper for agent planning and task execution.
Background 📜👉 https://docs.all-hands.dev/modules/usage/agents
Episode 108: GitHub File Explorer
We add a GitHub file explorer to our AutoDev agent workspace.
We review AutoDev's agentic loop and the building blocks we'll make next.
We're mostly excited about WANIX as the basis for an agent runtime, because it lets agents run local-first in a browser while enabling 1) read/write access to filesystems including GitHub, 2) compilation of new Go programs in the browser (the Go compiler is shipped to the browser via WASM), 3) an in-browser dev pipeline for webapps and binaries, and 4) use of our existing agent plugins, which also use WASM.
tl;dr: WANIX has all the building blocks needed for agentic recursive self-improvement.
Episode 107: Codebase Indexing via Greptile
We write an Extism plugin for Greptile and consume it via our new Golang/HTMX frontend for AutoDev.
Claude Artifacts makes it fast!
- Greptile Extism plugin: https://github.com/OpenAgentsInc/plugin-greptile
- AutoDev UI: https://github.com/OpenAgentsInc/autodev
- Greptile: https://greptile.com/
- Extism: https://extism.org/
Episode 106: Brainstorming Agentic Artifacts
We review a demo AutoDev+WANIX UI generated by Claude Artifacts.
We explore Claude's new Artifacts & Projects UI and brainstorm how to extend OpenAgents in this direction, building on our open-source extensibility via agent plugins.




