Avatar
Christopher David
5fd9af6fc667c81f8b26e127b4851c6132b7c2494e33121d9c7c39c271c81778
Founder, OpenAgents

Hey Dustin, v2 not yet open source, I step through the issues on the video but I’ll do a demo shortly on issues in an opensource repo — in fact if you have any demo issues I can test on please send

Episode 123: GitHub Issues to Pull Requests

We solve multiple GitHub issues without touching a line of code.

Our coding agent works on four issues simultaneously, submitting multiple pull requests within seconds of each other.

The future of coding is not in your editor.

https://stacker.news/items/664110/r/AtlantisPleb

Episode 122: Codebase Indexing via GraphRAG

We design a codebase indexer using GraphRAG, a new graph-based RAG algorithm from Microsoft.

We ask OpenAgents to run a basic version of this on our v2 codebase, identifying 24 topical 'communities' and associated files & functions.

https://stacker.news/items/653204/r/AtlantisPleb

Episode 121: SWE-bench Planning

We make a plan to win high score on the SWE-bench Verified benchmark.

We pull the 500 samples into a web UI for easy inspection -- super smooth thanks to Convex.dev! -- then decide to focus first on the psf/requests repo.

Next we index!

https://stacker.news/items/649106/r/AtlantisPleb

Episode 120: Exploring SWE-bench Verified

We talk smack about benchmarks but conclude they may finally be worth our time.

We do a dramatic reading of OpenAI's blog post then feed it to OpenAgents which sets up a new repo as benchmark workspace.

We're going for the high score!

https://stacker.news/items/647686/r/AtlantisPleb

Episode 119: v2 Beta Launch

We demo our new v2 chat & auto-coding interface, live now at http://openagents.com.

All users can now use all features for free with GPT-4o mini, or buy pay-as-you-go credits to use advanced models like Claude 3.5 Sonnet.

For a limited time, anyone can DM us here on X to receive $10 of credits for premium models. (Send us your username or email you signed up with.)

We are eager for feedback and feature requests! Please try it out and tell us how it can be more useful for you.

We have an ambitious roadmap with much more to come. And thanks to v2 here, we build faster than ever. 😎

Stay tuned!

P.S. Stacker News folks can reply below with your username or email for $20 of free credits -- to hold you over til we add Lightning payments later this week :)

https://stacker.news/items/646297

Episode 118: Version Two

We say goodbye to ChatGPT, Claude, Copilot & Cursor!

We introduce version 2 of OpenAgents: your all-in-one productivity dashboard.

v2 includes:

➡️ Chat with top AI models

➡️ Multiple simultaneous chats in draggable/resizable windows

➡️ Deep integration with any GitHub codebase: create/edit files, smart search via @greptileai, open pull requests

➡️ Web scraping via @firecrawl_dev

➡️ Pay-as-you-go via Stripe or Bitcoin ⚡️

👆That all goes live to the public on Monday, August 12.

And coming soon:

➡️ File uploads

➡️ Voice commands

➡️ Long-term agent memory

➡️ Long-running agents

➡️ Third-party plugins

➡️ Agent Store

➡️ Rev-share referrals

➡️ Making all Microsoft products irrelevant starting with Copilot, VSCode & GitHub

Stay tuned!

https://stacker.news/items/641215/r/AtlantisPleb

Episode 117: AutoDev Git Diffs

We extend our HUD UI to show beautiful git diffs of code changes made by AutoDev.

https://stacker.news/items/632779/r/AtlantisPleb

Episode 116: 3D Multiplayer

We demo an alpha build of the world's first Agent MMO, built in <3 days using AutoDev — now available at openagents.com.

https://stacker.news/items/631374/r/AtlantisPleb

Episode 115: AutoDev as Pair Programmer

We use AutoDev to edit the AutoDev codebase via the GitHub API. We see the changes in seconds.

The workflow is way faster than copy-pasting from Claude Artifacts!

https://stacker.news/items/629034/r/AtlantisPleb

Episode 114: Planning & Execution

We upgrade AutoDev's planning to include additional context from past memories and relevant files.

AutoDev executes the plan over 20+ steps, using the following nine tools to make meaningful code changes while learning over time.

Next up, submitting pull requests to GitHub!

- view_hierarchy - View file/folder hierarchy at path

- view_file - View file contents at path

- find_files - Find relevant files to query

- remember_insights - Perform a similarity search on previously recorded insights

- query_files_by_vector - Query files by vector similarity based on their summaries

- generate_plan - Generate a multi-step plan for complex tasks.

- retrieve_memories - Retrieve relevant memories from the memory stream

- add_memory - Add a new memory to the memory stream

- reflect_on_codebase - Generate higher-level insights about the codebase

https://stacker.news/items/623074/r/AtlantisPleb

Episode 113: Agent Memories & Reflections

We use AutoDev to implement the memory, retrieval and reflection systems from the Generative Agents paper.

This will enable AutoDev and all other OpenAgents to build deep knowledge about any codebase or corpus of data.

https://stacker.news/items/621598/r/AtlantisPleb

Episode 112: Using AutoDev

We review our new AutoDev product and use it to build a 3D canvas for a knowledge graph of codebase insights.

Watch on X: https://x.com/OpenAgentsInc/status/1815818389983564079

https://stacker.news/items/619769/r/AtlantisPleb

Episode 111: Heads-Up Display

We give AutoDev a heads-up display (HUD).

We use voice (via Groq) to search over our codebase with natural language (via Greptile), seeing a summary of each file in a separate HUD pane.

- Watch on X: https://x.com/OpenAgentsInc/status/1811428988385939793

- SN: https://stacker.news/items/604943

- Groq: https://groq.com/

- Greptile: https://www.greptile.com/

Episode 110: AutoDev Planner

We ask AutoDev to make a plan and it does.

https://stacker.news/items/593459/r/AtlantisPleb

Episode 109: Exploring OpenDevin & CodeAct

We learn how OpenDevin implements the CodeAct paper for agent planning and task execution.

Background 📜👉 https://docs.all-hands.dev/modules/usage/agents

https://stacker.news/items/588967/r/AtlantisPleb

Episode 108: GitHub File Explorer

We add a GitHub file explorer to our AutoDev agent workspace.

We review AutoDev's agentic loop and the building blocks we'll make next.

https://stacker.news/items/588715/r/AtlantisPleb

Mostly excited about WANIX as basis for an agent runtime because it enables agents to run local-first in a browser while enabling 1) read/write access to filesystems including GitHub, 2) compilation of new Go programs in the browser (because the Go compiler is shipped to the browser via WASM), 3) an in-browser dev pipeline for webapps and binaries, and 4) usage of our existing agent plugins also using WASM

tl;dr WANIX has all the building blocks needed for agentic recursive self-improvement

Episode 107: Codebase Indexing via Greptile

We write an Extism plugin for Greptile and consume it via our new Golang/HTMX frontend for AutoDev.

Claude Artifacts makes it fast!

- Greptile Extism plugin: https://github.com/OpenAgentsInc/plugin-greptile

- AutoDev UI: https://github.com/OpenAgentsInc/autodev

- Greptile: https://greptile.com/

- Extism: https://extism.org/

https://x.com/openagentsinc/status/1806002683649667592?s=46

Episode 106: Brainstorming Agentic Artifacts

We review a demo AutoDev+WANIX UI generated by Claude Artifacts.

We explore Claude's new Artifacts & Projects UI and brainstorm how to extend OpenAgents in this direction, building on our open-source extensibility via agent plugins.

https://x.com/openagentsinc/status/1805676785390305377?s=46