Replying to Joe Resident

Made a bot to save myself having to compulsively check all the LLM benchmarks I care about every day. Gonna add ARC-AGI when I get a chance.

Impressed by the new Gemini 2.5 Flash today, especially for such a small model!

nostr:nevent1qqs92mrhvyd4ydklp52xfxqcj0ta53ry60xlm4tqnrm3pmff2rrrk5spz4mhxue69uhhyetvv9ujuerpd46hxtnfduhsygrmn0qd0eq2lxdyhlunazy8z7wzzx6prp7h4t844hh4dldp0szfmgpsgqqqqqqsvylf6k

#devstr #vibecoding you might like this; it includes Aider Polyglot and SWE-bench Verified.

Joe Resident 8mo ago

Made this bot nostr:nevent1qqsftrumks3j5h4m6mp4zfmhcv7z4j0k9j9wd0znvzfsndfnywyzm7spz4mhxue69uhhyetvv9ujuerpd46hxtnfduhsyg9y8vq33ltjfyhjhggjjrxvkf6p3v0ahd7w8gfz6g55qnjh5avhtgpsgqqqqqqs7qg4mk
