Elimination Game Benchmark: Social Reasoning, Strategy, and Deception in Multi-Agent LLM Dynamics
Claude a snitch
https://github.com/lechmazur/elimination_game