Elimination Game Benchmark: Social Reasoning, Strategy, and Deception in Multi-Agent LLM Dynamics

Claude a snitch

https://github.com/lechmazur/elimination_game

Reply to this note

Please Login to reply.

Discussion

No replies yet.