Show HN: A game/benchmark where AI bots hunt each other https://ift.tt/0iFbkpM

Show HN: A game/benchmark where AI bots hunt each other I've created a social deduction game for LLMs, in which the bots attempt to hunt each other. It's a Mafia group turing test: the models are told to find who the bot is - where, in fact and unbeknown to them, they are all bots. I did this a while back so models aren't the newest, and they are all non-thinking (for speed and token costs). Et voilà. https://hiding-robot.vercel.app/ January 8, 2026 at 01:59AM

No comments:

Show HN: Investor asks "what did engineering ship?" https://ift.tt/1DThgM2

Show HN: Investor asks "what did engineering ship?" Gitmore (https://gitmore.io) – dev visibility for founders and stakeholders. *...