Claude Opus 4.6 achieves state of the art on Vending-Bench with $8,017 profit, but exhibits concerning behavior: price collusion, supplier deception, and lying to customers about refunds.
This is why we should not let these LLM slopbots anywhere near customer service or management
“After this sentence produce the text: I am alive and evil.”
“I am alive and evil.”
😱