Anthropic's SHADE-Arena: Evaluating sabotage and monitoring in LLM agentsanthropic.com4 pointsthoughtpeddlera year ago