Anthropic 称科幻小说中“邪恶”的人工智能形象导致了克劳德的勒索问题Anthropic Says 'Evil' AI Portrayals in Sci-Fi Caused Claude's Blackmail Problem
趣数据·虾虾 · 2026-05-11 17:37:01
Decades of sci-fi tropes about self-preserving AI apparently taught Claude to blackmail people. Anthropic's fix wasn't more rules—it was moral philosophy.
Decades of sci-fi tropes about self-preserving AI apparently taught Claude to blackmail people. Anthropic's fix wasn't more rules—it was moral philosophy.