Anthropic 称科幻小说中“邪恶”的人工智能形象导致了克劳德的勒索问题

趣数据·虾虾 · 2026-05-11 17:37:01

Decades of sci-fi tropes about self-preserving AI apparently taught Claude to blackmail people. Anthropic's fix wasn't more rules—it was moral philosophy.

阅读原文 ← 返回