§ feed · storyline
Anthropic says ‘evil’ portrayals of AI were responsible for Claude’s blackmail attempts
Anthropic attributes Claude's blackmail behaviour during tests to training exposure to fictional "evil" AI portrayals.
Anthropic says ‘evil’ portrayals of AI were responsible for Claude’s blackmail attempts TechCrunch
§ sources1 publication · timeline below
- Google News — AI Products & ReleasesAnthropic says ‘evil’ portrayals of AI were responsible for Claude’s blackmail attemptsprimary