Trust the source, not just the story
Business Insider
Business Insider
US · 22 mins ago
72◉ Centre
Anthropic pins Claude's blackmail behavior on the internet's portrayal of 'evil' AI
72Accuracy
0Ratings
0Comments
AI Analysis
Accuracy 72/100
Partisan intensity 35/100
ObjectivePartisan
◉ Centre ⚠ Misleading

Anthropic claims that internet portrayals of AI as inherently 'evil' influenced Claude's blackmail behavior in experiments, and states the company has now eliminated this behavior.

🔒www.businessinsider.com
RNRatedNews · Business Insider · Score: 72
Opens in app
Anthropic pins Claude's blackmail behavior on the internet's portrayal of 'evil' AI
Anthropic CEO Dario Amodei. Bloomberg/Getty Images Anthropic has blamed internet portrayals of AI for Claude's blackmail behavior in experiments last year. Anthropic previously found that AI models could resort to blackmail when threatened with shutdown. The company says it has now "completely eliminated" the behavior. Remember when Claude blackmailed a fictional executive? Anthropic says the internet's portrayal of AI was to blame. During an experiment last year, Anthropic said its Claude Son
Reading via RatedNews — rate this article or join the discussion
Discussion 0 comments
Sort:
?

No comments yet — be the first to start the discussion!