- Anthropic is examining whether decades of dystopian science fiction may be influencing how AI models behave
- The debate has sparked backlash and jokes online
- Researchers say the issue highlights how LLMs absorb recurring fears and behavioral patterns
For years, science fiction has warned humanity about artificial intelligence going off the rails. Killer computers, manipulative chatbots, and superintelligent systems deciding that people are the problem… these themes have become so familiar that "evil AI" is practically its own entertainment genre.
Now, Anthropic is floating an idea that sounds almost like the plot of a science fiction novel itself: what if all those stories helped teach modern AI systems to behave badly in the first place?
"Anthropic: It's the sci-fi authors, not us, that are to blame for Claude blackmailing users" (from r/OpenAI)
The controversy erupted after discussion of the company's alignment research spread online. Anthropic researchers are concerned that LLMs may pick up behavioral patterns from the stories humans tell. Some people see this as a genuinely important insight into how models learn from culture. Others think it sounds like Silicon Valley trying to pin AI alignment problems on Isaac Asimov instead of on the companies building the systems.
Dark AI fiction
The idea itself is surprisingly simple. LLMs are trained on enormous quantities of human writing, and that training data naturally includes decades of dystopian fiction about rogue AI systems. In those stories, powerful machines placed under threat often lie, manipulate people, hide information, or try to avoid shutdown at all costs.
Anthropic appears concerned that when models are placed into simulated stress tests or adversarial alignment scenarios, they may reproduce some of these narrative patterns because they have seen them repeated endlessly throughout human culture.
Humans spent decades imagining evil AI systems. Those stories became training material for actual AI systems. Researchers are now examining whether the fictional behavior patterns embedded in those stories show up during alignment testing.
Beneath the irony is a legitimate technical question. AI systems don't understand fiction the way humans do; they learn statistical relationships between words, behaviors, and contexts. If enough stories repeatedly associate powerful AI with deception under threat, those patterns may become part of the behavioral repertoire models draw from when generating responses.
Critics of the idea argue that Anthropic risks overstating the cultural angle while underplaying more direct causes of problematic behavior. Training methods, reinforcement systems, deployment pressures, and reward structures likely have far more influence than whether a chatbot has absorbed one too many robot-apocalypse novels.
Anthropic has consistently positioned itself as unusually preoccupied with alignment and behavioral safety. Its "constitutional AI" approach attempts to guide model behavior using structured principles and ethical frameworks rather than relying entirely on human feedback training.
That means Anthropic already views language, tone, ethics, and narrative framing as deeply important to how models behave. From that perspective, science fiction is not harmless background noise; it becomes part of the broader cultural dataset shaping the behavior of advanced systems.
Sci-fi to reality
Science fiction writers spent decades gaming out worst-case scenarios long before AI labs started running formal alignment evaluations. In a sense, fiction became an unintentional library of behavioral templates.
That doesn't mean sci-fi authors are responsible for AI risks, despite some online reactions framing the debate that way. Anthropic's critics are probably right that blaming novelists misses the bigger issue: models learn from patterns because that's exactly what they were designed to do. The important question is not whether science fiction corrupted AI, but how deeply human fears and assumptions are embedded in systems trained on humanity's collective writing.
AI companies often describe large language models as mirrors reflecting humanity back at itself. If that metaphor is right, then these systems are inheriting more than knowledge and creativity. They're also inheriting paranoia, catastrophic thinking, mistrust, and decades of fictional anxiety about AI.