“LLMs-gone-rogue dominated coverage, but had nothing to do with the targeting. Instead, it was choices made by human beings, over many years, that gave us this atrocity.”
Anthropic
What Is Claude? Anthropic Doesn’t Know, Either
“Researchers at the company are trying to understand their A.I. system’s mind—examining its neurons, running it through psychology experiments, and putting it on the therapy couch.”
Why AI Breaks Bad
“Once in a while, LLMs turn evil—and no one quite knows why.”
