How Can We Trust What We Don’t Understand?
I’ve been thinking about a question raised by Dario Amodei, CEO of Anthropic, in his recent piece, “The Urgency of Interpretability.” He writes about the increasing power of artificial intelligence systems and our unsettling lack of insight into how they actually work. The models are getting stronger. Our ability to understand them is not. This isn’t just … Read more