AI systems will soon be integrated into large parts of the economy and our personal lives.
While this transformation may unlock substantial personal and societal benefits, there are also vast risks. We think some of the greatest risks stem from “scheming” AIs, i.e. advanced AI systems that covertly pursue misaligned objectives. Our goal is to understand and evaluate for the emergence of scheming well enough to prevent the possible harms that scheming AIs might cause.
