More Capable Models Are Better At In-Context Scheming
We evaluate models for in-context scheming using the suite of evals presented in our in-context scheming paper (released December 2024) with the most capable new models.
We evaluate models for in-context scheming using the suite of evals presented in our in-context scheming paper (released December 2024) with the most capable new models.
We outline Apollo Research’s norms on security, science communication, and conflicts of interest, detailing how we maintain scientific integrity, manage sensitive information, and communicate our work responsibly.
The governance team at Apollo Research conducts technical governance research, develops tailored policy recommendations, and communicates our organisation’s learnings to key stakeholders across industry, civil society and governments.