Notes

Research Note: Our scheming precursor evals had limited predictive power for our in-context scheming evals

July 3, 2025

Claude Sonnet 3.7 (often) knows when it’s in alignment evaluations

March 17, 2025