Trustworthy AI

Evaluating AI for signs of Deception.
Profile image Alexander Meinke

What I do

I am a Research Scientist at Apollo Research. I work on developing evaluations for deceptive alignment.


Blog posts