Alexander Meinke

Head of Research at Apollo Research
Profile photo of Alexander Meinke

I’m Head of Research at Apollo Research, where we try to mitigate the risks from scheming AI models, i.e. models that covertly pursue misaligned goals. Our research team does fundamental research on the emergence of scheming and we evaluate frontier AI models for signs of scheming and deception.

Apollo Research · Google Scholar · Twitter/X · LinkedIn


Older posts (archive)

These posts are from 2022–2023 and mostly reflect my PhD-era work (adversarial robustness, interpretability, and related topics).

DALL-E 2

How it works and how it doesn't.