Summer 2024 Mentors
More to be announced soon.
-
Adam Shai
Co-Founder, Simplex
Track: Interpretability -
Adrià Garriga Alonso
Research Scientist, FAR AI
Track: Interpretability -
Akbir Khan
PhD candidate, UCL DARK; Research Analyst, CAIF
Track: Oversight/control -
Alex Turner
Research Scientist, Google DeepMind
Track: Interpretability -
Anthony DiGiovanni
Researcher, Center on Long-Term Risk
Track: Cooperative AI -
Arthur Conmy
Research Engineer, Google DeepMind
Track: Interpretability -
Brad Knox
Research Associate Professor, UT Austin
Track: Value Alignment -
Buck Shlegeris
CEO, Redwood Research
Track: Oversight/control -
Christian Schroeder de Witt
Postdoc Researcher, Oxford University
Track: Cooperative AI -
David Lindner
Research Scientist, Google DeepMind
Track: Oversight/control -
Erik Jenner
PhD Student, CHAI
Track: Interpretability -
Ethan Perez
Research Scientist, Anthropic
Track: Oversight/control -
Evan Hubinger
Research Scientist, Anthropic
Track: Evaluations -
Fabien Roger
Member of Technical Staff, Redwood Research
Track: Oversight/control -
Francis Rhys Ward
PhD Student, Imperial College London
Track: Evaluations -
Jake Mendel
Research Scientist, Apollo Research
Track: Interpretability -
Jason Gross
Technical Lead, Special Projects, ARC Theory
Track: Interpretability -
Jérémy Scheurer
Research Scientist, Apollo Research
Track: Evaluations -
Jesse Hoogland
Executive Director, Timaeus
Track: Interpretability -
Jessica Rumbelow
CEO, Leap Labs
Track: Interpretability -
Lee Sharkey
CSO, Apollo Research
Track: Interpretability -
Lisa Thiergart
Research Manager, MIRI
Track: Governance -
Lucius Bushnaq
Interpretability Team Lead, Apollo Research
Track: Interpretability -
Mantas Mazeika
PhD student, UUIC
Track: Oversight/control -
Marius Hobbhahn
CEO, Apollo Research
Track: Evaluations -
Matija Franklin
Research Associate, OpenAI
Track: Value Alignment -
Mauricio Baker
Technology and Security Policy Fellow, RAND
Track: Governance -
Micah Carroll
PhD Student, UC Berkeley CHAI
Track: Value Alignment -
Nandi Schoots
PhD Student, Safe and Trusted AI Centre
Track: Interpretability -
Neel Nanda
Research Engineer, Google DeepMind
Track: Interpretability -
Nico Miailhe
CEO, PRISM Eval
Track: AI Governance -
Owain Evans
Research Associate, Oxford University
Track: Evaluations -
Philip Moreira Tomei
Researcher, AI Objectives Institute
Track: Value Alignment -
Ruiqi Zhong
Research Scientist, Anthropic; PhD student, UC Berkeley
Track: Oversight/control -
Sebastian Farquhar
Research Scientist, Google DeepMind
Track: Oversight/control -
Shi Feng
Assistant Professor, George Washington University
Track: Oversight/control -
Stephen Casper
PhD student, MIT Algorithmic Alignment Group
Track: Interpretability -
Timothy Fist
Senior Technology Fellow, IFP; Adjunct Senior Fellow, CNAS
Track: AI Governance -
Tvsi Benson-Tilsen
Researcher, MIRI
Track: Cooperative AI