Rohan Subramani

Hi, I'm Rohan! I aim to promote welfare and reduce suffering as much as possible for all sentient beings, which has led me to work on AGI safety research. I am particularly interested in foundation model agents (FMAs): systems like AutoGPT and Operator that equip foundation models with memory, tool use, and other affordances so they can perform multi-step tasks autonomously.

I am the founder of Aether, an independent research lab focused on foundation model agent safety. I'm also an incoming PhD student at the University of Toronto, where I will be supervised by Professor Zhijing Jin and continue to run Aether. Previously, I completed an undergrad in CS and Math at Columbia, where I helped run Columbia Effective Altruism and Columbia AI Alignment Club (CAIAC). I have done research internships with AI Safety Hub Labs (now LASR Labs), UC Berkeley's Center for Human-Compatible AI (CHAI), and the ML Alignment & Theory Scholars (MATS) program.

I love playing tennis, listening to rock and indie pop music, playing social deduction games, reading fantasy books, watching a fairly varied set of TV shows and movies, and playing the saxophone, among other things.


Papers

Higher-Order Beliefs in Incomplete Information MAIDs

R. Subramani*, J. Foxabbott*, F.R. Ward
AAMAS, 2025
A framework for reasoning about higher-order beliefs in multi-agent influence diagrams with incomplete information.

The Partially Observable Off-Switch Game

A. Garber*, R. Subramani*, L. Luu*, M. Bedaywi, S. Russell, S. Emmons
AAAI, 2025
Extending the AI off-switch game to partially observable settings, analyzing optimal policies for both human and AI.

Generalization Analogies: A Testbed for Generalizing AI Oversight to Hard-To-Measure Domains

J. Clymer, G. Baker, R. Subramani, S. Wang
Preprint
A testbed for evaluating whether AI oversight techniques generalize from easy-to-measure to hard-to-measure domains.

On The Expressivity of Objective-Specification Formalisms in Reinforcement Learning

R. Subramani*, M. Williams*, M. Heitmann*, H. Holm, C. Griffin, J. Skalse
ICLR, 2024
Analyzing the theoretical limitations of different frameworks for specifying objectives in RL.

Projects

Coding GPT-2 from scratch

Implemented a transformer-based language model from scratch to better understand the architecture.
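The core of a GPT-2-style implementation is causal self-attention. As a minimal sketch (not the actual project code, and single-head only for brevity), the attention step in NumPy looks roughly like this; the weight matrices `Wq`, `Wk`, `Wv` are placeholders for learned parameters:

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax.
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def causal_self_attention(x, Wq, Wk, Wv):
    """Single-head causal self-attention.

    x: (seq_len, d_model) token representations.
    Wq, Wk, Wv: (d_model, d_head) projection matrices.
    """
    q, k, v = x @ Wq, x @ Wk, x @ Wv
    scores = q @ k.T / np.sqrt(q.shape[-1])
    # Causal mask: position i may only attend to positions <= i.
    mask = np.triu(np.ones_like(scores, dtype=bool), k=1)
    scores[mask] = -1e9
    return softmax(scores) @ v
```

Because of the causal mask, the first token can only attend to itself, so its output equals its own value vector.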

Implementing basic (and not-so-basic) LLM agents

Built various autonomous agents powered by language models to explore their capabilities and limitations.
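A basic LLM agent is an observe-act loop: the model proposes an action, a tool executes it, and the observation is appended to the context. The sketch below is a generic illustration of that loop, not the project's code; the `model` callable and the `"tool: argument"` action format are assumptions for the example:

```python
def run_agent(model, tools, task, max_steps=5):
    """Minimal agent loop.

    model: callable mapping the prompt string to an action string,
           e.g. "calc: 2+3" to invoke a tool or "final: 5" to finish.
    tools: dict mapping tool names to callables on a string argument.
    """
    history = [f"Task: {task}"]
    for _ in range(max_steps):
        action = model("\n".join(history))
        if action.startswith("final:"):
            return action[len("final:"):].strip()
        name, _, arg = action.partition(":")
        tool = tools.get(name.strip())
        result = tool(arg.strip()) if tool else f"unknown tool: {name.strip()}"
        history.append(f"Action: {action}")
        history.append(f"Observation: {result}")
    return None  # step budget exhausted
```

Swapping `model` for a real API call and expanding `tools` (search, code execution, file access) recovers the usual foundation-model-agent pattern.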

Alignment Research Engineer Accelerator (ARENA) exercises

Completed technical exercises focused on AI alignment concepts and implementation.

Experimenting with neural network pruning

Investigated different approaches to reducing neural network size while maintaining performance.
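One of the simplest pruning baselines is global magnitude pruning: zero out the fraction of weights with the smallest absolute values. As a hedged sketch of that baseline (not necessarily the approach used in the project):

```python
import numpy as np

def magnitude_prune(w, sparsity):
    """Zero out roughly the `sparsity` fraction of smallest-magnitude weights.

    w: weight array; sparsity: fraction in [0, 1) to prune.
    Returns a pruned copy (ties at the threshold may prune slightly more).
    """
    k = int(w.size * sparsity)
    if k == 0:
        return w.copy()
    # k-th smallest absolute value becomes the pruning threshold.
    thresh = np.partition(np.abs(w).ravel(), k - 1)[k - 1]
    return w * (np.abs(w) > thresh)
```

Structured variants instead remove whole neurons or channels, trading some accuracy for hardware-friendly speedups.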