Salman Rahman
Salman Rahman

I am a Ph.D. student, co-advised by Pavel Izmailov at NYU and Saadia Gabriel at UCLA.

I am also a Student Researcher at Google DeepMind.

My research focuses on scalable supervision: methods for aligning AI systems that are becoming more capable than humans.

I am also interested in how reasoning emerges in language models, from pre-training to post-training.

Previously, I interned at Google Research on the scientific discovery team, Amazon's AGI RL Reasoning team, and Apple's machine learning team. Before UCLA, I worked at NYU on scalable oversight and AI safety.

Publications

2026
Understanding Reasoning from Pre-Training to Post-Training: Chess as a Controlled Testbed
2026
When Can LLMs Learn to Reason with Weak Supervision?
2026
CoDaS: AI Co-Data-Scientist for Biomarker Discovery via Wearable Sensors
2025
SPARK: Stepwise Process-Aware Rewards for Reference-Free Reinforcement Learning
2025
AI Debate Aids Assessment of Controversial Claims
2025
𝕏-Teaming: Multi-Turn Jailbreaks and Defenses with Adaptive Multi-Agents

News

Jul 2026
Joined Google DeepMind as a Student Researcher.
Mar 2026
Passed my oral qualification exam and advanced to PhD candidacy.
Jan 2026
Joined Google Research as a Student Researcher.
Sep 2025
AI Debate paper accepted at NeurIPS 2025.
Aug 2025
MOSAIC paper accepted at EMNLP 2025.
Jul 2025
X-Teaming paper accepted at COLM 2025.

Teaching

2025
Teaching Assistant — Natural Language Processing
CS 162: Natural Language Processing, UCLA
2023
Guest Lecturer — Foundation (Large) Language Models
CS-GY 9223: Foundations of Data Science, Graduate, NYU