Botao Hao (郝博韬)
Research Scientist, Reasoning Team, OpenAI
I am a research scientist in the Reasoning Team at OpenAI. I work on using large-scale RL to train next-generation of reasoning models and build the smartest agent in Codex. I am a core contributor of o1 (o1 system card), o3 and GPT-5 series (GPT-5 system card). Previously, I was a Research Scientist at DeepMind and a Postdoc at Princeton University. See my talk at Stanford RL forum about information-directed sampling for explorations [Recording] [Slides].
About Me
- 2024-Present: Research Scientist, OpenAI
- 2020-2024: Research Scientist, DeepMind
- 2019-2020: Postdoc, Department of Electrical Engineering, Princeton University
- 2014-2019: Ph.D. in Statistics, Purdue University
Selected Publications
-
GPT-5 System Card
OpenAI, Aaditya Singh, Adam Fry, Adam Perelman, Adam Tart, Adi Ganesh, ... , Botao Hao, et al.
arXiv 2026. arXiv -
OpenAI o1 System Card
OpenAI, Aaron Jaech, Adam Kalai, Adam Lerer, Adam Richardson, Ahmed El-Kishky, ... , Botao Hao, et al.
arXiv 2024. arXiv -
Leveraging Demonstrations to Improve Online Learning: Quality Matters
Botao Hao, Rahul Jain, Tor Lattimore, Benjamin Van Roy, Zheng Wen
ICML 2023. arXiv -
Regret Bounds for Information-Directed Reinforcement Learning
Botao Hao, Tor Lattimore
NeurIPS 2022. arXiv -
Contextual Information-Directed Sampling
Botao Hao, Tor Lattimore, Chao Qin
ICML 2022. arXiv -
Online Sparse Reinforcement Learning
Botao Hao, Tor Lattimore, Csaba Szepesvari, Mengdi Wang
AISTATS 2021. arXiv poster -
Information Directed Sampling for Sparse Linear Bandits
Botao Hao, Tor Lattimore, Wei Deng
NeurIPS 2021 (spotlight). Proceedings slides -
High-Dimensional Sparse Linear Bandits
Botao Hao, Tor Lattimore, Mengdi Wang
NeurIPS 2020. arXiv slides poster -
Adaptive Exploration in Linear Contextual Bandit
Botao Hao, Tor Lattimore, Csaba Szepesvari
AISTATS 2020. arXiv slides -
Sparse and Low-rank Tensor Estimation via Cubic Sketchings
Botao Hao, Anru Zhang, Guang Cheng
IEEE Transactions on Information Theory (2020). arXiv slides -
Bootstrapping Upper Confidence Bound
Botao Hao, Yasin Abbasi-Yadkori, Zheng Wen, Guang Cheng
NeurIPS 2019. arXiv poster -
Simultaneous Clustering and Estimation of Heterogeneous Graphical Models
Botao Hao, Will Wei Sun, Yufeng Liu, Guang Cheng
Journal of Machine Learning Research. pdf slides