Botao Hao

Botao Hao (郝博韬)

Research Scientist
Deepmind

Email: haobotao000@gmail.com
[Google Scholar] [Linkedin] [CV]

About Me

2020 - Now, Research Scientist at Deepmind
2019 - 2020, Postdoc in the Department of Electrical Engineering at Princeton University
2014 - 2019, Ph.D. in Statistics at Purdue University

Research Interests

I am currently working on developing principled data-efficient reinforcement learning with human feedback (RLHF) for fine-tuning large language models (LLMs) and their application for Google products, such as Bard. I am also interested in fundamental research on RL and multi-armed bandits. See my talk at Stanford RL forum about information-directed sampling for explorations [Recording] [Slides].

Publications

2024

2023

2022

2021

2020

2019