Biography

I am a principle researcher at Tencent AI Lab, specializing in Search and Reinforcement Learning (RL) to enhance the reasoning capabilities of large language models (LLMs) and developing highly autonomous agents. I have been working in the field of Natural Language Processing (NLP) since 2010. I won the Best Long Paper Award (1 out of 350 accepted long research papers) at NAACL 2021. I hold a Ph.D. in Computer Science from the University of Rochester (UofR), where I was advised by Professor Daniel Gildea, and a master’s degree from the Institute of Computing Technology, Chinese Academy of Sciences (ICT/CAS), under the mentorship of Dr. Qun Liu.

Recent News

2025.05 Checkout our recent work on automatic theorem proving and math agent: HunyuanProver and MPS-prover.
2024.11 Checkout our recent work on improving the scaling efficiency of search and learning: AlphaLLM (Neurips 2024), LiteSearch (AAAI 2025), StepwiseCPL and Don’t Get Lost in the Trees (ACL 2025).
2024.02 I will serve as an Action Editor of ACL ARR in 2024.
2023.09 Checkout our recent papers on industrial-level post-training practices: Stabilizing RLHF, Collaborative Decoding and Reward Stability in RLHF (ICLR 2024).
2022.04 I’m honored to serve as a Senior Area Chair at EMNLP 2022.
2021.05 I’m so excited that our paper “Video-aided Unsupervised Grammar Induction” was selected as the Best Long Paper at the 2021 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL).
2020.12 I will serve as a senior PC member (≈Area Chair) at IJCAI 2021.
2020.11 I will join the editorial team of TACL, a top journel focusing on NLP.
2020.09 I will give a talk about the current progress on AMR parsing and AMR-to-text generation at CCL 2020.
2020.03 I will serve as an Area Chair for the “Semantics” track of EMNLP 2020.