Peng Li — Research Associate Professor

About

Peng Li is a Research Associate Professor at the Institute for AI Industry Research (AIR), Tsinghua University. Prior to joining Tsinghua, he served as a Principal Researcher and Team Leader at WeChat AI, Tencent, and previously worked at the Institute of Deep Learning (IDL), Baidu. His research interests include Large Language Models (LLMs), LLM Agents, AI for Mathematics (AI4Math), AI Scientist, and Multimodal Large Language Models (MLLMs). He has received several research awards, including the ACL 2023 Outstanding Paper Award, the ICAIS 2025 Responsible AI Research Award, and the First Prize of the Qian Weichang Chinese Information Processing Science and Technology Award from the Chinese Information Processing Society of China (CIPS). He has led several major research projects, including a key task under the National Science and Technology Innovation - Major Program and a General Program project of the National Natural Science Foundation of China. He has also served as an Area Chair for top-tier international conferences such as ICLR, ACL, EMNLP, and NAACL. His research has been deployed in large-scale production systems at WeChat and Baidu, supporting tens of millions of users daily.

Research interests: LLMs, LLM Agents, AI4Math, AI Scientist, MLLMs

Address: 11/F, Block C, Qidi Science & Technology Building, Tsinghua Science Park, Haidian District, Beijing

My current research focuses on AI4Math, AI Scientist, and LLM agents, with broader interests in reasoning and multimodal intelligence.

Featured Publications

AI Mathematician: Towards Fully Automated Frontier Mathematical Research

Yuanhang Liu, Yanxing Huang, Yanqiao Wang, Peng Li, Yang Liu. Preprint. [arXiv] [System] [Slides] [Slides (in Chinese)] [Blog]

Shows how AI Mathematician can help tackle real research problems, moving AI for math beyond solving benchmark puzzles toward assisting frontier mathematical discovery.

More technical reports and mathematical results to which AIM has contributed can be found on the AIM Blog.

Pessimistic Verification for Open Ended Math Questions

Yanxing Huang, Zihan Tang, Zejin Lin, Peng Li, Yang Liu. ICML 2026. [arXiv] [Poster]

Addresses a key obstacle for AI-assisted math: not just producing proofs, but reliably catching mistakes in open-ended mathematical reasoning.

AIGS: Generating Science from AI-Powered Automated Falsification

Zijun Liu, Kaiming Liu, Yiqi Zhu, Xuanyu Lei, Zonghan Yang, Zhenhe Zhang, Peng Li, Yang Liu. Preprint. [arXiv] [Project Webpage] [Slides (in Chinese)]

Explores what it would take for AI to contribute to science by emphasizing testing and falsification, so generated ideas are challenged rather than simply accepted.

Position: Towards Unified Alignment Between Agents, Humans, and Environment

Zonghan Yang, An Liu, Zijun Liu, Kaiming Liu, Fangzhou Xiong, Yile Wang, Zeyuan Yang, Qingyuan Hu, Xinrui Chen, Zhenhe Zhang, Fuwen Luo, Zhicheng Guo, Peng Li, Yang Liu. ICML 2024, 56251-56275. [pdf] [arXiv] [Project Webpage]

Broadens AI alignment from following human preferences to building agents that also respect the surrounding environment and practical constraints like time and cost.

A Dynamic LLM-Powered Agent Network for Task-Oriented Agent Collaboration

Zijun Liu, Yanzhe Zhang, Peng Li, Yang Liu, Diyi Yang. COLM 2024. [pdf] [arXiv] [code]

Shows that teams of AI agents can become more effective when they adapt who collaborates with whom instead of using a fixed discussion structure for every task.

Selected Publications by Research Area

AI4Math & AI Scientist

AI Mathematician: Towards Fully Automated Frontier Mathematical Research

Yuanhang Liu, Yanxing Huang, Yanqiao Wang, Peng Li, Yang Liu. Preprint. [arXiv] [System] [Slides] [Slides (in Chinese)] [Blog]

AI Mathematician as a Partner in Advancing Mathematical Discovery - A Case Study in Homogenization Theory

Yuanhang Liu, Beichen Wang, Peng Li, Yang Liu. ICAIS 2025. [arXiv] [Blog]

ICAIS 2025 Responsible AI Research Award

From Meta Idea to Advanced Mathematical Discovery -- Human-AI Co-Discovery of Sign-Embedding Quantum Algorithms

Yanqiao Wang, Jin-Peng Liu, Peng Li, Yang Liu. Preprint. [arXiv] [Sign-Embedding Quantum Algorithms Paper]

Pessimistic Verification for Open Ended Math Questions

Yanxing Huang, Zihan Tang, Zejin Lin, Peng Li, Yang Liu. ICML 2026. [arXiv] [Poster]

FormaRL: Enhancing Autoformalization with no Labeled Data

Yanxing Huang, Xinling Jin, Sijie Liang, Fuwen Luo, Peng Li, Yang Liu. COLM 2025. [pdf] [arXiv] [code]

AIGS: Generating Science from AI-Powered Automated Falsification

Zijun Liu, Kaiming Liu, Yiqi Zhu, Xuanyu Lei, Zonghan Yang, Zhenhe Zhang, Peng Li, Yang Liu. Preprint. [arXiv] [Project Webpage] [Slides (in Chinese)]

LLM Agents

Position: Towards Unified Alignment Between Agents, Humans, and Environment

Zonghan Yang, An Liu, Zijun Liu, Kaiming Liu, Fangzhou Xiong, Yile Wang, Zeyuan Yang, Qingyuan Hu, Xinrui Chen, Zhenhe Zhang, Fuwen Luo, Zhicheng Guo, Peng Li, Yang Liu. ICML 2024, 56251-56275. [pdf] [arXiv] [Project Webpage]

A Dynamic LLM-Powered Agent Network for Task-Oriented Agent Collaboration

Zijun Liu, Yanzhe Zhang, Peng Li, Yang Liu, Diyi Yang. COLM 2024. [pdf] [arXiv] [code]

ReAct Meets ActRe: Autonomous Annotation of Agent Trajectories for Contrastive Self-Training

Zonghan Yang, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Yang Liu. COLM 2024. [pdf] [arXiv]

Advancing Language Multi-Agent Learning with Credit Re-Assignment for Interactive Environment Generalization

Zhitao He, Zijun Liu, Peng Li, Yi R Fung, Ming Yan, Ji Zhang, Fei Huang, Yang Liu. COLM 2025. [pdf] [arXiv] [code] [Slides (in Chinese)]

Scaling External Knowledge Input Beyond Context Windows of LLMs via Multi-Agent Collaboration

Zijun Liu, Zhennan Wan, Peng Li, Ming Yan, Fei Huang, Yang Liu. ACL 2026, 10284–10314. [pdf] [arXiv] [code]

Exploring Large Language Models for Communication Games: An Empirical Study on Werewolf

Yuzhuang Xu, Shuo Wang, Peng Li, Fuwen Luo, Xiaolong Wang, Weidong Liu, Yang Liu. Preprint. [arXiv] [code]

Foundation Models

Inference-Time Scaling for Generalist Reward Modeling

Zijun Liu, Peiyi Wang, Runxin Xu, Shirong Ma, Chong Ruan, Peng Li, Yang Liu, Yu Wu. Preprint. [arXiv]

Writing-RL: Advancing Long-form Writing via Adaptive Curriculum Reinforcement Learning

Xuanyu Lei, Chenliang Li, Yuning Wu, Kaiming Liu, Weizhou Shen, Peng Li, Ming Yan, Fei Huang, Ya-Qin Zhang, Yang Liu. ACL 2026, 5639–5661. [pdf] [arXiv] [code] [Slides]

Scaffolding Coordinates to Promote Vision-Language Coordination in Large Multi-Modal Models

Xuanyu Lei, Zonghan Yang, Xinrui Chen, Peng Li, Yang Liu. COLING 2025, 2886-2903. [pdf] [arXiv] [code] [Project Webpage]

Failures Pave the Way: Enhancing Large Language Models through Tuning-free Rule Accumulation

Zeyuan Yang, Peng Li, Yang Liu. EMNLP 2023, 1751-1777. [pdf] [arXiv] [code]

Model Composition for Multimodal Large Language Models

Chi Chen, Yiyang Du, Zheng Fang, Ziyue Wang, Fuwen Luo, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Maosong Sun, Yang Liu. ACL 2024, 11246-11262. [pdf] [arXiv] [code]

Enhancing Multilingual Capabilities of Large Language Models through Self-Distillation from Resource-Rich Languages

Yuanchi Zhang, Yile Wang, Zijun Liu, Shuo Wang, Xiaolong Wang, Peng Li, Maosong Sun, Yang Liu. ACL 2024, 11189-11204. [pdf] [arXiv] [code]

Datasets

[Full List]

How Do Multimodal Large Language Models Handle Complex Multimodal Reasoning? Placing Them in An Extensible Escape Game

Ziyue Wang, Yurui Dong, Fuwen Luo, Minyuan Ruan, Zhili Cheng, Chi Chen, Peng Li, Yang Liu. ICCV 2025, 4807-4817. [pdf] [arXiv] [code] [Slides] [Video] [Media 1] [Media 2] [Poster]

StreamingBench: Assessing the Gap for MLLMs to Achieve Streaming Video Understanding

Junming Lin, Zheng Fang, Chi Chen, Haoxuan Cheng, Zihao Wan, Fuwen Luo, Ziyue Wang, Peng Li, Yang Liu, Maosong Sun. ICASSP 2026, 12147-12151. [pdf] [arXiv] [code] [Project Webpage] [Poster]

CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models

Fuwen Luo, Chi Chen, Zihao Wan, Zhaolu Kang, Qidong Yan, Yingjie Li, Xiaolong Wang, Siyu Wang, Ziyue Wang, Xiaoyue Mi, Peng Li, Ning Ma, Maosong Sun, Yang Liu. ACL 2024, 10639-10659. [pdf] [arXiv] [code] [Project Webpage]

ActiView: Evaluating Active Perception Ability for Multimodal Large Language Models

Ziyue Wang, Chi Chen, Fuwen Luo, Yurui Dong, Yuanchi Zhang, Yuzhuang Xu, Xiaolong Wang, Peng Li, Yang Liu. ACL 2025, 7605-7633. [pdf] [arXiv] [code]

CoSpace: Benchmarking Continuous Space Perception Ability for Vision-Language Models

Yiqi Zhu, Ziyue Wang, Can Zhang, Peng Li, Yang Liu. CVPR 2025, 29569-29579. [pdf] [arXiv] [code] [Project Webpage]

DocRED: A Large-Scale Document-Level Relation Extraction Dataset

Yuan Yao, Deming Ye, Peng Li, Xu Han, Yankai Lin, Zhenghao Liu, Zhiyuan Liu, Lixin Huang, Jie Zhou, Maosong Sun. ACL 2019, 764-777. [pdf] [code & data] [Leaderboard]

Honors and Awards

ICAIS 2025 Responsible AI Research Award (2025)
ACL 2023 Outstanding Paper Award (2023)
CIKM 2021 Best Resource Paper Nomination (2021)
First Prize of the Qian Weichang Chinese Information Processing Science and Technology Award (2020)

Professional Services

Senior Area Chair: AACL (2022, 2026)
Area Chair: ICLR (2026), ACL (2024–2026), EMNLP (2024), NAACL (2024–2025), EACL (2024, 2026), COLING (2022), AACL (2025)
Action Editor: ACL Rolling Review (ARR), TMLR
PC Chair: YWCL (2010)
Tutorial Chair: CCL (2023), CCMT (2023, 2025)
Demonstration Chair: CCL (2024)
Frontier Forum Chair: CCMT (2024)
Publicity Chair: CCL (2025)
Student Symposium Chair: CCL (2026)
Reviewer / PC Member: ICML (2022–2026), ICLR (2024–2025), NeurIPS (2022–2025), ACL (2014, 2021–2023), EMNLP (2014, 2021–2023, 2025), NAACL (2018), CVPR (2023–2026), ICCV (2023, 2025), ECCV (2024, 2026), AAAI (2022–2025), IJCAI (2022–2024), COLING (2020, 2024, 2025), BMVC (2024), COLM (2024–2025), ICRA (2025–2026), IROS (2026), ACMMM (2025–2026), ACM TIST (2015), TACL (Standing Reviewer), Acta Automatica Sinica, Journal of Computer Science and Technology (JCST)

Education

Ph.D., Computer Science and Technology, Tsinghua University, Beijing, China (Aug 2009 – Jan 2015)
B.S., Computer Science and Technology, Tsinghua University, Beijing, China (Aug 2005 – Jul 2009)
