
About

Peng Li is a Research Associate Professor at the Institute for AI Industry Research (AIR), Tsinghua University. Prior to joining Tsinghua, he served as a Principal Researcher and Team Leader at WeChat AI, Tencent, and previously worked at the Institute of Deep Learning (IDL), Baidu. His research interests include Large Language Models (LLMs), LLM Agents, AI for Mathematics (AI4Math), AI Scientist, and Multimodal Large Language Models (MLLMs). He has received several research awards, including the ACL 2023 Outstanding Paper Award, the ICAIS 2025 Responsible AI Research Award, and a CIKM 2021 Best Resource Paper Nomination. He has led several major research projects, including a key task under the National Science and Technology Innovation - Major Program. He has also served as an Area Chair for top-tier international conferences such as ICLR, ACL, EMNLP, and NAACL. His research has been deployed in large-scale production systems at WeChat and Baidu, supporting tens of millions of users daily. He is also a recipient of the First Prize of the Qian Weichang Chinese Information Processing Science and Technology Award from the Chinese Information Processing Society of China (CIPS).

LLMs, LLM Agents, AI4Math, AI Scientist, MLLMs

Address: 11/F, Block C, Qidi Science & Technology Building, Tsinghua Science Park, Haidian District, Beijing

Selected Publications by Research Area

Full Publication List
AI4Math and AI Scientist
AI Mathematician: Towards Fully Automated Frontier Mathematical Research
Yuanhang Liu, Yanxing Huang, Yanqiao Wang, Peng Li, Yang Liu. Preprint. [arXiv] [System] [Slides] [Slides (in Chinese)] [Blog]
AI Mathematician as a Partner in Advancing Mathematical Discovery - A Case Study in Homogenization Theory
Yuanhang Liu, Beichen Wang, Peng Li, Yang Liu. ICAIS 2025. [arXiv] [Blog]
ICAIS 2025 Responsible AI Research Award
Pessimistic Verification for Open Ended Math Questions
Yanxing Huang, Zihan Tang, Zejin Lin, Peng Li, Yang Liu. ICML 2026. [arXiv]
FormaRL: Enhancing Autoformalization with no Labeled Data
Yanxing Huang, Xinling Jin, Sijie Liang, Fuwen Luo, Peng Li, Yang Liu. COLM 2025. [pdf] [arXiv] [code]
AIGS: Generating Science from AI-Powered Automated Falsification
Zijun Liu, Kaiming Liu, Yiqi Zhu, Xuanyu Lei, Zonghan Yang, Zhenhe Zhang, Peng Li, Yang Liu. Preprint. [arXiv] [Project Webpage] [Slides (in Chinese)]
LLM Agents
Position: Towards Unified Alignment Between Agents, Humans, and Environment
Zonghan Yang, An Liu, Zijun Liu, Kaiming Liu, Fangzhou Xiong, Yile Wang, Zeyuan Yang, Qingyuan Hu, Xinrui Chen, Zhenhe Zhang, Fuwen Luo, Zhicheng Guo, Peng Li, Yang Liu. ICML 2024, 56251-56275. [pdf] [arXiv] [Project Webpage]
A Dynamic LLM-Powered Agent Network for Task-Oriented Agent Collaboration
Zijun Liu, Yanzhe Zhang, Peng Li, Yang Liu, Diyi Yang. COLM 2024. [pdf] [arXiv] [code]
ReAct Meets ActRe: Autonomous Annotation of Agent Trajectories for Contrastive Self-Training
Zonghan Yang, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Yang Liu. COLM 2024. [pdf] [arXiv]
Advancing Language Multi-Agent Learning with Credit Re-Assignment for Interactive Environment Generalization
Zhitao He, Zijun Liu, Peng Li, Yi R Fung, Ming Yan, Ji Zhang, Fei Huang, Yang Liu. COLM 2025. [pdf] [arXiv] [code] [Slides (in Chinese)]
Scaling External Knowledge Input Beyond Context Windows of LLMs via Multi-Agent Collaboration
Zijun Liu, Zhennan Wan, Peng Li, Ming Yan, Fei Huang, Yang Liu. ACL 2026. [arXiv] [code]
Exploring Large Language Models for Communication Games: An Empirical Study on Werewolf
Yuzhuang Xu, Shuo Wang, Peng Li, Fuwen Luo, Xiaolong Wang, Weidong Liu, Yang Liu. Preprint. [arXiv] [code]
Reasoning and Learning
Inference-Time Scaling for Generalist Reward Modeling
Zijun Liu, Peiyi Wang, Runxin Xu, Shirong Ma, Chong Ruan, Peng Li, Yang Liu, Yu Wu. Preprint. [arXiv]
Writing-RL: Advancing Long-form Writing via Adaptive Curriculum Reinforcement Learning
Xuanyu Lei, Chenliang Li, Yuning Wu, Kaiming Liu, Weizhou Shen, Peng Li, Ming Yan, Fei Huang, Ya-Qin Zhang, Yang Liu. ACL 2026. [arXiv] [code]
MUSEG: Reinforcing Video Temporal Understanding via Timestamp-Aware Multi-Segment Grounding
Fuwen Luo, Shengfeng Lou, Chi Chen, Ziyue Wang, Chenliang Li, Weizhou Shen, Jiyue Guo, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Yang Liu. ACL 2026. [arXiv] [code]
Enabling Stroke-Level Structural Analysis of Hieroglyphic Scripts without Language-Specific Priors
Fuwen Luo, Zihao Wan, Ziyue Wang, Yaluo Liu, Pau Tong Lin Xu, Xuanjia Qiao, Xiaolong Wang, Peng Li, Yang Liu. Findings of ACL 2026. [arXiv] [code] [Hugging Face]
Scaffolding Coordinates to Promote Vision-Language Coordination in Large Multi-Modal Models
Xuanyu Lei, Zonghan Yang, Xinrui Chen, Peng Li, Yang Liu. COLING 2025, 2886-2903. [pdf] [arXiv] [code] [Project Webpage]
Failures Pave the Way: Enhancing Large Language Models through Tuning-free Rule Accumulation
Zeyuan Yang, Peng Li, Yang Liu. EMNLP 2023, 1751-1777. [pdf] [arXiv] [code]
MLLMs
AdaMMS: Model Merging for Heterogeneous Multimodal Large Language Models with Unsupervised Coefficient Optimization
Yiyang Du, Xiaochen Wang, Chi Chen, Jiabo Ye, Yiru Wang, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Zhifang Sui, Maosong Sun, Yang Liu. CVPR 2025, 9413-9422. [pdf] [arXiv] [code] [Slides]
Model Composition for Multimodal Large Language Models
Chi Chen, Yiyang Du, Zheng Fang, Ziyue Wang, Fuwen Luo, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Maosong Sun, Yang Liu. ACL 2024, 11246-11262. [pdf] [arXiv] [code]
LVAgent: Long Video Understanding by Multi-Round Dynamical Collaboration of MLLM Agents
Boyu Chen, Zhengrong Yue, Siran Chen, Zikang Wang, Yang Liu, Peng Li, Yali Wang. ICCV 2025, 20237-20246. [pdf] [arXiv]
VideoChat-M1: Collaborative Policy Planning for Video Understanding via Multi-Agent Reinforcement Learning
Boyu Chen, Zikang Wang, Zhengrong Yue, Kainan Yan, Chenyun Yu, Yi Huang, Zijun Liu, Yafei Wen, Xiaoxin Chen, Yang Liu, Peng Li, Yali Wang. CVPR 2026, 33772-33783. [pdf] [arXiv]
Weakly Supervised Vision-and-Language Pre-training with Relative Representations
Chi Chen, Peng Li, Maosong Sun, Yang Liu. ACL 2023, 8341-8355. [pdf] [arXiv] [code]
Position-Enhanced Visual Instruction Tuning for Multimodal Large Language Models
Chi Chen, Ruoyu Qin, Fuwen Luo, Xiaoyue Mi, Peng Li, Maosong Sun, Yang Liu. Preprint. [arXiv] [code] [Demo]
Datasets
How Do Multimodal Large Language Models Handle Complex Multimodal Reasoning? Placing Them in An Extensible Escape Game
Ziyue Wang, Yurui Dong, Fuwen Luo, Minyuan Ruan, Zhili Cheng, Chi Chen, Peng Li, Yang Liu. ICCV 2025, 4807-4817. [pdf] [arXiv] [code] [Slides] [Video] [Media 1] [Media 2] [Poster]
Evaluating Time Awareness and Cross-modal Active Perception of Large Models via 4D Escape Room Task
Yurui Dong, Ziyue Wang, Shuyun Lu, Dairu Liu, Xuechen Liu, Fuwen Luo, Peng Li, Yang Liu. Preprint. [arXiv] [code]
StreamingBench: Assessing the Gap for MLLMs to Achieve Streaming Video Understanding
Junming Lin, Zheng Fang, Chi Chen, Haoxuan Cheng, Zihao Wan, Fuwen Luo, Ziyue Wang, Peng Li, Yang Liu, Maosong Sun. ICASSP 2026, 12147-12151. [pdf] [arXiv] [code] [Project Webpage]
ActiView: Evaluating Active Perception Ability for Multimodal Large Language Models
Ziyue Wang, Chi Chen, Fuwen Luo, Yurui Dong, Yuanchi Zhang, Yuzhuang Xu, Xiaolong Wang, Peng Li, Yang Liu. ACL 2025, 7605-7633. [pdf] [arXiv] [code]
CoSpace: Benchmarking Continuous Space Perception Ability for Vision-Language Models
Yiqi Zhu, Ziyue Wang, Can Zhang, Peng Li, Yang Liu. CVPR 2025, 29569-29579. [pdf] [arXiv] [code] [Project Webpage]
CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models
Fuwen Luo, Chi Chen, Zihao Wan, Zhaolu Kang, Qidong Yan, Yingjie Li, Xiaolong Wang, Siyu Wang, Ziyue Wang, Xiaoyue Mi, Peng Li, Ning Ma, Maosong Sun, Yang Liu. ACL 2024, 10639-10659. [pdf] [arXiv] [code] [Project Webpage]
StableToolBench: Towards Stable Large-Scale Benchmarking on Tool Learning of Large Language Models
Zhicheng Guo, Sijie Cheng, Hao Wang, Shihao Liang, Yujia Qin, Peng Li, Zhiyuan Liu, Maosong Sun, Yang Liu. Findings of ACL 2024, 11143-11156. [pdf] [arXiv] [code] [Project Webpage]
DocRED: A Large-Scale Document-Level Relation Extraction Dataset
Yuan Yao, Deming Ye, Peng Li, Xu Han, Yankai Lin, Zhenghao Liu, Zhiyuan Liu, Lixin Huang, Jie Zhou, Maosong Sun. ACL 2019, 764-777. [pdf] [code & data] [Leaderboard]

Honors and Awards

  • ICAIS 2025 Responsible AI Research Award (2025)
  • ACL 2023 Outstanding Paper Award (2023)
  • CIKM 2021 Best Resource Paper Nomination (2021)
  • First Prize of the Qian Weichang Chinese Information Processing Science and Technology Award (2020)

Professional Services

  • Senior Area Chair: AACL (2022, 2026)
  • Area Chair: ICLR (2026), ACL (2024–2025), EMNLP (2024), NAACL (2024–2025), EACL (2024, 2026), COLING (2022), AACL (2025)
  • Action Editor: ACL Rolling Review (ARR), TMLR
  • PC Chair: YWCL (2010)
  • Tutorial Chair: CCL (2023), CCMT (2023, 2025)
  • Demonstration Chair: CCL (2024)
  • Frontier Forum Chair: CCMT (2024)
  • Publicity Chair: CCL (2025)
  • Student Symposium Chair: CCL (2026)
  • Reviewer / PC Member: ICML (2022–2026), ICLR (2024–2025), NeurIPS (2022–2025), ACL (2014, 2021–2023), EMNLP (2014, 2021–2023, 2025), NAACL (2018), CVPR (2023–2026), ICCV (2023, 2025), ECCV (2024, 2026), AAAI (2022–2025), IJCAI (2022–2024), COLING (2020, 2024, 2025), BMVC (2024), COLM (2024–2025), ICRA (2025–2026), IROS (2026), ACMMM (2025–2026), ACM TIST (2015), TACL (Standing Reviewer), Acta Automatica Sinica, Journal of Computer Science and Technology (JCST)

Education