
About

Peng Li is a Research Associate Professor at the Institute for AI Industry Research (AIR), Tsinghua University. Before joining Tsinghua, he was a principal researcher and team leader at WeChat AI, Tencent. He previously worked at the Institute of Deep Learning (IDL), Baidu Inc. His research spans Large Language Models (LLMs), LLM-based Agents, AI for mathematics (AI4Math), and Multimodal Large Language Models (MLLMs). He has published over 90 papers in top-tier venues and received the Outstanding Paper Award at ACL 2023. His work ranks first on several influential benchmarks, surpassing teams from Google Research and OpenAI. He has led major scientific research projects, including a key task under the National Science and Technology Innovation - Major Program and the National Natural Science Foundation of China (NSFC) General Program. He has also served as an Area Chair for top-tier international conferences such as ACL, EMNLP, and NAACL. His research has been deployed at Baidu and WeChat, reaching tens of millions of users. He received the First Prize of the Qian Weichang Chinese Information Processing Science and Technology Award from the Chinese Information Processing Society of China (CIPS).

  • Large Language Models (LLMs)
  • LLM-based Agents
  • AI4Math
  • Multimodal Large Language Models (MLLMs)

Address: 11/F, Block C, Qidi Science & Technology Building, Tsinghua Science Park, Haidian District, Beijing

Publications & Preprints

View on Google Scholar
2025
AI Mathematician: Towards Fully Automated Frontier Mathematical Research
Yuanhang Liu, Yanxing Huang, Yanqiao Wang, Peng Li, Yang Liu. Preprint. [arXiv] [Slides] [Slides (in Chinese)]
ActiView: Evaluating Active Perception Ability for Multimodal Large Language Models
Ziyue Wang, Chi Chen, Fuwen Luo, Yurui Dong, Yuanchi Zhang, Yuzhuang Xu, Xiaolong Wang, Peng Li, Yang Liu. ACL 2025, 7605-7633. [pdf] [arXiv] [code]
Perspective Transition of Large Language Models for Solving Subjective Tasks
Xiaolong Wang, Yuanchi Zhang, Ziyue Wang, Yuzhuang Xu, Fuwen Luo, Yile Wang, Peng Li, Yang Liu. Findings of ACL 2025, 9686-9704. [pdf] [arXiv]
Scaffolding Coordinates to Promote Vision-Language Coordination in Large Multi-Modal Models
Xuanyu Lei, Zonghan Yang, Xinrui Chen, Peng Li, Yang Liu. COLING 2025, 2886-2903. [pdf] [arXiv] [code] [Project Webpage]
Rethinking Long Context Generation from the Continual Learning Perspective
Zeyuan Yang, Fangzhou Xiong, Peng Li, Yang Liu. COLING 2025, 1922-1933. [pdf]
Leveraging Language-based Representations for Better Solving Symbol-related Problems with Large Language Models
Yile Wang, Sijie Cheng, Zixin Sun, Peng Li, Yang Liu. COLING 2025, 5544-5557. [pdf] [arXiv] [code]
Dual-AEB: Synergizing Rule-Based and Multimodal Large Language Models for Effective Emergency Braking
Wei Zhang, Pengfei Li, Junli Wang, Bingchuan Sun, Qihao Jin, Guangjun Bao, Shibo Rui, Yang Yu, Wenchao Ding, Peng Li, Yilun Chen. ICRA 2025. [arXiv] [code]
CoSpace: Benchmarking Continuous Space Perception Ability for Vision-Language Models
Yiqi Zhu, Ziyue Wang, Can Zhang, Peng Li, Yang Liu. CVPR 2025. [pdf] [arXiv] [code] [Project Webpage]
AdaMMS: Model Merging for Heterogeneous Multimodal Large Language Models with Unsupervised Coefficient Optimization
Yiyang Du, Xiaochen Wang, Chi Chen, Jiabo Ye, Yiru Wang, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Zhifang Sui, Maosong Sun, Yang Liu. CVPR 2025. [pdf] [arXiv] [code] [Slides]
How Do Multimodal Large Language Models Handle Complex Multimodal Reasoning? Placing Them in An Extensible Escape Game
Ziyue Wang, Yurui Dong, Fuwen Luo, Minyuan Ruan, Zhili Cheng, Chi Chen, Peng Li, Yang Liu. ICCV 2025. [arXiv] [code] [Slides] [Video]
LVAgent: Long Video Understanding by Multi-Round Dynamical Collaboration of MLLM Agents
Boyu Chen, Zhengrong Yue, Siran Chen, Zikang Wang, Yang Liu, Peng Li, Yali Wang. ICCV 2025. [arXiv]
Adversarial Robust Memory-Based Continual Learner
Xiaoyue Mi, Fan Tang, Zonghan Yang, Danding Wang, Juan Cao, Peng Li, Yang Liu. ICCV 2025. [arXiv]
Contrastive Private Data Synthesis via Weighted Multi-PLM Fusion
Tianyuan Zou, Yang Liu, Peng Li, Yufei Xiong, Jianqing Zhang, Jingjing Liu, Xiaozhou Ye, Ye Ouyang, Ya-Qin Zhang. ICML 2025. [arXiv]
Bench4Merge: A Comprehensive Benchmark for Merging in Realistic Dense Traffic with Micro-Interactive Vehicles
Zhengming Wang, Junli Wang, Pengfei Li, Zhaohan Li, Chunyang Liu, Bo Zhang, Peng Li, Yilun Chen. IROS 2025. [arXiv] [code] [Video]
EditEval: Towards Comprehensive and Automatic Evaluation for Text-guided Video Editing
Bingshuai Liu, Ante Wang, Zijun Min, Chenyang Lyu, Longyue Wang, Zhihao Wang, Xu Han, Peng Li, Jinsong Su. ACMMM 2025.
Advancing Language Multi-Agent Learning with Credit Re-Assignment for Interactive Environment Generalization
Zhitao He, Zijun Liu, Peng Li, Yi R Fung, Ming Yan, Ji Zhang, Fei Huang, Yang Liu. COLM 2025. [arXiv] [code] [Slides (in Chinese)]
FormaRL: Enhancing Autoformalization with no Labeled Data
Yanxing Huang, Xinling Jin, Sijie Liang, Fuwen Luo, Peng Li, Yang Liu. COLM 2025.
Agent-Environment Alignment via Automated Interface Generation
Kaiming Liu, Xuanyu Lei, Ziyue Wang, Peng Li, Yang Liu. Preprint. [arXiv] [Slides]
MUSEG: Reinforcing Video Temporal Understanding via Timestamp-Aware Multi-Segment Grounding
Fuwen Luo, Shengfeng Lou, Chi Chen, Ziyue Wang, Chenliang Li, Weizhou Shen, Jiyue Guo, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Yang Liu. Preprint. [arXiv]
Inference-Time Scaling for Generalist Reward Modeling
Zijun Liu, Peiyi Wang, Runxin Xu, Shirong Ma, Chong Ruan, Peng Li, Yang Liu, Yu Wu. Preprint. [arXiv]
LiloDriver: A Lifelong Learning Framework for Closed-loop Motion Planning in Long-tail Autonomous Driving Scenarios
Huaiyuan Yao, Pengfei Li, Bu Jin, Yupeng Zheng, An Liu, Lisen Mu, Qing Su, Qian Zhang, Yilun Chen, Peng Li. Preprint. [arXiv]
Visual Abstract Thinking Empowers Multimodal Reasoning
Dairu Liu, Ziyue Wang, Minyuan Ruan, Fuwen Luo, Chi Chen, Peng Li, Yang Liu. Preprint. [arXiv]
A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond
Xiaoye Qu, Yafu Li, Zhaochen Su, Weigao Sun, Jianhao Yan, Dongrui Liu, Ganqu Cui, Daizong Liu, Shuxian Liang, Junxian He, Peng Li, Wei Wei, Jing Shao, Chaochao Lu, Yue Zhang, Xian-Sheng Hua, Bowen Zhou, Yu Cheng. Preprint. [arXiv] [Project Webpage]
DongbaMIE: A Multimodal Information Extraction Dataset for Evaluating Semantic Understanding of Dongba Pictograms
Xiaojun Bi, Shuo Li, Junyao Xing, Ziyue Wang, Fuwen Luo, Weizheng Qiao, Lu Han, Ziwei Sun, Peng Li, Yang Liu. Preprint. [arXiv] [code] [Dataset]
Scaling External Knowledge Input Beyond Context Windows of LLMs via Multi-Agent Collaboration
Zijun Liu, Zhennan Wan, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Yang Liu. Preprint. [arXiv] [code]
Writing-RL: Advancing Long-form Writing via Adaptive Curriculum Reinforcement Learning
Xuanyu Lei, Chenliang Li, Yuning Wu, Kaiming Liu, Weizhou Shen, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Yang Liu. Preprint. [arXiv] [code]
MUCAR: Benchmarking Multilingual Cross-Modal Ambiguity Resolution for Multimodal Large Language Models
Xiaolong Wang, Zhaolu Kang, Wangyuxuan Zhai, Xinyue Lou, Yunghwei Lai, Ziyue Wang, Yawen Wang, Kaiyu Huang, Yile Wang, Peng Li, Yang Liu. Preprint. [arXiv]
2024
Position: Towards Unified Alignment Between Agents, Humans, and Environment
Zonghan Yang, An Liu, Zijun Liu, Kaiming Liu, Fangzhou Xiong, Yile Wang, Zeyuan Yang, Qingyuan Hu, Xinrui Chen, Zhenhe Zhang, Fuwen Luo, Zhicheng Guo, Peng Li, Yang Liu. ICML 2024, 56251-56275. [pdf] [arXiv] [Project Webpage]
CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models
Fuwen Luo, Chi Chen, Zihao Wan, Zhaolu Kang, Qidong Yan, Yingjie Li, Xiaolong Wang, Siyu Wang, Ziyue Wang, Xiaoyue Mi, Peng Li, Ning Ma, Maosong Sun, Yang Liu. ACL 2024, 10639-10659. [pdf] [arXiv] [code] [Project Webpage]
Browse and Concentrate: Comprehending Multimodal Content via prior-LLM Context Fusion
Ziyue Wang, Chi Chen, Yiqi Zhu, Fuwen Luo, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Maosong Sun, Yang Liu. ACL 2024, 11229-11245. [pdf] [arXiv] [code]
Model Composition for Multimodal Large Language Models
Chi Chen, Yiyang Du, Zheng Fang, Ziyue Wang, Fuwen Luo, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Maosong Sun, Yang Liu. ACL 2024, 11246-11262. [pdf] [arXiv] [code]
Enhancing Multilingual Capabilities of Large Language Models through Self-Distillation from Resource-Rich Languages
Yuanchi Zhang, Yile Wang, Zijun Liu, Shuo Wang, Xiaolong Wang, Peng Li, Maosong Sun, Yang Liu. ACL 2024, 11189-11204. [pdf] [arXiv] [code]
Reasoning in Conversation: Solving Subjective Tasks through Dialogue Simulation for Large Language Models
Xiaolong Wang, Yile Wang, Yuanchi Zhang, Fuwen Luo, Peng Li, Maosong Sun, Yang Liu. ACL 2024, 15880-15893. [pdf] [arXiv]
StableToolBench: Towards Stable Large-Scale Benchmarking on Tool Learning of Large Language Models
Zhicheng Guo, Sijie Cheng, Hao Wang, Shihao Liang, Yujia Qin, Peng Li, Zhiyuan Liu, Maosong Sun, Yang Liu. Findings of ACL 2024, 11143-11156. [pdf] [arXiv] [code] [Project Webpage]
PANDA: Preference Adaptation for Enhancing Domain-Specific Abilities of LLMs
An Liu, Zonghan Yang, Zhenhe Zhang, Qingyuan Hu, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Yang Liu. Findings of ACL 2024, 10960-10977. [pdf] [arXiv] [code]
Budget-Constrained Tool Learning with Planning
Yuanhang Zheng, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Yang Liu. Findings of ACL 2024, 9039-9052. [pdf] [arXiv]
ReAct Meets ActRe: Autonomous Annotation of Agent Trajectories for Contrastive Self-Training
Zonghan Yang, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Yang Liu. COLM 2024. [pdf] [arXiv]
A Dynamic LLM-Powered Agent Network for Task-Oriented Agent Collaboration
Zijun Liu, Yanzhe Zhang, Peng Li, Yang Liu, Diyi Yang. COLM 2024. [pdf] [arXiv] [code]
ToolRerank: Adaptive and Hierarchy-Aware Reranking for Tool Retrieval
Yuanhang Zheng, Peng Li, Wei Liu, Yang Liu, Jian Luan, Bin Wang. COLING 2024, 16263-16273. [pdf] [arXiv] [code]
Pluggable Neural Machine Translation Models via Memory-augmented Adapters
Yuzhuang Xu, Shuo Wang, Peng Li, Xuebo Liu, Xiaolong Wang, Weidong Liu, Yang Liu. COLING 2024, 12794-12808. [pdf] [arXiv] [code]
DEEM: Dynamic Experienced Expert Modeling for Stance Detection
Xiaolong Wang, Yile Wang, Sijie Cheng, Peng Li, Yang Liu. COLING 2024, 4530-4541. [pdf] [arXiv] [code]
EgoThink: Evaluating First-Person Perspective Thinking Capability of Vision-Language Models
Sijie Cheng, Zhicheng Guo, Jingwen Wu, Kechen Fang, Peng Li, Huaping Liu, Yang Liu. CVPR 2024, 14291-14302. [pdf] [arXiv] [code] [Project Webpage]
FuseGen: PLM Fusion for Data-generation based Zero-shot Learning
Tianyuan Zou, Yang Liu, Peng Li, Jianqing Zhang, Jingjing Liu, Ya-Qin Zhang. EMNLP 2024, 2172-2190. [pdf] [arXiv] [code]
Black-box Prompt Tuning with Subspace Learning
Yuanhang Zheng, Zhixing Tan, Peng Li, Yang Liu. IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP), (32):3002-3013. [pdf] [arXiv]
Exploring Universal Intrinsic Task Subspace for Few-shot Learning via Prompt Tuning
Yujia Qin, Xiaozhi Wang, Yusheng Su, Yankai Lin, Ning Ding, Jing Yi, Weize Chen, Zhiyuan Liu, Juanzi Li, Lei Hou, Peng Li, Maosong Sun, Jie Zhou. IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP), (32):3631-3643. [pdf] [arXiv]
Topology-preserving Adversarial Training for Alleviating Natural Accuracy Degradation
Xiaoyue Mi, Fan Tang, Yepeng Weng, Danding Wang, Juan Cao, Sheng Tang, Peng Li, Yang Liu. BMVC 2024. [pdf] [arXiv]
AIGS: Generating Science from AI-Powered Automated Falsification
Zijun Liu, Kaiming Liu, Yiqi Zhu, Xuanyu Lei, Zonghan Yang, Zhenhe Zhang, Peng Li, Yang Liu. Preprint. [arXiv] [Project Webpage] [Slides (in Chinese)]
Personal LLM Agents: Insights and Survey about the Capability, Efficiency and Security
Yuanchun Li, Hao Wen, Weijun Wang, Xiangyu Li, Yizhen Yuan, Guohong Liu, Jiacheng Liu, Wenxing Xu, Xiang Wang, Yi Sun, Rui Kong, Yile Wang, Hanfei Geng, Jian Luan, Xuefeng Jin, Zilong Ye, Guanjing Xiong, Fan Zhang, Xiang Li, Mengwei Xu, Zhijun Li, Peng Li, Yang Liu, Ya-Qin Zhang, Yunxin Liu. Preprint. [arXiv] [Paper List]
Enabling Weak LLMs to Judge Response Reliability via Meta Ranking
Zijun Liu, Boqun Kou, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Yang Liu. Preprint. [arXiv] [code]
StreamingBench: Assessing the Gap for MLLMs to Achieve Streaming Video Understanding
Junming Lin, Zheng Fang, Chi Chen, Zihao Wan, Fuwen Luo, Peng Li, Yang Liu, Maosong Sun. Preprint. [arXiv] [code] [Project Webpage]
Visual-Friendly Concept Protection via Selective Adversarial Perturbations
Xiaoyue Mi, Fan Tang, Juan Cao, Peng Li, Yang Liu. Preprint. [arXiv]
Agent Hospital: A Simulacrum of Hospital with Evolvable Medical Agents
Junkai Li, Yunghwei Lai, Weitao Li, Jingyi Ren, Meng Zhang, Xinhui Kang, Siyu Wang, Peng Li, Ya-Qin Zhang, Weizhi Ma, Yang Liu. Preprint. [arXiv]
Interactive Visual Assessment for Text-to-Image Generation Models
Xiaoyue Mi, Fan Tang, Juan Cao, Qiang Sheng, Ziyao Huang, Peng Li, Yang Liu, Tong-Yee Lee. Preprint. [arXiv]
2023
Failures Pave the Way: Enhancing Large Language Models through Tuning-free Rule Accumulation
Zeyuan Yang, Peng Li, Yang Liu. EMNLP 2023, 1751-1777. [pdf] [arXiv] [code]
Learn and Consolidate: Continual Adaptation for Zero-Shot and Multilingual Neural Machine Translation
Kaiyu Huang, Peng Li, Junpeng Liu, Maosong Sun, Yang Liu. EMNLP 2023, 13938-13951. [pdf] [code]
Revisiting Source Context in Nearest Neighbor Machine Translation
Xuanhong Li, Peng Li, Po Hu. EMNLP 2023, 8087-8098. [pdf] [code]
Self-Knowledge Guided Retrieval Augmentation for Large Language Models
Yile Wang, Peng Li, Maosong Sun, Yang Liu. Findings of EMNLP 2023, 10303-10315. [pdf] [arXiv] [code]
Filling the Image Information Gap for VQA: Prompting Large Language Models to Proactively Ask Questions
Ziyue Wang, Chi Chen, Peng Li, Yang Liu. Findings of EMNLP 2023, 2874-2890. [pdf] [arXiv] [code]
Exploring Large Language Models for Communication Games: An Empirical Study on Werewolf
Yuzhuang Xu, Shuo Wang, Peng Li, Fuwen Luo, Xiaolong Wang, Weidong Liu, Yang Liu. Preprint. [arXiv] [code]
Position-Enhanced Visual Instruction Tuning for Multimodal Large Language Models
Chi Chen, Ruoyu Qin, Fuwen Luo, Xiaoyue Mi, Peng Li, Maosong Sun, Yang Liu. Preprint. [arXiv] [code] [Demo]
Knowledge Transfer in Incremental Learning for Multilingual Neural Machine Translation
Kaiyu Huang, Peng Li, Jin Ma, Ting Yao, Yang Liu. ACL 2023, 15286-15304. [pdf] [code]
Bridging the Gap between Decision and Logits in Decision-based Knowledge Distillation for Pre-trained Language Models
Qinhong Zhou, Zonghan Yang, Peng Li, Yang Liu. ACL 2023, 13234-13248. [pdf] [arXiv] [code]
Weakly Supervised Vision-and-Language Pre-training with Relative Representations
Chi Chen, Peng Li, Maosong Sun, Yang Liu. ACL 2023, 8341-8355. [pdf] [arXiv] [code]
An Extensible Plug-and-Play Method for Multi-Aspect Controllable Text Generation
Xuancheng Huang, Zijun Liu, Peng Li, Tao Li, Maosong Sun, Yang Liu. ACL 2023, 15233-15256. [pdf] [arXiv] [code]
Continual Knowledge Distillation for Neural Machine Translation
Yuanchi Zhang, Peng Li, Maosong Sun, Yang Liu. ACL 2023, 7978-7996. [pdf] [arXiv] [code]
Hard Sample Aware Prompt-Tuning
Yuanjian Xu, Qi An, Jiahuan Zhang, Peng Li, Zaiqing Nie. ACL 2023, 12356-12369. [pdf]
Plug-and-Play Knowledge Injection for Pre-trained Language Models
Zhengyan Zhang, Zhiyuan Zeng, Yankai Lin, Huadong Wang, Deming Ye, Chaojun Xiao, Xu Han, Zhiyuan Liu, Peng Li, Maosong Sun, Jie Zhou. ACL 2023, 10641-10658. [pdf] [arXiv]
Prompt-Guided Retrieval Augmentation for Non-Knowledge-Intensive Tasks
Zhicheng Guo, Sijie Cheng, Yile Wang, Peng Li, Yang Liu. Findings of ACL 2023, 10896-10912. [pdf] [arXiv]
Improving Adversarial Robustness of Deep Equilibrium Models with Explicit Regulations Along the Neural Dynamics
Zonghan Yang, Peng Li, Tianyu Pang, Yang Liu. ICML 2023, 39349-39364. [pdf] [arXiv] [code]
Unified Detoxifying and Debiasing in Language Generation via Inference-time Adaptive Optimization
Zonghan Yang, Xiaoyuan Yi, Peng Li, Yang Liu, Xing Xie. ICLR 2023. [pdf] [arXiv]
Learning to Relate to Previous Turns in Conversational Search
Fengran Mo, Jian-Yun Nie, Kaiyu Huang, Kelong Mao, Yutao Zhu, Peng Li, Yang Liu. KDD 2023, 1722-1732. [pdf] [arXiv]
Exploring the Effectiveness of Student Behavior in Prerequisite Relation Discovery for Concepts
Jifan Yu, Hanming Li, Gan Luo, Yankai Lin, Peng Li, Jianjun Xu, Lei Hou, Bin Xu. APWeb-WAIM 2023, 359-374. [pdf]
When to Trust Aggregated Gradients: Addressing Negative Client Sampling in Federated Learning
Wenkai Yang, Yankai Lin, Guangxiang Zhao, Peng Li, Jie Zhou, Xu Sun. Transactions on Machine Learning Research (TMLR), ISSN 2835-8856. [pdf] [arXiv] [code]
Gradual Syntactic Label Replacement for Language Model Pre-training
Yile Wang, Yue Zhang, Peng Li, Yang Liu. IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP), (32):486-496. [pdf]
AdaDS: Adaptive Data Selection for Accelerating Pre-trained Language Model Knowledge Distillation
Qinhong Zhou, Peng Li, Yang Liu, Yuyang Guan, Qizhou Xing, Ming Chen, Maosong Sun, Yang Liu. AI Open, (4):56-63. [pdf]
Restricted Orthogonal Gradient Projection for Continual Learning
Zeyuan Yang, Zonghan Yang, Yichen Liu, Peng Li, Yang Liu. AI Open, (4):98-110. [pdf] [arXiv]
2022
End-to-End Unsupervised Vision-and-Language Pre-training with Referring Expression Matching
Chi Chen, Peng Li, Maosong Sun, Yang Liu. EMNLP 2022, 10799-10810. [pdf] [code]
A Template-based Method for Constrained Neural Machine Translation
Shuo Wang, Peng Li, Zhixing Tan, Zhaopeng Tu, Maosong Sun, Yang Liu. EMNLP 2022, 3665-3679. [pdf] [arXiv] [code]
Entropy-Based Vocabulary Substitution for Incremental Learning in Multilingual Neural Machine Translation
Kaiyu Huang, Peng Li, Jin Ma, Yang Liu. EMNLP 2022, 10537-10550. [pdf] [code]
ROSE: Robust Selective Fine-tuning for Pre-trained Language Models
Lan Jiang, Hao Zhou, Yankai Lin, Peng Li, Jie Zhou, Rui Jiang. EMNLP 2022, 2886-2897. [pdf] [arXiv] [code]
MAVEN-ERE: A Unified Large-scale Dataset for Event Coreference, Temporal, Causal and Subevent Relation Extraction
Xiaozhi Wang, Yulin Chen, Ning Ding, Hao Peng, Zimu Wang, Yankai Lin, Xu Han, Lei Hou, Juanzi Li, Zhiyuan Liu, Peng Li, Jie Zhou. EMNLP 2022, 926-941. [pdf] [arXiv] [code]
From Mimicking to Integrating: Knowledge Integration for Pre-Trained Language Models
Lei Li, Yankai Lin, Xuancheng Ren, Guangxiang Zhao, Peng Li, Jie Zhou, Xu Sun. Findings of EMNLP 2022, 6420-6431. [pdf] [arXiv] [code]
Event Detection with Dual Relational Graph Attention Networks
Jiaxin Mi, Po Hu, Peng Li. COLING 2022, 1979-1989. [pdf] [code]
Rethinking the Promotion Brought by Contrastive Learning to Semi-Supervised Node Classification
Deli Chen, Yankai Lin, Lei Li, Xuancheng Ren, Peng Li, Jie Zhou, Xu Sun. IJCAI-22, 2852-2858. [pdf] [arXiv]
Knowledge Inheritance for Pre-trained Language Models
Yujia Qin, Yankai Lin, Jing Yi, Jiajie Zhang, Xu Han, Zhengyan Zhang, Yusheng Su, Zhiyuan Liu, Peng Li, Maosong Sun, Jie Zhou. NAACL 2022, 3921-3937. [pdf] [arXiv] [code]
On Transferability of Prompt Tuning for Natural Language Processing
Yusheng Su, Xiaozhi Wang, Yujia Qin, Chi-Min Chan, Yankai Lin, Huadong Wang, Kaiyue Wen, Zhiyuan Liu, Peng Li, Juanzi Li, Lei Hou, Maosong Sun, Jie Zhou. NAACL 2022, 3949-3969. [pdf] [arXiv] [code]
Fully Hyperbolic Neural Networks
Weize Chen, Xu Han, Yankai Lin, Hexu Zhao, Zhiyuan Liu, Peng Li, Maosong Sun, Jie Zhou. ACL 2022, 5672-5686. [pdf] [arXiv] [code]
Unsupervised Dependency Graph Network
Yikang Shen, Shawn Tan, Alessandro Sordoni, Peng Li, Jie Zhou, Aaron Courville. ACL 2022, 4767-4784. [pdf]
Packed Levitated Marker for Entity and Relation Extraction
Deming Ye, Yankai Lin, Peng Li, Maosong Sun. ACL 2022, 4904-4917. [pdf] [arXiv] [code]
CTRLEval: An Unsupervised Reference-Free Metric for Evaluating Controlled Text Generation
Pei Ke, Hao Zhou, Yankai Lin, Peng Li, Jie Zhou, Xiaoyan Zhu, Minlie Huang. ACL 2022, 2306-2319. [pdf] [arXiv] [code]
A Simple but Effective Pluggable Entity Lookup Table for Pre-trained Language Models
Deming Ye, Yankai Lin, Peng Li, Maosong Sun, Zhiyuan Liu. ACL 2022, 523-529. [pdf] [arXiv] [code]
ELLE: Efficient Lifelong Pre-training for Emerging Data
Yujia Qin, Jiajie Zhang, Yankai Lin, Zhiyuan Liu, Peng Li, Maosong Sun, Jie Zhou. Findings of ACL 2022, 2789-2810. [pdf] [arXiv] [code]
Do Pre-trained Models Benefit Knowledge Graph Completion? A Reliable Evaluation and a Reasonable Approach
Xin Lv, Yankai Lin, Yixin Cao, Lei Hou, Juanzi Li, Zhiyuan Liu, Peng Li, Jie Zhou. Findings of ACL 2022, 3570-3581. [pdf] [code]
MoEfication: Transformer Feed-forward Layers are Mixtures of Experts
Zhengyan Zhang, Yankai Lin, Zhiyuan Liu, Peng Li, Maosong Sun, Jie Zhou. Findings of ACL 2022, 877-890. [pdf] [arXiv] [code]
Manual-Guided Dialogue for Flexible Conversational Agents
Ryuichi Takanobu, Hao Zhou, Yankai Lin, Peng Li, Jie Zhou, Minlie Huang. Preprint. [arXiv]
2021
Topology-Imbalance Learning for Semi-Supervised Node Classification
Deli Chen, Yankai Lin, Guangxiang Zhao, Xuancheng Ren, Peng Li, Jie Zhou, Xu Sun. NeurIPS 2021. [pdf] [code]
Dynamic Knowledge Distillation for Pre-trained Language Models
Lei Li, Yankai Lin, Shuhuai Ren, Peng Li, Jie Zhou, Xu Sun. EMNLP 2021, 379-389. [pdf] [code]
RAP: Robustness-Aware Perturbations for Defending against Backdoor Attacks on NLP Models
Wenkai Yang, Yankai Lin, Peng Li, Jie Zhou, Xu Sun. EMNLP 2021, 8365-8381. [pdf] [arXiv]
CodRED: A Cross-Document Relation Extraction Dataset for Acquiring Knowledge in the Wild
Yuan Yao, Jiaju Du, Yankai Lin, Peng Li, Zhiyuan Liu, Jie Zhou, Maosong Sun. EMNLP 2021, 4452-4472. [pdf] [code & data] [Leaderboard]
CascadeBERT: Accelerating Inference of Pre-trained Language Models via Calibrated Complete Models Cascade
Lei Li, Yankai Lin, Deli Chen, Shuhuai Ren, Peng Li, Jie Zhou, Xu Sun. Findings of EMNLP 2021, 475-486. [pdf] [code]
MOOCCubeX: A Large Knowledge-centered Repository for Adaptive Learning in MOOCs
Jifan Yu, Yuquan Wang, Qingyang Zhong, Gan Luo, Yiming Mao, Kai Sun, Wenzheng Feng, Wei Xu, Shulin Cao, Kaisheng Zeng, Zijun Yao, Lei Hou, Yankai Lin, Peng Li, Jie Zhou, Bin Xu, Juanzi Li, Jie Tang, Maosong Sun. CIKM 2021, 4643-4652. [pdf] [code]
CokeBERT: Contextual Knowledge Selection and Embedding towards Enhanced Pre-Trained Language Models
Yusheng Su, Xu Han, Zhengyan Zhang, Peng Li, Zhiyuan Liu, Yankai Lin, Jie Zhou, Maosong Sun. AI Open, (2):127-134. [pdf] [code]
ERICA: Improving Entity and Relation Understanding for Pre-trained Language Models via Contrastive Learning
Yujia Qin, Yankai Lin, Ryuichi Takanobu, Zhiyuan Liu, Peng Li, Heng Ji, Minlie Huang, Maosong Sun, Jie Zhou. ACL-IJCNLP 2021, 3350-3363. [pdf] [code]
Rethinking Stealthiness of Backdoor Attack against NLP Models
Wenkai Yang, Yankai Lin, Peng Li, Jie Zhou, Xu Sun. ACL-IJCNLP 2021, 5543-5557. [pdf] [code]
CLEVE: Contrastive Pre-training for Event Extraction
Ziqi Wang, Xiaozhi Wang, Xu Han, Yankai Lin, Lei Hou, Zhiyuan Liu, Peng Li, Juanzi Li, Jie Zhou. ACL-IJCNLP 2021, 6283-6297. [pdf] [code]
GoG: Relation-aware Graph-over-Graph Network for Visual Dialog
Feilong Chen, Xiuyi Chen, Fandong Meng, Peng Li, Jie Zhou. Findings of ACL-IJCNLP 2021, 230-243. [pdf]
Multimodal Incremental Transformer with Visual Grounding for Visual Dialogue Generation
Feilong Chen, Fandong Meng, Xiuyi Chen, Peng Li, Jie Zhou. Findings of ACL-IJCNLP 2021, 436-446. [pdf]
Unsupervised Knowledge Selection for Dialogue Generation
Xiuyi Chen, Feilong Chen, Fandong Meng, Peng Li, Jie Zhou. Findings of ACL-IJCNLP 2021, 1230-1244. [pdf]
Manual Evaluation Matters: Reviewing Test Protocols of Distantly Supervised Relation Extraction
Tianyu Gao, Xu Han, Yuzhuo Bai, Keyue Qiu, Zhiyu Xie, Yankai Lin, Zhiyuan Liu, Peng Li, Maosong Sun, Jie Zhou. Findings of ACL-IJCNLP 2021, 1306-1318. [pdf] [code]
Aspect-Level Sentiment-Controllable Review Generation with Mutual Learning Framework
Huimin Chen, Yankai Lin, Fanchao Qi, Jinyi Hu, Peng Li, Jie Zhou, Maosong Sun. AAAI 2021, 12639-12647. [pdf]
Guiding Non-Autoregressive Neural Machine Translation Decoding with Reordering Information
Qiu Ran*, Yankai Lin*, Peng Li*, Jie Zhou. AAAI 2021, 13727-13735. [pdf] [code]
Context Tracking Network: Graph-based Context Modeling for Implicit Discourse Relation Recognition
Yingxue Zhang, Fandong Meng, Peng Li, Ping Jian, Jie Zhou. NAACL 2021, 1592–1599. [pdf] [code]
CSS-LM: A Contrastive Framework for Semi-supervised Fine-tuning of Pre-trained Language Models
Yusheng Su, Xu Han, Yankai Lin, Zhengyan Zhang, Zhiyuan Liu, Peng Li, Jie Zhou, Maosong Sun. IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP), (29):2930-2941. [pdf] [code]
WeChat Neural Machine Translation Systems for WMT21
Xianfeng Zeng, Yijin Liu, Ernan Li, Qiu Ran, Fandong Meng, Peng Li, Jinan Xu, Jie Zhou. WMT21, 243–254. [pdf]
MS-Ranker: Accumulating Evidence from Potentially Correct Candidates via Reinforcement Learning for Answer Selection
Yingxue Zhang, Fandong Meng, Peng Li, Ping Jian, Jie Zhou. Neurocomputing, (449):270-279. [pdf]
2020
Bridging the Gap between Prior and Posterior Knowledge Selection for Knowledge-Grounded Dialogue Generation
Xiuyi Chen, Fandong Meng, Peng Li, Feilong Chen, Shuang Xu, Bo Xu, Jie Zhou. EMNLP 2020, 3426–3437. [pdf]
Learning from Context or Names? An Empirical Study on Neural Relation Extraction
Hao Peng, Tianyu Gao, Xu Han, Yankai Lin, Peng Li, Zhiyuan Liu, Maosong Sun, Jie Zhou. EMNLP 2020, 3661–3672. [pdf] [code]
Coreferential Reasoning Learning for Language Representation
Deming Ye, Yankai Lin, Jiaju Du, Zhenghao Liu, Peng Li, Maosong Sun, Zhiyuan Liu. EMNLP 2020, 7170–7186. [pdf] [code]
Disentangle-based Continual Graph Representation Learning
Xiaoyu Kou, Yankai Lin, Shaobo Liu, Peng Li, Jie Zhou, Yan Zhang. EMNLP 2020, 2961-2972. [pdf] [code]
MAVEN: A Massive General Domain Event Detection Dataset
Xiaozhi Wang, Ziqi Wang, Xu Han, Wangyi Jiang, Rong Han, Zhiyuan Liu, Juanzi Li, Peng Li, Yankai Lin, Jie Zhou. EMNLP 2020, 1652–1671. [pdf] [code]
WeChat Neural Machine Translation Systems for WMT20
Fandong Meng, Jianhao Yan, Yijin Liu, Yuan Gao, Xianfeng Zeng, Qinsong Zeng, Peng Li, Ming Chen, Jie Zhou, Sifan Liu, Hao Zhou. WMT20, 239–247. [pdf]
Learning to Recover from Multi-Modality Errors for Non-Autoregressive Neural Machine Translation
Qiu Ran*, Yankai Lin*, Peng Li*, Jie Zhou. ACL 2020, 3059–3069. [pdf] [code]
Continual Relation Learning via Episodic Memory Activation and Reconsolidation
Xu Han, Yi Dai, Tianyu Gao, Yankai Lin, Zhiyuan Liu, Peng Li, Maosong Sun, Jie Zhou. ACL 2020, 6429–6440. [pdf] [code]
Measuring and Relieving the Over-smoothing Problem for Graph Neural Networks from the Topological View
Deli Chen, Yankai Lin, Wei Li, Peng Li, Jie Zhou, Xu Sun. AAAI 2020, 3438-3445. [pdf]
DMRM: A Dual-channel Multi-hop Reasoning Model for Visual Dialog
Feilong Chen, Fandong Meng, Jiaming Xu, Peng Li, Bo Xu, Jie Zhou. AAAI 2020, 7504-7511. [pdf] [code]
Neural Gibbs Sampling for Joint Event Argument Extraction
Xiaozhi Wang, Shengyu Jia, Xu Han, Zhiyuan Liu, Juanzi Li, Peng Li, Jie Zhou. AACL 2020, 169-180. [pdf] [code]
More Data, More Relations, More Context and More Openness: A Review and Outlook for Relation Extraction
Xu Han, Tianyu Gao, Yankai Lin, Hao Peng, Yaoliang Yang, Chaojun Xiao, Zhiyuan Liu, Peng Li, Jie Zhou, Maosong Sun. AACL 2020, 745-758. [pdf]
2019
NumNet: Machine Reading Comprehension with Numerical Reasoning
Qiu Ran, Yankai Lin, Peng Li, Jie Zhou, Zhiyuan Liu. EMNLP 2019, 2474-2484. [pdf] [code]
HMEAE: Hierarchical Modular Event Argument Extraction
Xiaozhi Wang, Ziqi Wang, Xu Han, Zhiyuan Liu, Juanzi Li, Peng Li, Maosong Sun, Jie Zhou, Xiang Ren. EMNLP 2019, 5777–5783. [pdf] [code]
FewRel 2.0: Towards More Challenging Few-Shot Relation Classification
Tianyu Gao, Xu Han, Hao Zhu, Zhiyuan Liu, Peng Li, Maosong Sun, Jie Zhou. EMNLP 2019, 6250–6255. [pdf] [code] [benchmark]
DocRED: A Large-Scale Document-Level Relation Extraction Dataset
Yuan Yao, Deming Ye, Peng Li, Xu Han, Yankai Lin, Zhenghao Liu, Zhiyuan Liu, Lixin Huang, Jie Zhou, Maosong Sun. ACL 2019, 764-777. [pdf] [code & data] [Leaderboard]
Towards Fine-grained Text Sentiment Transfer
Fuli Luo, Peng Li, Pengcheng Yang, Jie Zhou, Yutong Tan, Baobao Chang, Zhifang Sui, Xu Sun. ACL 2019, 2013-2022. [pdf] [code]
Key Fact as Pivot: A Two-Stage Model for Low Resource Table-to-Text Generation
Shuming Ma, Pengcheng Yang, Tianyu Liu, Peng Li, Jie Zhou, Xu Sun. ACL 2019, 2047-2057. [pdf] [code]
A Dual Reinforcement Learning Framework for Unsupervised Text Style Transfer
Fuli Luo, Peng Li, Jie Zhou, Pengcheng Yang, Baobao Chang, Xu Sun, Zhifang Sui. IJCAI 2019, 5116-5122. [pdf] [code]
Adversarial Training for Weakly Supervised Event Detection
Xiaozhi Wang, Xu Han, Zhiyuan Liu, Maosong Sun, Peng Li. NAACL 2019, 998-1008. [pdf] [code]
HighwayGraph: Modelling Long-distance Node Relations for Improving General Graph Neural Networks
Deli Chen, Xiaoqian Liu, Yankai Lin, Peng Li, Jie Zhou, Qi Su, Xu Sun. Preprint. [arXiv]
Option Comparison Network for Multiple-choice Reading Comprehension
Qiu Ran, Peng Li, Weiwei Hu, Jie Zhou. Preprint. [arXiv] [code]
2018 and earlier
Hierarchical Relation Extraction with Coarse-to-Fine Grained Attention
Xu Han, Pengfei Yu, Zhiyuan Liu, Maosong Sun, Peng Li. EMNLP 2018, 2236-2245. [pdf] [code]
Dataset and Neural Recurrent Sequence Labeling Model for Open-Domain Factoid Question Answering
Peng Li, Wei Li, Zhengyan He, Xuguang Wang, Ying Cao, Jie Zhou, Wei Xu. Preprint. [arXiv] [code (in the directory "neural_qa", verified on Ubuntu 16.04 with PaddlePaddle 0.10.5)]
Deep Recurrent Models with Fast-Forward Connections for Neural Machine Translation
Jie Zhou, Ying Cao, Xuguang Wang, Peng Li, Wei Xu. Transactions of the Association for Computational Linguistics (TACL), (4):371-383. [pdf]
A Neural Reordering Model for Phrase-based Translation
Peng Li, Yang Liu, Maosong Sun, Tatsuya Izuha, Dakun Zhang. COLING 2014, 1897-1907. [pdf] [slides]
Neural Reordering Model for Hierarchical Phrase-based Translations
Peng Li, Yang Liu, Maosong Sun. J Tsinghua Univ (Sci & Technol), (54):1529-1533.
Recursive Autoencoders for ITG-based Translation
Peng Li, Yang Liu, Maosong Sun. EMNLP 2013, 567-577. [pdf] [Talk at MSRA Ph.D Forum]
An Extended GHKM Algorithm for Inducing Lambda-SCFG
Peng Li, Yang Liu, Maosong Sun. AAAI-13, 605-611. [pdf] [slides]
A Beam Search Algorithm for ITG Word Alignment
Peng Li, Yang Liu, Maosong Sun. COLING 2012: Posters, 673-682. [pdf]
Fast-Champollion: A Fast and Robust Sentence Alignment Algorithm
Peng Li, Maosong Sun, Ping Xue. COLING 2010: Posters, 710-718. [pdf]
Content-based and Graph-based Tag Suggestion
Xiance Si, Zhiyuan Liu, Peng Li, Qixia Jiang, Maosong Sun. ECML/PKDD 2009 Discovery Challenge Workshop, 243-260. [pdf]
Clustering to Find Exemplar Terms for Keyphrase Extraction
Zhiyuan Liu, Peng Li, Yabin Zheng, Maosong Sun. EMNLP 2009, 257-266. [pdf]
Community Detection by Affinity Propagation
Zhiyuan Liu, Peng Li, Yabin Zheng, Maosong Sun. Technical Report. [pdf]

Professional Services

  • Senior Area Chair: AACL (2022)
  • Area Chair: ACL (2024–2025), EMNLP (2024), NAACL (2024–2025), EACL (2024), COLING (2022)
  • Action Editor: ACL Rolling Review (ARR)
  • PC Chair: YWCL (2010)
  • Tutorial Chair: CCL (2023), CCMT (2023, 2025)
  • Demonstration Chair: CCL (2024)
  • Frontier Forum Chair: CCMT (2024)
  • Publicity Chair: CCL (2025)
  • Reviewer / PC Member: NeurIPS (2022–2025), ICML (2022–2025), ICLR (2024–2025), ACL (2014, 2021–2023), EMNLP (2014, 2021–2023, 2025), NAACL (2018), CVPR (2023–2025), ICCV (2023, 2025), ECCV (2024), AAAI (2022–2025), IJCAI (2022–2024), COLING (2020, 2024, 2025), BMVC (2024), COLM (2024–2025), ICRA (2025), ACM TIST (2015), TACL (Standing Reviewer), Acta Automatica Sinica

Education