AI Mathematician: Towards Fully Automated Frontier Mathematical Research
ActiView: Evaluating Active Perception Ability for Multimodal Large Language Models
Ziyue Wang, Chi Chen, Fuwen Luo, Yurui Dong, Yuanchi Zhang, Yuzhuang Xu, Xiaolong Wang,
Peng Li, Yang Liu.
ACL 2025, 7605-7633. [
pdf] [
arXiv] [
code]
Perspective Transition of Large Language Models for Solving Subjective Tasks
Xiaolong Wang, Yuanchi Zhang, Ziyue Wang, Yuzhuang Xu, Fuwen Luo, Yile Wang,
Peng Li, Yang Liu.
Findings of ACL 2025, 9686-9704. [
pdf] [
arXiv]
Scaffolding Coordinates to Promote Vision-Language Coordination in Large Multi-Modal Models
Rethinking Long Context Generation from the Continual Learning Perspective
Zeyuan Yang, Fangzhou Xiong,
Peng Li and Yang Liu.
COLING 2025, 1922-1933. [
pdf]
Leveraging Language-based Representations for Better Solving Symbol-related Problems with Large Language Models
Yile Wang, Sijie Cheng, Zixin Sun,
Peng Li, Yang Liu.
COLING 2025, 5544-5557. [
pdf] [
arXiv] [
code]
Dual-AEB: Synergizing Rule-Based and Multimodal Large Language Models for Effective Emergency Braking
Wei Zhang, Pengfei Li, Junli Wang, Bingchuan Sun, Qihao Jin, Guangjun Bao, Shibo Rui, Yang Yu, Wenchao Ding,
Peng Li, Yilun Chen.
ICRA 2025. [
arXiv] [
code]
CoSpace: Benchmarking Continuous Space Perception Ability for Vision-Language Models
AdaMMS: Model Merging for Heterogeneous Multimodal Large Language Models with Unsupervised Coefficient Optimization
Yiyang Du, Xiaochen Wang, Chi Chen, Jiabo Ye, Yiru Wang,
Peng Li, Ming Yan, Ji Zhang, Fei Huang, Zhifang Sui, Maosong Sun, Yang Liu.
CVPR 2025. [
pdf] [
arXiv] [
code] [
Slides]
How Do Multimodal Large Language Models Handle Complex Multimodal Reasoning? Placing Them in An Extensible Escape Game
Ziyue Wang, Yurui Dong, Fuwen Luo, Minyuan Ruan, Zhili Cheng, Chi Chen,
Peng Li, Yang Liu.
ICCV 2025. [
arXiv] [
code] [
Slides] [
Video]
LVAgent: Long Video Understanding by Multi-Round Dynamical Collaboration of MLLM Agents
Boyu Chen, Zhengrong Yue, Siran Chen, Zikang Wang, Yang Liu,
Peng Li, Yali Wang.
ICCV 2025. [
arXiv]
Adversarial Robust Memory-Based Continual Learner
Xiaoyue Mi, Fan Tang, Zonghan Yang, Danding Wang, Juan Cao,
Peng Li, Yang Liu.
ICCV 2025. [
arXiv]
Contrastive Private Data Synthesis via Weighted Multi-PLM Fusion
Tianyuan Zou, Yang Liu,
Peng Li, Yufei Xiong, Jianqing Zhang, Jingjing Liu, Xiaozhou Ye, Ye Ouyang, Ya-Qin Zhang.
ICML 2025. [
arXiv]
Bench4Merge: A Comprehensive Benchmark for Merging in Realistic Dense Traffic with Micro-Interactive Vehicles
Zhengming Wang, Junli Wang, Pengfei Li, Zhaohan Li, Chunyang Liu, Bo Zhang,
Peng Li, Yilun Chen.
IROS 2025. [
arXiv] [
code] [
Video]
EditEval: Towards Comprehensive and Automatic Evaluation for Text-guided Video Editing
Bingshuai Liu, Ante Wang, Zijun Min, Chenyang Lyu, Longyue Wang, Zhihao Wang, Xu Han, Peng Li, Jinsong Su. ACMMM 2025.
Advancing Language Multi-Agent Learning with Credit Re-Assignment for Interactive Environment Generalization
FormaRL: Enhancing Autoformalization with no Labeled Data
Yanxing Huang, Xinling Jin, Sijie Liang, Fuwen Luo, Peng Li, Yang Liu. COLM 2025.
Agent-Environment Alignment via Automated Interface Generation
Kaiming Liu, Xuanyu Lei, Ziyue Wang,
Peng Li, Yang Liu.
Preprint. [
arXiv] [
Slides]
MUSEG: Reinforcing Video Temporal Understanding via Timestamp-Aware Multi-Segment Grounding
Fuwen Luo, Shengfeng Lou, Chi Chen, Ziyue Wang, Chenliang Li, Weizhou Shen, Jiyue Guo,
Peng Li, Ming Yan, Ji Zhang, Fei Huang, Yang Liu.
Preprint. [
arXiv]
Inference-Time Scaling for Generalist Reward Modeling
Zijun Liu, Peiyi Wang, Runxin Xu, Shirong Ma, Chong Ruan,
Peng Li, Yang Liu, Yu Wu.
Preprint. [
arXiv]
LiloDriver: A Lifelong Learning Framework for Closed-loop Motion Planning in Long-tail Autonomous Driving Scenarios
Huaiyuan Yao, Pengfei Li, Bu Jin, Yupeng Zheng, An Liu, Lisen Mu, Qing Su, Qian Zhang, Yilun Chen,
Peng Li.
Preprint. [
arXiv]
Visual Abstract Thinking Empowers Multimodal Reasoning
Dairu Liu, Ziyue Wang, Minyuan Ruan, Fuwen Luo, Chi Chen,
Peng Li, Yang Liu.
Preprint. [
arXiv]
A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond
Xiaoye Qu, Yafu Li, Zhaochen Su, Weigao Sun, Jianhao Yan, Dongrui Liu, Ganqu Cui, Daizong Liu, Shuxian Liang, Junxian He,
Peng Li, Wei Wei, Jing Shao, Chaochao Lu, Yue Zhang, Xian-Sheng Hua, Bowen Zhou, Yu Cheng.
Preprint. [
arXiv] [
Project Webpage]
DongbaMIE: A Multimodal Information Extraction Dataset for Evaluating Semantic Understanding of Dongba Pictograms
Xiaojun Bi, Shuo Li, Junyao Xing, Ziyue Wang, Fuwen Luo, Weizheng Qiao, Lu Han, Ziwei Sun,
Peng Li, Yang Liu.
Preprint. [
arXiv] [
code] [
Dataset]
Scaling External Knowledge Input Beyond Context Windows of LLMs via Multi-Agent Collaboration
Zijun Liu, Zhennan Wan,
Peng Li, Ming Yan, Ji Zhang, Fei Huang, Yang Liu.
Preprint. [
arXiv] [
code]
Writing-RL: Advancing Long-form Writing via Adaptive Curriculum Reinforcement Learning
Xuanyu Lei, Chenliang Li, Yuning Wu, Kaiming Liu, Weizhou Shen,
Peng Li, Ming Yan, Ji Zhang, Fei Huang, Yang Liu.
Preprint. [
arXiv] [
code]
MUCAR: Benchmarking Multilingual Cross-Modal Ambiguity Resolution for Multimodal Large Language Models
Xiaolong Wang, Zhaolu Kang, Wangyuxuan Zhai, Xinyue Lou, Yunghwei Lai, Ziyue Wang, Yawen Wang, Kaiyu Huang, Yile Wang,
Peng Li, Yang Liu.
Preprint. [
arXiv]