Research
I'm interested in reinforcement learning, large model agents, and multi-agent systems.
Throughout my academic career, I have been dedicated to developing advanced reinforcement learning algorithms
to enhance the sequential decision-making capabilities of autonomous agents and multi-agent systems
in dynamic environments.
Join Us
I am looking for Master's students at SJTU SAI and self-motivated research interns (paid).
Contact me if you are interested in the above topics.
|
Recent Papers
|
RDHNet: addressing rotational and permutational symmetries in continuous multi-agent systems
Dongzi Wang, Lilan Huang, Muning Wen, Yuanxi Peng, Minglong Li, Teng Li
Frontiers of Computer Science, 19(11):1911365, 2025
Springer
|
HammerBench: Fine-Grained Function-Calling Evaluation in Real Mobile Assistant Scenarios
Jun Wang, Jiamu Zhou, Xihuai Wang, Xiaoyun Mo, Haoyu Zhang, Qiqiang Lin, Cheng Jin, Muning Wen, Weinan Zhang, Qiuying Peng, Jun Wang
Findings of the Association for Computational Linguistics: ACL 2025, 3350-3376, 2025
ACL
|
Autonomous goal detection and cessation in reinforcement learning: A case study on source term estimation
Yiwei Shi, Muning Wen, Qi Zhang, Weinan Zhang, Cunjia Liu, Weiru Liu
AAAI Conference on Artificial Intelligence, 39(1):738-745, 2025
AAAI
|
Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning
Shangding Gu, Laixi Shi, Muning Wen, Ming Jin, Eric Mazumdar, Yuejie Chi, Adam Wierman, Costas Spanos
The Thirteenth International Conference on Learning Representations (ICLR), 2025
ICLR
|
PMAT: Optimizing Action Generation Order in Multi-Agent Reinforcement Learning
Kun Hu, Muning Wen, Xihuai Wang, Shao Zhang, Yiwei Shi, Minne Li, Minglong Li, Ying Wen
Proceedings of the 24th International Conference on Autonomous Agents and Multiagent Systems, 997-1005, 2025
AAMAS
|
Robust function-calling for on-device language model via function masking
Qiqiang Lin, Muning Wen, Qiuying Peng, Guanyu Nie, Junwei Liao, Jun Wang, Xiaoyun Mo, Jiamu Zhou, Cheng Cheng, Yin Zhao, et al.
The Thirteenth International Conference on Learning Representations (ICLR), 2025
ICLR
|
Reinforcing LLM Agents via Policy Optimization with Action Decomposition
Muning Wen, Ziyu Wan, Jun Wang, Weinan Zhang, Ying Wen
Advances in Neural Information Processing Systems (NeurIPS), 37:103774-103805, 2024
NeurIPS
|
TRAD: Enhancing LLM Agents with Step-wise Thought Retrieval and Aligned Decision
Ruiwen Zhou, Yingxuan Yang, Muning Wen, Ying Wen, Wenhao Wang, Chunling Xi, Guoqiang Xu, Yong Yu, Weinan Zhang
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 3-13, 2024
SIGIR
|
AlphaZero-Like Tree-Search can Guide Large Language Model Decoding and Training
Ziyu Wan, Xidong Feng, Muning Wen, Stephen Marcus McAleer, Ying Wen, Weinan Zhang, Jun Wang
International Conference on Machine Learning (ICML), 49890-49920, 2024
ICML
|
Safe multiagent learning with soft constrained policy optimization in real robot control
Shangding Gu, Dianye Huang, Muning Wen, Guang Chen, Alois Knoll
IEEE Transactions on Industrial Informatics, 20(9):10706-10716, 2024
IEEE TII
|
Romat: Role-based multi-agent transformer for generalizable heterogeneous cooperation
Dongzi Wang, Fangwei Zhong, Minglong Li, Muning Wen, Yuanxi Peng, Teng Li, Adam Yang
Neural Networks, 174:106129, 2024
Elsevier
|
Large sequence models for sequential decision-making: a survey
Muning Wen, Runji Lin, Hanjing Wang, Yaodong Yang, Ying Wen, Luo Mai, Jun Wang, Haifeng Zhang, Weinan Zhang
Frontiers of Computer Science, 17(6):176349, 2023
Springer
|
MALib: A parallel framework for population-based multi-agent reinforcement learning
Ming Zhou, Ziyu Wan, Hanjing Wang, Muning Wen, Runzhe Wu, Ying Wen, Yaodong Yang, Yong Yu, Jun Wang, Weinan Zhang
Journal of Machine Learning Research, 24(150):1-12, 2023
JMLR
|
Multi-agent reinforcement learning is a sequence modeling problem
Muning Wen, Jakub Kuba, Runji Lin, Weinan Zhang, Ying Wen, Jun Wang, Yaodong Yang
Advances in Neural Information Processing Systems (NeurIPS), 35:16509-16521, 2022
NeurIPS
|
Offline pre-trained multi-agent decision transformer
Linghui Meng, Muning Wen, Chenyang Le, Xiyun Li, Dengpeng Xing, Weinan Zhang, Ying Wen, Haifeng Zhang, Jun Wang, Yaodong Yang, et al.
Machine Intelligence Research, 20(2):233-248, 2022
Springer
|
Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning
Jakub Grudzien Kuba, Ruiqing Chen, Muning Wen, Ying Wen, Fanglei Sun, Jun Wang, Yaodong Yang
International Conference on Learning Representations (ICLR), 2022
ICLR
|
Settling the variance of multi-agent policy gradients
Jakub Grudzien Kuba, Muning Wen, Linghui Meng, Haifeng Zhang, David Mguni, Jun Wang, Yaodong Yang, et al.
Advances in Neural Information Processing Systems, 34:13458-13470, 2021
NeurIPS
|
This website is built and modified based on Jon Barron's open-source template. Many thanks!