About
I am currently a research engineer at Ant Group in Hangzhou, China. Previously, I had the privilege of pursuing a Ph.D. in Computer Science at Singapore Management University, under the supervision of Prof. Pradeep Varakantham. My research primarily focuses on training reinforcement learning (RL) agents and humans to generalize better, with an emphasis on making training more effective and efficient. For an example of my recent work, see CENIE. In addition, I explore the intersection of large language models (LLMs) and reinforcement learning, particularly tuning LLMs with RL-based techniques. A recent example is Dr.GRPO, where identifying a bias in the GRPO algorithm allowed us to improve R1-Zero-style training performance.

Before joining SMU, I completed my MSc in Electrical Engineering at the Viterbi School of Engineering, University of Southern California, where I discovered my passion for machine learning and AI research. Outside of my academic pursuits, I enjoy sports, travelling, reading and supporting FC Bayern Munich.


News

  • [05/2026] One paper accepted to ICML 2026.
  • [04/2026] One paper accepted to AIED 2026.
  • [07/2025] Our paper Dr.GRPO has been accepted to COLM 2025.
  • [05/2025] One paper accepted to ACL-main 2025.
  • [01/2025] One paper accepted to NAACL-findings 2025.
  • [12/2024] One paper accepted to AAAI 2025 for an oral presentation.
  • [09/2024] My paper has been accepted to NeurIPS 2024 for an oral presentation!
  • [06/2024] I started my internship at Huawei, researching LLM Agents.
  • [08/2020] I started my Ph.D. journey at SMU.

Selected Publications
Large Language Models
HiMAP-Travel: Hierarchical Multi-Agent Planning for Long-Horizon Constrained Travel
The Viet Bui*, Wenjun Li*, Yong Liu
International Conference on Machine Learning (ICML), 2026
(Paper)
Understanding R1-Zero-Like Training: A Critical Perspective
Zichen Liu*, Changyu Chen*, Wenjun Li*, Penghui Qi*, Tianyu Pang, Chao Du, Wee Sun Lee, Min Lin
The 2nd Conference on Language Modeling (COLM), 2025
(Paper) (Code)
Adaptive Tool Use in Large Language Models with Meta-Cognition Trigger
Wenjun Li, Dexun Li, Kuicai Dong, Cong Zhang, Hao Zhang, Weiwen Liu, Yasheng Wang, Ruiming Tang, Yong Liu
The 63rd Annual Meeting of the Association for Computational Linguistics (ACL-main), 2025
(Paper)
Unlocking the Planning Capabilities of Large Language Models with Maximum Diversity Fine-tuning
Wenjun Li, Changyu Chen, Pradeep Varakantham
Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics (NAACL-Findings), 2025
(Paper)

Reinforcement Learning
RL Agents Reveal What's Hard: Bootstrapping Difficulty-Ordered Curricula for Human Learners
Sidney Tio*, Wenjun Li*, Ramesha Karusena, Pradeep Varakantham
International Conference on Artificial Intelligence in Education (AIED), 2026
(Paper Releasing Soon)
Marginal Benefit Driven RL Teacher for Unsupervised Environment Design
Dexun Li, Wenjun Li, Pradeep Varakantham
Association for the Advancement of Artificial Intelligence (AAAI Oral), 2025
(Paper)
Improving Environment Novelty Quantification for Effective Unsupervised Environment Design
Jayden Teoh*, Wenjun Li*, Pradeep Varakantham
Advances in Neural Information Processing Systems (NeurIPS Oral), 2024
(Paper)
Unsupervised Training Sequence Design: Efficient and Generalizable Agent Training
Wenjun Li, Pradeep Varakantham
Association for the Advancement of Artificial Intelligence (AAAI), 2024
(Paper) (Website)
Generalization through Diversity: Improving Unsupervised Environment Design
Wenjun Li, Pradeep Varakantham, Dexun Li
International Joint Conference on Artificial Intelligence (IJCAI), 2023
(Paper) (Website)



Education
Singapore Management University - Ph.D. in Computer Science - (2020.08-2025.05)


University of Southern California - MSc in Electrical Engineering - (2018.08-2020.05)


China Jiliang University - BSc in Optoelectronic Information Science and Engineering - (2014.09-2018.06)



Honors
  • Graduate: SMU Presidential Doctoral Fellowship 2023; SMU Presidential Doctoral Fellowship 2024.

  • Undergraduate: Excellent Graduate; 1st Class Scholarship for Academic Excellence.


Invited Talk
  • "Training Robust Reinforcement Learning Agents via Environment Generation" at Telenor Research.


Additional Information
  • Languages: Mandarin, English, German.

  • Reviewing: ICML 2026, ICLR 2025, IJCAI 2023.

  • Interests: Football, Badminton, Travelling, Reading.