Yifu Yuan

I am a second-year PhD student at the Deep Reinforcement Learning (DRL) Lab, Tianjin University (TJU), advised by Prof. Jianye Hao and co-advised by Prof. Yan Zheng. I interned at NetEase Fuxi AI Lab (Fuxi) during 2023, where I worked with Dr. Yujing Hu. I am currently an intern at Tencent AI Lab, advised by Prof. Zhongwen Xu. I received my Bachelor's degree from the Dalian University of Technology (DUT), advised by Prof. Guozhen Tan.

Research interest

I am broadly interested in building embodied agents for decision making. To this end, my current research interests include Embodied AI, Diffusion Models, Reinforcement Learning, and LLM agents. I hope to ride the wave of AI changing the world.

News

May 17, 2025

One paper accepted by ACL 2025 Findings
May 1, 2025

Two papers accepted by ICML 2025
Jan 22, 2025

One paper accepted by ICLR 2025
Jan 20, 2025

One paper accepted by WWW 2025 as an Oral Presentation
Jan 9, 2025

Invited Talk: Diffusion Models for Decision Making @ BeNeRL Workshop [Link]
Jan 8, 2025

Received the BYD Scholarship
Jan 4, 2025

Supported by the China Association for Science and Technology Youth Ph.D. Talent Support Project, first cohort, with the China Computer Federation (CCF) as the supporting society (首届中国科协青年人才托举工程博士生专项计划)
Dec 14, 2024

Invited Talk on Auto Bidding as AIGB Track Winner @ NeurIPS 2024 Auto-Bidding in Large-Scale Auctions Competition [Link]
Nov 20, 2024

One paper accepted by IEEE Transactions on Emerging Topics in Computational Intelligence
Nov 4, 2024

Supported by the CIE-Tencent Doctoral Research Incentive Project, first cohort (首届中国电子学会—腾讯博士生科研激励计划; 17 recipients nationwide, research grant of CNY 100,000)
Sept 26, 2024

Three papers accepted by NeurIPS 2024
Aug 04, 2024

Invited Talk: Diffusion Models for Decision Making @ THU MOS [Link]
May 5, 2024

Supported by the Tencent Hornbill Elite Talent Programme (腾讯犀牛鸟精英人才计划)
May 2, 2024

One paper accepted by ICML 2024
Jan 15, 2024

Two papers accepted by ICLR 2024
Dec 03, 2023

Invited Talk: Unsupervised Reinforcement Learning with World Model @ DAI 2023 Conference
May 10, 2023

One paper accepted by ICML 2023
Jan 20, 2023

One paper accepted by ICLR 2023

Awards

National Scholarship (国家奖学金)
Outstanding Graduate of Dalian City (大连市优秀毕业生, top 4%)
Tencent Hornbill Elite Talent Programme (腾讯犀牛鸟精英人才计划)
BYD Scholarship (比亚迪奖学金, awarded to 7 Ph.D. students each year)
CIE-Tencent Doctoral Research Incentive Project, first cohort (首届中国电子学会—腾讯博士生科研激励计划)
China Association for Science and Technology Youth Ph.D. Talent Support Project, first cohort, supported by the CCF (首届中国科协青年人才托举工程博士生专项计划)

Publications and preprints

Papers sorted by recency. Authors with equal contribution are marked by *. Representative papers are highlighted.

From Seeing to Doing: Bridging Reasoning and Decision for Robotic Manipulation
Yifu Yuan, Haiqin Cui, Yibin Chen, Zibin Dong, Fei Ni, Longxin Kou, Jinyi Liu, Pengyi Li, Yan Zheng, Jianye Hao
arXiv
paper

Embodied-R1: Reinforced Embodied Reasoning for General Robotic Manipulation
Yifu Yuan, Haiqin Cui, Yaoting Huang, Yibin Chen, Fei Ni, Zibin Dong, Pengyi Li, Yan Zheng, Jianye Hao
arXiv
paper

AhaRobot: A Low-Cost Open-Source Bimanual Mobile Manipulator for Embodied AI
Haiqin Cui*, Yifu Yuan*, Yan Zheng, Jianye Hao
Submitted to IROS 2025
project page / paper / ros / firmwares / hardware / assembly guide / demo

MODULI: Unlocking Preference Generalization via Diffusion Models for Offline Multi-Objective Reinforcement Learning
Yifu Yuan, Zhenrui Zheng, Zibin Dong, Jianye Hao
International Conference on Machine Learning (ICML), 2025
paper / code

R*: Efficient Reward Design via Reward Structure Evolution and Parameter Alignment Optimization with Large Language Models
Pengyi Li, Jianye Hao, Hongyao Tang, Yifu Yuan, Jinbin Qiao, Zibin Dong, Yan Zheng
International Conference on Machine Learning (ICML), 2025
paper

Entropy-based Activation Function Optimization: A Method on Searching Better Activation Functions
Haoyuan Sun, Zihao Wu, Bo Xia, Pu Chang, Zibin Dong, Yifu Yuan, Yongzhe Chang, Xueqian Wang
International Conference on Learning Representations (ICLR), 2025
paper

SheetAgent: A Generalist Agent for Spreadsheet Reasoning and Manipulation via Large Language Models
Yibin Chen*, Yifu Yuan*, Zeyu Zhang, Yan Zheng, Jinyi Liu, Fei Ni, Jianye Hao
The ACM Web Conference (WWW), 2025, Oral Presentation (Top 5%)
IJCAI 2024 Automates Workshop
project page / paper / code / benchmark

MENTOR: Guiding Hierarchical Reinforcement Learning with Human Feedback and Dynamic Distance Constraint
Xinglin Zhou*, Yifu Yuan*, Shaofu Yang, Jianye Hao
IEEE Transactions on Emerging Topics in Computational Intelligence (JCR Q1)
paper / code

CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making
Zibin Dong*, Yifu Yuan*, Jianye Hao, Fei Ni, Yi Ma, Pengyi Li, Yan Zheng
Conference on Neural Information Processing Systems (NeurIPS Datasets and Benchmarks Track), 2024
paper / code / docs

DiffuserLite: Towards Real-time Diffusion Planning
Zibin Dong, Jianye Hao, Yifu Yuan, Fei Ni, Yitian Wang, Pengyi Li, Yan Zheng
Conference on Neural Information Processing Systems (NeurIPS), 2024
project page / paper / code

PERIA: Perceive, Reason, Imagine, Act via Holistic Language and Vision Planning for Manipulation
Fei Ni, Jianye Hao, Shiguang Wu, Longxin Kou, Yifu Yuan, Zibin Dong, Jinyi Liu, MingZhi Li, Yan Zheng, Yuzheng Zhuang
Conference on Neural Information Processing Systems (NeurIPS), 2024
project page / paper

KISA: A Unified Keyframe Identifier and Skill Annotator for Long-Horizon Robotics Demonstrations
Longxin Kou, Fei Ni, Yan Zheng, Jinyi Liu, Yifu Yuan, Zibin Dong, Jianye Hao
International Conference on Machine Learning (ICML), 2024
project page / paper

CriticGPT: Multimodal LLM as a Critic for Robot Manipulation
Jinyi Liu*, Yifu Yuan*, Jianye Hao, Fei Ni, Lingzhi Fu, Yibin Chen, Yan Zheng
AAAI 2024 RL+LLMs Workshop
project page / paper

Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback
Yifu Yuan, Jianye Hao, Yi Ma, Zibin Dong, Hebin Liang, Jinyi Liu, Zhixin Feng, Kai Zhao, Yan Zheng
International Conference on Learning Representations (ICLR), 2024
project page / paper / benchmark & dataset / platform

AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model
Zibin Dong*, Yifu Yuan*, Jianye Hao, Fei Ni, Yao Mu, Yan Zheng, Yujing Hu, Tangjie Lv, Changjie Fan, Zhipeng Hu
International Conference on Learning Representations (ICLR), 2024
NeurIPS Diffusion Workshop, 2023
project page / paper / code / dataset

ED2: Environment Dynamics Decomposition World Models for Continuous Control
Yifu Yuan, Hongyao Tang, Cong Wang, Yan Zheng, Jianye Hao
Submitted to TNNLS
paper / code

MetaDiffuser: Diffusion Model as Conditional Planner for Offline Meta-RL
Fei Ni, Jianye Hao, Yao Mu, Yifu Yuan, Yan Zheng, Bin Wang, Zhixuan Liang
International Conference on Machine Learning (ICML), 2023
project page / paper

EUCLID: Towards Efficient Unsupervised Reinforcement Learning with Multi-choice Dynamics Model
Yifu Yuan, Jianye Hao, Fei Ni, Yao Mu, Yan Zheng, Yujing Hu, Jinyi Liu, Yingfeng Chen, Changjie Fan
International Conference on Learning Representations (ICLR), 2023
NeurIPS DeepRL Workshop, 2022
project page / pretrained model / paper / code

Misc. open-source projects

AhaRobot
Jan 2025
paper / project page / ros code / firmwares / hardware / assembly guide / demo

CleanDiffuser Library
June 2024
paper / code / docs

Uni-RLHF Platform
Oct 2023
project page / code / minimal version

Clean-Offline-RLHF
Oct 2023
project page / code

Education & Internship Experiences

2024.07 -

AI Researcher (Intern, Hornbill Programme), Tencent AI Lab, Tencent, Shenzhen

(advised by Zhongwen Xu)
2023.09 -

PhD student, College of Intelligence and Computing, Tianjin University

(advised by Jianye Hao)
2022.11 - 2024.05

AI Researcher (Intern), Fuxi AI Lab, NetEase, Hangzhou

(advised by Yujing Hu)
2021.09 - 2023.06

Master's student, College of Intelligence and Computing, Tianjin University

(advised by Jianye Hao; transferred to the PhD program)
2017.09 - 2021.06

Bachelor's degree, Electronic Information and Electrical Engineering Department, Dalian University of Technology

(advised by Guozhen Tan)

Invited Talks

2024.12

Retrospect the Past: Diffusion Bidding Policy Optimization with Action Relabeling via Ensemble Q-Learning

Auto-Bidding in Large-Scale Auctions Competition (NeurIPS 2024), Canada
AIGB Track Winner (CleanDiffuser Team) Presentation
2024.07

Diffusion Models for Decision Making

Nanjing University
2023.12

Unsupervised Reinforcement Learning with World Model

The 5th International Conference on Distributed Artificial Intelligence (DAI 2023), Singapore
2023.06

EUCLID: Towards Efficient Unsupervised Reinforcement Learning with Multi-choice Dynamics Model

RLChina Seminar
2023.03

EUCLID: Towards Efficient Unsupervised Reinforcement Learning with Multi-choice Dynamics Model

AI Time Seminar

Academic Service

Reviewer: ICML 2024/2025, NeurIPS 2024/2025, ICLR 2025, ICCV 2025, IROS 2025, UAI 2024, TNNLS

Contact

I welcome any kind of collaboration or discussion. Feel free to contact me via email at
yuanyf [at] tju.edu.cn.

Yifu Yuan (袁逸夫)