I am a second-year PhD student at Deep Reinforcement Learning (DRL) Lab, Tianjin University (TJU), advised by Prof. Jianye Hao. I am also co-advised by Prof. Yan Zheng. I interned at Netease Fuxi AI Lab (Fuxi) during 2023, where I worked with Dr. Yujing Hu. Now I intern at Tencent AI Lab, advised by Prof. Zhongwen Xu. I received my Bachelor's degree from the Dalian University of Technology (DUT), advised by Prof. Guozhen Tan.
Research interest
I am broadly interested in research on the building embodied agents for decision making. To achieve this, my current research interest contains Model-based reinforcement learning, Diffusion Model, RLHF and LLM agents. I hope to ride the wave of AI changing the world.
News
-
Jan 22, 2025One paper accepted by ICLR 2025
-
Jan 20, 2025One paper accepted by WWW 2025 Oral Presentation
-
Jan 9, 2025Invited Talks: Diffusion Models for Decision Making @ BeNeRL Workshop [Link]
-
Jan 8, 2025Received the BYD Scholarship
-
Jan 4, 2025Supported by China Association for Science and Technology Youth Ph.D Talent Support Project, Supported by CCF (首届中国科协青年人才托举工程博士生专项计划, 托举学会:中国计算机学会)
-
Dec 14, 2024Invited Talks on Auto Bidding as AlGB Track Winner @ NeurIPS 2024 Auto-Bidding in Large-Scale Auctions Competition [Link]
-
Nov 20, 2024One papers accepted by IEEE Transactions on Emerging Topics in Computational Intelligence
-
Nov 4, 2024Supported by CIE-Tencent Doctoral Research Incentive Project (首届中国电子学会—腾讯博士生科研激励计划,全国17人,科研基金10万)
-
Sept 26, 2024Three papers accepted by NeurIPS 2024
-
Aug 04, 2024Invited Talks: Diffusion Models for Decision Making @ THU MOS [Link]
-
May 5, 2024Supported by Tencent Hornbill Elite Talent Programme (腾讯犀牛鸟精英人才计划)
-
May 2, 2024One papers accepted by ICML 2024
-
Jan 15, 2024Two papers accepted by ICLR 2024
-
Dec 03, 2023Invited Talks: Unsupervised Reinforcement Learning with World Model @ DAI2023 Conference
-
May 10, 2023One papers accepted by ICML 2023
-
Jan 20, 2023One papers accepted by ICLR 2023
Awards
-
National Scholarship (国家奖学金)
-
Outstanding Graduates of Dalian City (大连市优秀毕业生,Top 4%)
-
Tencent Hornbill Elite Talent Programme (腾讯犀牛鸟精英人才计划)
-
BYD Scholarship (比亚迪奖学金, 7 Ph.D students each year)
-
First batch CIE-Tencent Doctoral Research Incentive Project (首届中国电子学会—腾讯博士生科研激励计划)
-
The First China Association for Science and Technology Youth Ph.D Talent Support Project, Supported by the CCF (首届中国科协青年人才托举工程博士生专项计划)
Publications and preprints
Papers sorted by recency. Authors with equal contribution are marked by *. Representative papers are highlighted.
Yifu Yuan*, Zhenrui Zheng*, Zibin Dong, Jianye HAO
Preprint
arXiv
Haoyuan Sun, Zihao Wu, Bo Xia, Pu Chang, Zibin Dong, Yifu Yuan, Yongzhe Chang, Xueqian Wang
International Conference on Learning Representations (ICLR), 2025
arXiv
Yibin Chen*, Yifu Yuan*, Zeyu Zhang, Yan Zheng, Jinyi Liu, Fei Ni, Jianye HAO
The ACM Web Conference (WWW), 2025, Oral Presentation (Top4%)
IJCAI2024 Automates Workshop
project page / arXiv
Xinglin Zhou*, Yifu Yuan*, Shaofu Yang, Jianye Hao
IEEE Transactions on Emerging Topics in Computational Intelligence
arXiv
Zibin Dong*, Yifu Yuan*, Jianye HAO, Fei Ni, Yi Ma, Pengyi Li, Yan Zheng
Conference on Neural Information Processing Systems (NeurIPS Datasets and Benchmarks Track), 2024
arXiv / code / docs
Zibin Dong, Jianye Hao, Yifu Yuan, Fei Ni, Yitian Wang, Pengyi Li, Yan Zheng
Conference on Neural Information Processing Systems (NeurIPS), 2024
project page / arXiv
Fei Ni, Jianye HAO, Shiguang Wu, Longxin Kou, Yifu Yuan, Zibin Dong, Jinyi Liu, MingZhi Li, YAN ZHENG, Yuzheng Zhuang
Conference on Neural Information Processing Systems (NeurIPS), 2024
project page / pdf
Longxin Kou, Fei Ni, YAN ZHENG, Jinyi Liu, Yifu Yuan, Zibin Dong, Jianye HAO
International Conference on Machine Learning (ICML), 2024
Jinyi Liu*, Yifu Yuan*, Jianye HAO, Fei Ni, Lingzhi Fu, Yibin Chen, Yan Zheng
AAAI2024 RL+LLMs Workshop
project page / arXiv
Yifu Yuan, Jianye Hao, Yi Ma, Zibin Dong, Hebin Liang, Jinyi Liu, Zhixin Feng, Kai Zhao, Yan Zheng
International Conference on Learning Representations (ICLR), 2024
project page / arXiv / benchmark / platform
Zibin Dong*, Yifu Yuan*, Jianye Hao, Fei Ni, Yao Mu, Yan Zheng, Yujing Hu, Tangjie Lv, Changjie Fan, Zhipeng Hu
International Conference on Learning Representations (ICLR), 2024
NeurIPS Diffusion Workshop, 2023
project page / arXiv / code / dataset
Jianye Hao, Yifu Yuan* (student first author), Cong Wang, Zhen Wang
Submitted to TNNLS
arXiv / code
Fei Ni, Jianye Hao, Yao Mu, Yifu Yuan, Yan Zheng, Bin Wang, Zhixuan Liang
International Conference on Machine Learning (ICML), 2023
project page / arXiv
Yifu Yuan, Jianye Hao, Fei Ni, Yao Mu, Yan Zheng, Yujing Hu, Jinyi Liu, Yingfeng Chen, Changjie Fan
International Conference on Learning Representations (ICLR), 2023
NeurIPS DeepRL workshop, 2022
project page / pretrained model / arXiv / code
Misc. open-source projects
Education & Internship Experiences
-
2024.07 -AI Researcher (Intern, Hornbill Programme), Tencent AI Lab, Tencent, ShenZhen(advised by Zhongwen Xu)
-
2023.09 -Phd student, College of Intelligence and Computing, Tianjin University(advised by Jianye Hao)
-
2022.11 - 2024.05AI Researcher (Intern), Fuxi AI Lab, NetEase, HangZhou(advised by Yujing Hu)
-
2021.09 - 2023.06Master, College of Intelligence and Computing, Tianjin University(advised by Jianye Hao, transfer to a PhD student)
-
2017.09 - 2021.06Bachelor, Electronic Information and Electrical Engineering Department, Dalian University of Technology(advised by Guozhen Tan)
Invited Talks
-
2024.12Retrospect the Past: Diffusion Bidding Policy Optimization with Action Relabeling via Ensemble Q-LearningNeurIPS 2024 Auto-Bidding in Large-Scale Auctions (NeurIPS 2024), Canada
AlGB Track Winner (CleanDiffuser Team) Presentation
-
2024.7Diffusion Models for Decision MakingNanjing University
-
2023.12Unsupervised Reinforcement Learning with World ModelThe 5th International Conference on Distributed Artificial Intelligence (DAI 2023), Singapore
-
2023.06EUCLID: Towards Efficient Unsupervised Reinforcement Learning with Multi-choice Dynamics ModelRLChina Seminarvideo
-
2023.03EUCLID: Towards Efficient Unsupervised Reinforcement Learning with Multi-choice Dynamics ModelAI Time Seminar