I received my Bachelor’s degree from Renmin University of China, where I worked under the supervision of Prof. Jun He from Renmin University of China and Prof. Hongyan Liu from Tsinghua University. During my visit to HKUST, I was fortunate to be advised by Prof. Qifeng Chen, focusing on video generation. After that, I was an intern at MSRA, working with Dr. Junliang Guo and Tianyu He focusing on video generation and world simulator.

My research interests lie in diffusion models, video generation, and world modeling. I am currently interested in build interactive, real-time, and consistent video generation models that can serve as world simulators.

I am actively seeking PhD opportunities for Spring/Fall 2026. I am a highly self-motivated student with a deep passion for research and coding. I am eager to work on a series of meaningful projects to advance video generation as a foundation for world simulators.

🔥 News

  • 2025.07.11:  🎉🎉 We release Geometry Forcing!
  • 2025.02.26:  🎉🎉 VideoDPO was accepted by CVPR2025!
  • 2024.12.19:  🎉🎉 We make the VideoDPO paper and code public!
  • 2024.11.01:  🎉🎉 We make the VideoTuna V0.1.0 public!
  • 2023.07:  🎉🎉 Emotalk is accepted by ICCV23.

📝 Publications

Arxiv 2025
sym

Geometry Forcing: Marrying Video Diffusion and 3D Representation for Consistent World Modeling

Haoyu Wu*, Diankun Wu*, Tianyu He, Junliang Guo, Yang Ye, Yueqi Duan, Jiang Bian

Paper Project Code

  • Geometry Forcing encourages video diffusion models to internalize latent 3D representations in order to bridge the gap between video diffusion models and the 3D nature of the real world.
CVPR 2025
sym

VideoDPO: Omni-Preference Alignment for Video Diffusion Generation

Runtao Liu*, Haoyu Wu*, Ziqiang Zheng, Chen Wei, Yingqing He, Renjie Pi, Qifeng Chen

Paper Project Code

  • We propose a whole pipeline for DPO finetuning video diffusion models.
ICCV 2023
sym

EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation

Ziqiao Peng, Haoyu Wu, Zhenbo Song, Hao Xu, Xiangyu Zhu, Hongyan Liu, Jun He, Zhaoxin Fan Project

  • We propose an end-to-end neural network for speech-driven emotion-enhanced 3D facial animation.

📖 Educations

  • 2021.09 - 2025.07, Undergraduate student at Renmin University of China, Beijing, China.
  • 2024.07 - 2025.01, Visiting student supervised by Prof. Qifeng Chen at HKUST, Hong Kong, China.

💻 Internships

  • 2024.11-2025.07, ML Group, Microsoft Research Asia

📕 Teaching Experiences

  • 2024.09 - 2025.01, Teaching Assistant of Introduction to Computer System (I), Renmin University of China.

💬 Invited Talks

  • 2023.01, “Introduction to Linux” of ”Missing Classes” series in RUC Computer Association
  • 2023.08, AITIME Debate about 3D digital human development | [video]

🎖 Honors and Awards

  • 2022.11 The Chinese Mathematical Competition,First Prize