I am a Ph.D. student at HKU, supervised by Prof. Xihui Liu. Before that, I received my Bachelor’s degree from Renmin University of China, where I was supervised by Prof. Jun He (Renmin University of China) and Prof. Hongyan Liu (Tsinghua University). During my visit to HKUST, I was fortunate to be advised by Prof. Qifeng Chen, focusing on video generation. After that, I interned at MSRA, working with Dr. Junliang Guo and Tianyu He on video generation and world simulators.

My research interests lie in building video world models. I am specifically interested in building interactive, real-time, and consistent video generation models that can serve as world simulators.

I am a highly self-motivated student with a deep passion for research and coding. I am eager to work on a series of influential projects to advance video generation as a foundation for world simulators.

You can reach me at wuhaoyu556@connect.hku.hk.

🔥 News

2026.01.26: 🎉🎉 Geometry Forcing was accepted to ICLR 2026!
2025.09.22: 🎉🎉 Geometry Forcing was accepted to the NeurIPS 2025 NextVid Workshop!
2025.07.11: 🎉🎉 We release Geometry Forcing!
2025.02.26: 🎉🎉 VideoDPO was accepted to CVPR 2025!
2024.12.19: 🎉🎉 We make the VideoDPO paper and code public!
2024.11.01: 🎉🎉 We make the VideoTuna V0.1.0 public!
2023.07: 🎉🎉 EmoTalk was accepted to ICCV 2023.

📝 Publications

ICLR 2026

Geometry Forcing: Marrying Video Diffusion and 3D Representation for Consistent World Modeling

Haoyu Wu*, Diankun Wu*, Tianyu He, Junliang Guo, Yang Ye, Yueqi Duan, Jiang Bian

Paper Project Code

Geometry Forcing encourages video diffusion models to internalize latent 3D representations in order to bridge the gap between video diffusion models and the 3D nature of the real world.

CVPR 2025

VideoDPO: Omni-Preference Alignment for Video Diffusion Generation

Runtao Liu*, Haoyu Wu*, Ziqiang Zheng, Chen Wei, Yingqing He, Renjie Pi, Qifeng Chen

Paper Project Code

We propose a complete pipeline for fine-tuning video diffusion models with Direct Preference Optimization (DPO).

ICCV 2023

EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation

Ziqiao Peng, Haoyu Wu, Zhenbo Song, Hao Xu, Xiangyu Zhu, Hongyan Liu, Jun He, Zhaoxin Fan

Project

We propose an end-to-end neural network for speech-driven emotion-enhanced 3D facial animation.

Preprint

VGG-Tex: A Vivid Geometry-Guided Facial Texture Estimation Model for High Fidelity Monocular 3D Face Reconstruction

Haoyu Wu, Ziqiao Peng, Xukun Zhou, Yunfei Cheng, Jun He, Hongyan Liu, Zhaoxin Fan

📖 Education

2021.09 - 2025.07, Undergraduate student at Renmin University of China, Beijing, China.
2024.07 - 2025.01, Visiting student supervised by Prof. Qifeng Chen at HKUST, Hong Kong, China.

💻 Internships

2024.11 - 2025.07, ML Group, Microsoft Research Asia

📕 Teaching Experience

2024.09 - 2025.01, Teaching Assistant for Introduction to Computer System (I), Renmin University of China.

💬 Invited Talks

2023.01, “Introduction to Linux”, part of the “Missing Classes” series at the RUC Computer Association
2023.08, AITIME debate on 3D digital human development | [video]

🎖 Honors and Awards

2022.11, The Chinese Mathematical Competition, First Prize