最后更新于:2025年01月02日 20:00
Last updated on: 2025-01-02 20:00
Welcome
👋 Hi! My name is Junyao Hu (胡钧耀).
🔍 My research interests include deep learning and computer vision, particularly focusing on generative AI for creativity.
🥰 Please feel free to make any suggestions. Any questions and inquiries about my work and study life are welcome. You can contact with me in following ways.
- GitHub (JunyaoHu)
- Email (hujunyao0329 AT gmail DOT com)
- WeChat (ID: LittleDream_hjy, and you can see the QR code when floating on the WeChat icon)
📃 More details:
- 中文个人使用说明书: 这里有一些无法写在简历中,但能让你更好地认识我的内容。
- English CV: Last updated on 2024-12-05.
- Google Scholar
News
2024-09-09 ✒️ Study I quit my PhD and went back to my hometown, Enshi, to take my gap year. I am trying to think about the meaning of life and find the road that suits me. I would like to express my gratitude to my supervisor and coworkers for their encouragement and guidance throughout this year.
2024-05-05 💼 Activity I attended VALSE 2024 (Chongqing, China).
2024-02-27 😋 Accepted A paper was accepted to CVPR 2024, and I was honored as one of Outstanding Reviewers (top 2%).
2023-09-01 ✒️ Study I started my PhD studying at Nankai University (NKU) under the supervision of Prof. Jufeng Yang.
2023-07-15 ✒️ Study I ended my undergraduate life at China University of Mining and Technology (CUMT), thanks to all teachers and friends around me, especially my parents.
2023-06-10 💼 Activity I attended VALSE 2023 (Wuxi, China).
Selected Publications
If you want to view my all publications, click here.
Note: #
= Equal Contribution , *
= Corresponding Author.
CVPR24 ExtDM: Distribution extrapolation diffusion model for video prediction
Zhicheng Zhang#, Junyao Hu#, Wentao Cheng*, Danda Paudel, Jufeng Yang
TL;DR: We present ExtDM, a new diffusion model that extrapolates video content from current frames by accurately modeling distribution shifts towards future frames.
📘 CVPR 📃 Paper 📃 中译版 📦 Code ⚒️ Project 📊 Poster 📅 Slide 🎞️ Bilibili 🎞️ YouTube
Details
Abstract: Video prediction is a challenging task due to its nature of uncertainty, especially for forecasting a long period. To model the temporal dynamics, advanced methods benefit from the recent success of diffusion models, and repeatedly refine the predicted future frames with 3D spatiotemporal U-Net. However, there exists a gap between the present and future and the repeated usage of U-Net brings a heavy computation burden. To address this, we propose a diffusion-based video prediction method that predicts future frames by extrapolating the present distribution of features, namely ExtDM. Specifically, our method consists of three components: (i) a motion autoencoder conducts a bijection transformation between video frames and motion cues; (ii) a layered distribution adaptor module extrapolates the present features in the guidance of Gaussian distribution; (iii) a 3D U-Net architecture specialized for jointly fusing guidance and features among the temporal dimension by spatiotemporal-window attention. Extensive experiments on five popular benchmarks covering short- and long-term video prediction verify the effectiveness of ExtDM.BibTex1 |
|
Academic Service
Projects
- common_metrics_on_video_quality
Integrated FVD, PSNR, SSIM and LPIPS metrics for assessing the quality of generated videos easily. - academic-project-page-template-vue
Developed a new project homepage template based on Vue, supporting one-click copy BibTex citation.
Reviewer
- Conference: CVPR’{24-25} (2024 Outstanding Reviewer, top 2%) , ACMMM’23
- Transaction: TMM’23
Teaching Assistant
- High School Student Talent Plan 2024: Guided students to learn Python programming, provided advice for their practical projects.
- Freshman Education Lecture 2022: Offered actionable advice to freshmen on thriving academically and socially in college on my experience.
- Discrete Mathematics 2021: Strengthen students to grasp concepts through targeted review sessions enriched with example problems.