PhD (CUHK), MS (UCAS), BEng (Xidian)
Senior researcher
Huawei Noah's Ark Lab
Email: zhensongzhang[at]hotmail[dot]com
I joined Huawei Noah's Ark Lab after I obtained my Ph.D. degree from The Chinese University of Hong Kong in 2018. Before that, I received a BEng. degree and a M.S. degree from Xidian University and University of Chinese Academy of Sciences in 2011 and 2014, respectively. I am currently working on Visual Language Model/3DGS/GenAI.
We are looking for self-motivated interns and full times, if you are insterested in doing cool VLM/AIGC projects, welcome to join us, please drop me an email.
[01/2026] Our CHROMA paper is accepted to ICLR, congrats to all coauthors!
[12/2025] Our iCo3D paper is accepted to IJCV, congrats to all coauthors!
[11/2025] Our egocentric intent disambiguation paper is accepted to AAAI 2026, congrats to all coauthors!
[11/2025] Our SCENIC paper is accepted to 3DV 2026, congrats to all coauthors!
[09/2025] Our ViDAR paper is accepted to NeurIPS 2025, congrats to all coauthors!
[07/2025] Our survey paper on Human Motion Video Generation is accepted to TPAMI, congrats to all coauthors!
[07/2025] One paper is accepted to ICCV 2025, congrats to all coauthors!
[06/2025] We won the second place award in HD-EPIC VQA Challenges 2025, congrats to all coauthors!
[04/2025] Our paper on Video Human Motion In-betweening is accepted to IJCAI 2025, congrats to all coauthors!
[03/2025] Our CaricatureBooth is accepted to CVPR 2025, congrats to all coauthors!
[01/2025] One paper is accepted to ICASSP 2025, congrats to all coauthors!
[09/2024] One paper is accepted to NeurIPS 2024, congrats to all coauthors!
[02/2024] Three papers are accepted to CVPR 2024, congrats to all coauthors!
[12/2023] One paper is accepted to ICASSP 2024.
[10/2023] We won the Reproducibility Award in GENEA Challenge 2023.
[09/2023] Our work on robust monocular depth estimation is accepted to IJCV.
[07/2023] Our UnifiedGesture is accepted to ACM MM 2023.
[05/2023] Our DiffuseStyleGesture is accepted to IJCAI 2023.
[05/2023] Our QPGesture is accepted to CVPR 2023 as a highlight.
[11/2022] Our sign language avatar appears on HDC 2022 / HC 2022 and helps translate Chinese keynotes into CSL, 量子位, 华为人, 华为开发者联盟服务.
[10/2022] Our joint team Megatron_RVC won the RVC 2022 single image depth prediction challenge, news
[8/2022] Our human pose and shape estimation paper CLIFF is accepted to ECCV as oral presentation, news
[Preprint] Charge: A Comprehensive Novel View Synthesis Benchmark and Dataset to Bind Them All
Michal Nazarczuk, Thomas Tanay, Arthur Moreau, Zhensong Zhang, Eduardo Pérez-Pellitero
[Preprint] Off The Grid: Detection of Primitives for Feed-Forward 3D Gaussian Splatting
Arthur Moreau, Richard Shaw, Michal Nazarczuk, Jisu Shin, Thomas Tanay, Zhensong Zhang, Songcen Xu, Eduardo Pérez-Pellitero
[Preprint] Map2Thought: Explicit 3D Spatial Reasoning via Metric Cognitive Maps
Xiangjun Gao, Zhensong Zhang, Dave Zhenyu Chen, Songcen Xu, Long Quan, Eduardo Pérez-Pellitero, Youngkyoon Jang
[Preprint] GASPACHO: Gaussian Splatting for Controllable Humans and Objects
Aymen Mir, Arthur Moreau, Helisa Dhamo, Zhensong Zhang, Eduardo Pérez-Pellitero
[Preprint] Better Together: Unified Motion Capture and 3D Avatar Reconstruction
Arthur Moreau, Mohammed Brahimi, Richard Shaw, Athanasios Papaioannou, Thomas Tanay, Zhensong Zhang, Eduardo Pérez-Pellitero
[ICLR] CHROMA: Consistent Harmonization of Multi-View Appearance via Bilateral Grid Prediction
Jisu Shin, Richard Shaw, Seunghyun Shin, Zhensong Zhang, Hae-Gon Jeon, Eduardo Perez-Pellitero
In ICLR 2026.
[IJCV] ICo3D: An Interactive Conversational 3D Virtual Human
Richard Shaw, Youngkyoon Jang, Athanasios Papaioannou, Arthur Moreau, Helisa Dhamo, Zhensong Zhang, Eduardo Pérez-Pellitero
In IJCV.
[Website]
[AAAI] Plug-and-Play Clarifier: A Zero-Shot Multimodal Framework for Egocentric Intent Disambiguation
Sicheng Yang, Yukai Huang, Weitong Cai, Shitong Sun, You He, Jiankang Deng, Hang Zhang, Jifei Song, Zhensong Zhang
In AAAI 2026.
[3DV] SCENIC: Scene-aware Semantic Navigation with Instruction-guided Control
Xiaohan Zhang, Sebastian Starke, Vladimir Guzov, Zhensong Zhang, Eduardo Pérez Pellitero, Gerard Pons-Moll
In 3DV 2026.
[NeurIPS] ViDAR: Video Diffusion-Aware 4D Reconstruction From Monocular Inputs
Michal Nazarczuk, Sibi Catley-Chandar, Thomas Tanay, Zhensong Zhang, Gregory Slabaugh, Eduardo Pérez-Pellitero
In NeurIPS 2025.
[TPAMI] Human Motion Video Generation: A Survey
Haiwei Xue, Xiangyang Luo, Zhanghao Hu, Xin Zhang, Xunzhi Xiang, Yuqin Dai, Jianzhuang Liu, Zhensong Zhang, Minglei Li, Jian Yang, Fei Ma, Zhiyong Wu, Changpeng Yang, Zonghong Dai, Fei Richard Yu
In IEEE Transactions on Pattern Analysis and Machine Intelligence, 2025.
[Website]
[ICCV] Frequency-Guided Diffusion for Training-Free Text-Driven Image Translation
Zheng Gao, Jifei Song, Zhensong Zhang, Jiankang Deng, Ioannis Patras
In ICCV 2025.
[CVPR] CaricatureBooth: Data-Free Interactive Caricature Generation in a Photo Booth
Zhiyu Qu, Yunqi Miao, Zhensong Zhang, Jifei Song, Jiankang Deng, Yi-Zhe Song
In CVPR 2025.
[IJCAI] VideoHumanMIB: Unlocking Appearance Decoupling for Video Human Motion In-betweening
Haiwei Xue, Zhensong Zhang, Minglei Li, Zonghong Dai, Fei Yu, Fei Ma, Zhiyong Wu
In IJCAI 2025.
[ICASSP] Identity-Preserving Audio-Driven Holistic Human Motion Video Generation
Haiwei Xue, Zhensong Zhang, Minglei Li, Zonghong Dai
In ICASSP 2025.
Winner, ECCV 2022 RVC monoculer depth estimation prediction challenge
Second place award in HD-EPIC VQA Challenges 2025
Paper Lists
Computer Graphics Papers, ECCV Papers, CVPR/ICCV Papers, NeurIPS Papers