About Me
Hi, this is Zerong Zheng (郑泽荣). I am currently an algorithm scientist at Bytedance, working on virtual humans. I obtained my PhD degree from the Department of Automation, Tsinghua University, where my advisor was Prof. Yebin Liu. My research focuses on computer vision and graphics, especially 3D human reconstruction, avatar modeling, neural human rendering, and human video generation.
Background
Bytedance Inc.
I am currently an algorithm scientist at Bytedance. I work on building high-fidelity virtual human technologies using cutting-edge techniques such as diffusion transformers and 3D Gaussian splatting.
NNKosmos Technology
NNKosmos is a start-up founded by Prof. Yebin Liu, focusing on virtual human technologies and their applications in e-commerce, entertainment, and other fields. As the chief algorithm scientist, I led the algorithm development at the company.
Tsinghua University
I obtained my PhD degree in June 2023 under the supervision of Prof. Yebin Liu. Before that, I received a B.Eng. degree from the Department of Automation, Tsinghua University, in July 2018.
Facebook Inc.
I joined Facebook Reality Labs in Sausalito as a summer research intern, working with Dr. Tony Tung.
University of Southern California
I spent an exciting summer as a visiting researcher at the USC Institute for Creative Technologies, working with Prof. Hao Li.
Research
MeshAvatar: Learning High-quality Triangular Human Avatars from Multi-view Videos
We present a novel pipeline for learning high-quality triangular human avatars from multi-view videos. Our method represents the avatar with an explicit triangular mesh extracted from an implicit SDF field, complemented by an implicit material field conditioned on given poses.
@inproceedings{chen2024meshavatar,
title={MeshAvatar: Learning High-quality Triangular Human Avatars from Multi-view Videos},
author={Yushuo Chen and Zerong Zheng and Zhe Li and Chao Xu and Yebin Liu},
booktitle={ECCV},
year={2024}
}
3D Gaussian Parametric Head Model
We introduce the 3D Gaussian Parametric Head Model (GPHM), which employs 3D Gaussians to accurately represent the complexities of the human head, allowing precise control over both identity and expression.
@inproceedings{xu2023gphm,
title={3D Gaussian Parametric Head Model},
author={Xu, Yuelang and Wang, Lizhen and Zheng, Zerong and Su, Zhaoqi and Liu, Yebin},
booktitle={ECCV},
year={2024}
}
LayGA: Layered Gaussian Avatars for Animatable Clothing Transfer
We present Layered Gaussian Avatars (LayGA), a new representation that formulates body and clothing as two separate layers for photorealistic animatable clothing transfer from multi-view videos.
@inproceedings{lin2024layga,
title={LayGA: Layered Gaussian Avatars for Animatable Clothing Transfer},
author={Lin, Siyou and Li, Zhe and Su, Zhaoqi and Zheng, Zerong and Zhang, Hongwen and Liu, Yebin},
booktitle={SIGGRAPH Conference Papers},
year={2024}
}
Animatable Gaussians: Learning Pose-dependent Gaussian Maps for High-fidelity Human Avatar Modeling
We introduce Animatable Gaussians, a new avatar representation that leverages powerful 2D CNNs and 3D Gaussian splatting to create high-fidelity avatars.
@inproceedings{li2023animatablegaussians,
title={Animatable Gaussians: Learning Pose-dependent Gaussian Maps for High-fidelity Human Avatar Modeling},
author={Li, Zhe and Zheng, Zerong and Wang, Lizhen and Liu, Yebin},
booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
year={2024}
}
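A loose illustrative sketch of the general "pose-dependent Gaussian map" idea (the module and layer sizes below are hypothetical, not the paper's architecture): a small 2D CNN regresses per-texel 3D Gaussian parameters from a posed position map, which any Gaussian splatting rasterizer can then render.
import torch
import torch.nn as nn

class GaussianMapRegressor(nn.Module):
    """Toy CNN mapping a posed position map (3 x H x W) to per-texel 3D Gaussian
    parameters: offset (3), scale (3), rotation quaternion (4), opacity (1), RGB (3)."""
    def __init__(self, hidden=64):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(3, hidden, 3, padding=1), nn.ReLU(),
            nn.Conv2d(hidden, hidden, 3, padding=1), nn.ReLU(),
            nn.Conv2d(hidden, 14, 3, padding=1),
        )

    def forward(self, position_map):                     # (B, 3, H, W)
        out = self.net(position_map)                     # (B, 14, H, W)
        offset, scale, rot, opacity, rgb = torch.split(out, [3, 3, 4, 1, 3], dim=1)
        return {
            "centers": position_map + 0.01 * torch.tanh(offset),  # stay near the posed body surface
            "scales": torch.exp(scale.clamp(max=4.0)),
            "rotations": nn.functional.normalize(rot, dim=1),
            "opacities": torch.sigmoid(opacity),
            "colors": torch.sigmoid(rgb),
        }

gaussians = GaussianMapRegressor()(torch.randn(1, 3, 256, 256))  # dict of (1, C, 256, 256) maps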
Gaussian Head Avatar: Ultra High-fidelity Head Avatar via Dynamic Gaussians
We propose Gaussian Head Avatar, a head avatar representation based on controllable 3D Gaussians for high-fidelity head avatar modeling. Our approach achieves ultra high-fidelity rendering quality at 2K resolution even under exaggerated expressions.
@inproceedings{xu2023gaussianheadavatar,
title={Gaussian Head Avatar: Ultra High-fidelity Head Avatar via Dynamic Gaussians},
author={Xu, Yuelang and Chen, Benwang and Li, Zhe and Zhang, Hongwen and Wang, Lizhen and Zheng, Zerong and Liu, Yebin},
booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
year={2024}
}
DiffPerformer: Iterative Learning of Consistent Latent Guidance for Diffusion-based Human Video Generation
We propose a novel framework, DiffPerformer, to synthesize high-fidelity and temporally consistent human videos using the prior in a pre-trained diffusion model.
Control4D: Efficient 4D Portrait Editing with Text
We introduce Control4D, an innovative framework for editing dynamic 4D portraits using text instructions. Our method consists of GaussianPlanes, a novel 4D representation that applies plane-based decomposition to Gaussian Splatting, and a 4D generator that learns a more continuous generation space.
@inproceedings{shao2023control4d,
title = {Control4D: Efficient 4D Portrait Editing with Text},
author = {Shao, Ruizhi and Sun, Jingxiang and Peng, Cheng and Zheng, Zerong and Zhou, Boyao and Zhang, Hongwen and Liu, Yebin},
booktitle = {Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
year = {2024}
}
RAM-Avatar: Real-time Photo-Realistic Avatar from Monocular Videos with Full-body Control
We propose a new method for learning real-time, photo-realistic avatars in the 2D image domain that support full-body control from monocular videos.
Leveraging Intrinsic Properties for Non-Rigid Garment Alignment
We address the problem of aligning real-world 3D data of garments by proposing a novel coarse-to-fine two-stage method that leverages intrinsic manifold properties with two neural deformation fields, in the 3D space and the intrinsic space, respectively.
@inproceedings{lin2023intgam,
author={Siyou Lin and Boyao Zhou and Zerong Zheng and Hongwen Zhang and Yebin Liu},
title={Leveraging Intrinsic Properties for Non-Rigid Garment Alignment},
booktitle = {ICCV},
year = {2023}
}
AvatarReX: Real-time Expressive Full-body Avatars
We present AvatarReX, a new method for learning NeRF-based full-body avatars from video data. The learnt avatar not only provides expressive control of the body, hands and the face together, but also supports real-time animation and rendering.
@article{zheng2023avatarrex,
author={Zheng, Zerong and Zhao, Xiaochen and Zhang, Hongwen and Liu, Boning and Liu, Yebin},
title={AvatarReX: Real-time Expressive Full-body Avatars},
journal={ACM Transactions on Graphics (TOG)},
volume={42},
number={4},
year={2023}
}
PoseVocab: Learning Joint-structured Pose Embeddings for Human Avatar Modeling
We present PoseVocab, a novel pose encoding method that encourages the network to encode the dynamic details of human appearance into multiple structural levels, enabling realistic and generalized animation under novel poses.
@inproceedings{li2023posevocab,
author={Li, Zhe and Zheng, Zerong and Liu, Yuxiao and Zhou, Boyao and Liu, Yebin},
title={PoseVocab: Learning Joint-structured Pose Embeddings for Human Avatar Modeling},
booktitle={ACM SIGGRAPH Conference Proceedings},
year={2023}
}
Tensor4D: Efficient Neural 4D Decomposition for High-fidelity Dynamic Reconstruction and Rendering
We present Tensor4D, an efficient yet effective approach to dynamic scene modeling. The key to our solution is an efficient 4D tensor decomposition method in which the dynamic scene is directly represented as a 4D spatio-temporal tensor.
@inproceedings{shao2023tensor4d,
author = {Shao, Ruizhi and Zheng, Zerong and Tu, Hanzhang and Liu, Boning and Zhang, Hongwen and Liu, Yebin},
title = {Tensor4D: Efficient Neural 4D Decomposition for High-fidelity Dynamic Reconstruction and Rendering},
booktitle = {CVPR},
year = {2023}
}
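To give a rough sense of why a factored spatio-temporal representation is cheap (a toy CP-style rank-R factorization for illustration only; Tensor4D uses its own plane-based decomposition), compare the storage of a dense 4D grid with its factors:
import torch

# Toy rank-R factorization of a 4D (x, y, z, t) field; sizes are illustrative.
X, Y, Z, T, R = 64, 64, 64, 30, 8
fx, fy, fz, ft = (torch.randn(R, n) for n in (X, Y, Z, T))

def query(xi, yi, zi, ti):
    """Evaluate the factored field at integer grid indices without ever
    materializing the full X*Y*Z*T tensor."""
    return (fx[:, xi] * fy[:, yi] * fz[:, zi] * ft[:, ti]).sum()

dense_params = X * Y * Z * T            # ~7.9M values if stored densely
factored_params = R * (X + Y + Z + T)   # ~1.8K values in factored form
print(dense_params, factored_params, query(1, 2, 3, 4).item())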
CloSET: Modeling Clothed Humans on Continuous Surface with Explicit Template Decomposition
We propose a point-based method that can decompose garment-related templates and then add pose-dependent wrinkles upon them. In this way, the clothing deformations are disentangled such that the pose-dependent wrinkles can be better learned and applied to unseen poses.
@inproceedings{zhang2023closet,
author = {Zhang, Hongwen and Lin, Siyou and Shao, Ruizhi and Zhang, Yuxiang and Zheng, Zerong and Huang, Han and Guo, Yandong and Liu, Yebin},
title = {CloSET: Modeling Clothed Humans on Continuous Surface with Explicit Template Decomposition},
booktitle = {CVPR},
year = {2023}
}
FloRen: Real-time High-quality Human Performance Rendering via Appearance Flow Using Sparse RGB Cameras
We propose FloRen, a novel system for real-time, high-resolution free-view human synthesis. Our system runs at 15 fps at 1K resolution with very sparse RGB cameras.
@inproceedings{shao2022floren,
author = {Shao, Ruizhi and Chen, Liliang and Zheng, Zerong and Zhang, Hongwen and Zhang, Yuxiang and Huang, Han and Guo, Yandong and Liu, Yebin},
title = {FloRen: Real-Time High-Quality Human Performance Rendering via Appearance Flow Using Sparse RGB Cameras},
booktitle = {SIGGRAPH Asia 2022 Conference Papers},
year = {2022}
}
DiffuStereo: High Quality Human Reconstruction via Diffusion-based Stereo Using Sparse Cameras
We propose DiffuStereo, a novel system using only sparse cameras for high-quality 3D human reconstruction. At its core is a novel diffusion-based stereo module, which introduces diffusion models into the iterative stereo matching framework.
@inproceedings{shao2022diffustereo,
author = {Shao, Ruizhi and Zheng, Zerong and Zhang, Hongwen and Sun, Jingxiang and Liu, Yebin},
title = {DiffuStereo: High Quality Human Reconstruction via Diffusion-based Stereo Using Sparse Cameras},
booktitle = {ECCV},
year = {2022}
}
AvatarCap: Animatable Avatar Conditioned Monocular Human Volumetric Capture
AvatarCap is a novel framework that introduces animatable avatars into the capture pipeline for high-fidelity volumetric capture from monocular RGB inputs. It can reconstruct the dynamic details in both visible and invisible regions.
@InProceedings{li2022avatarcap,
title={AvatarCap: Animatable Avatar Conditioned Monocular Human Volumetric Capture},
author={Li, Zhe and Zheng, Zerong and Zhang, Hongwen and Ji, Chaonan and Liu, Yebin},
booktitle={European Conference on Computer Vision (ECCV)},
month={October},
year={2022},
}
Learning Implicit Templates for Point-Based Clothed Human Modeling
We present FITE, a First-Implicit-Then-Explicit framework for modeling human avatars in clothing. Our framework first learns implicit surface templates representing the coarse clothing topology, and then employs the templates to guide the generation of point sets which further capture pose-dependent clothing deformations such as wrinkles.
@inproceedings{lin2022fite,
title={Learning Implicit Templates for Point-Based Clothed Human Modeling},
author={Lin, Siyou and Zhang, Hongwen and Zheng, Zerong and Shao, Ruizhi and Liu, Yebin},
booktitle={ECCV},
year={2022}
}
Structured Local Radiance Fields for Human Avatar Modeling
We introduce a novel representation for learning animatable full-body avatars in general clothes without any pre-scanning efforts. The core of our representation is a set of structured local radiance fields, which makes no assumption about the cloth topology but is still able to model the cloth motions in a coarse-to-fine manner.
@InProceedings{zheng2022structured,
title={Structured Local Radiance Fields for Human Avatar Modeling},
author={Zheng, Zerong and Huang, Han and Yu, Tao and Zhang, Hongwen and Guo, Yandong and Liu, Yebin},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2022},
}
High-Fidelity Human Avatars from a Single RGB Camera
We propose a new framework to reconstruct a personalized high-fidelity human avatar from a monocular video. Our method is able to recover the pose-dependent surface deformations as well as high-quality appearance details.
@InProceedings{zhao2022highfidelity,
title={High-Fidelity Human Avatars from a Single RGB Camera},
author={Zhao, Hao and Zhang, Jinsong and Lai, Yu-Kun and Zheng, Zerong and Xie, Yingdi and Liu, Yebin and Li, Kun},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2022},
}
HVTR: Hybrid Volumetric-Textural Rendering for Human Avatars
We propose a novel neural rendering pipeline, Hybrid Volumetric-Textural Rendering (HVTR), which synthesizes virtual human avatars from arbitrary poses efficiently and at high quality by combining 2D UV-based latent features with 3D volumetric representation.
@article{hu2021hvtr,
title={HVTR: Hybrid Volumetric-Textural Rendering for Human Avatars},
author={Tao Hu and Tao Yu and Zerong Zheng and He Zhang and Yebin Liu and Matthias Zwicker},
eprint={2112.10203},
archivePrefix={arXiv},
year = {2022},
primaryClass={cs.CV}
}
Deep Implicit Templates for 3D Shape Representation
We propose Deep Implicit Templates, a new 3D shape representation that supports explicit mesh correspondence reasoning in deep implicit representations. Our key idea is to formulate deep implicit functions as conditional deformations of a template implicit function.
@InProceedings{zheng2021dit,
author = {Zheng, Zerong and Yu, Tao and Dai, Qionghai and Liu, Yebin},
title = {Deep Implicit Templates for 3D Shape Representation},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2021},
pages = {1429-1439}
}
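A minimal sketch of the "template plus conditional deformation" formulation, f(p, z) = T(W(p, z)), where T is a shared template SDF and W a latent-conditioned warp (the network sizes and names below are hypothetical):
import torch
import torch.nn as nn

class TemplateSDF(nn.Module):
    """Shared template implicit function T: R^3 -> signed distance."""
    def __init__(self, hidden=128):
        super().__init__()
        self.mlp = nn.Sequential(nn.Linear(3, hidden), nn.ReLU(),
                                 nn.Linear(hidden, hidden), nn.ReLU(),
                                 nn.Linear(hidden, 1))
    def forward(self, p):
        return self.mlp(p)

class ConditionalWarp(nn.Module):
    """Per-shape deformation W(p, z): warps query points into template space."""
    def __init__(self, latent_dim=64, hidden=128):
        super().__init__()
        self.mlp = nn.Sequential(nn.Linear(3 + latent_dim, hidden), nn.ReLU(),
                                 nn.Linear(hidden, hidden), nn.ReLU(),
                                 nn.Linear(hidden, 3))
    def forward(self, p, z):
        z = z.expand(p.shape[0], -1)
        return p + self.mlp(torch.cat([p, z], dim=-1))   # predicted point-wise offset

class DeformedImplicit(nn.Module):
    """f(p, z) = T(W(p, z)): all shapes share one template and differ only by the warp,
    so corresponding points across shapes map to the same template location."""
    def __init__(self):
        super().__init__()
        self.template, self.warp = TemplateSDF(), ConditionalWarp()
    def forward(self, p, z):
        return self.template(self.warp(p, z))

sdf = DeformedImplicit()(torch.randn(1024, 3), torch.randn(1, 64))   # (1024, 1) SDF values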
Function4D: Real-time Human Volumetric Capture from Very Sparse Consumer RGBD Sensors
We propose a human volumetric capture method that combines temporal volumetric fusion and deep implicit functions. Our method outperforms existing methods in terms of view sparsity, generalization capacity, reconstruction quality, and run-time efficiency.
@InProceedings{yu2021function4d,
author = {Yu, Tao and Zheng, Zerong and Guo, Kaiwen and Liu, Pengpeng and Dai, Qionghai and Liu, Yebin},
title = {Function4D: Real-Time Human Volumetric Capture From Very Sparse Consumer RGBD Sensors},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2021},
pages = {5746-5756}
}
POSEFusion: Pose-guided Selective Fusion for Single-view Human Volumetric Capture
We propose POse-guided SElective Fusion (POSEFusion), a single-view human volumetric capture method that leverages tracking-based methods and tracking-free inference to achieve high-fidelity and dynamic 3D reconstruction.
@InProceedings{li2021posefusion,
author = {Li, Zhe and Yu, Tao and Zheng, Zerong and Guo, Kaiwen and Liu, Yebin},
title = {POSEFusion: Pose-Guided Selective Fusion for Single-View Human Volumetric Capture},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2021},
pages = {14162-14172}
}
DeepMultiCap: Performance Capture of Multiple Characters Using Sparse Multiview Cameras
We propose DeepMultiCap, a novel method for multi-person performance capture using sparse multi-view cameras. Our method can capture time-varying surface details without the need for pre-scanned template models.
@inproceedings{shao2021dmc,
title={DeepMultiCap: Performance Capture of Multiple Characters Using Sparse Multiview Cameras},
author={Yang Zheng and Ruizhi Shao and Yuxiang Zhang and Zerong Zheng and Tao Yu and Yebin Liu},
booktitle={IEEE/CVF International Conference on Computer Vision (ICCV)},
year={2021}
}
VERTEX: VEhicle Reconstruction and TEXture Estimation Using Deep Implicit Semantic Template Mapping
We introduce VERTEX, an effective solution to recover 3D shape and intrinsic texture of vehicles from uncalibrated monocular input in real-world street environments.
@article{zhao2020vertex,
title={VERTEX: VEhicle Reconstruction and TEXture Estimation Using Deep Implicit Semantic Template Mapping},
author={Xiaochen Zhao and Zerong Zheng and Chaonan Ji and Zhenyi Liu and Yirui Luo and Tao Yu and Jinli Suo and Qionghai Dai and Yebin Liu},
year={2020},
eprint={2011.14642},
archivePrefix={arXiv},
primaryClass={cs.CV}
}
PaMIR: Parametric Model-Conditioned Implicit Representation for Image-based Human Reconstruction
We propose Parametric Model-Conditioned Implicit Representation (PaMIR), which combines the parametric body model with the free-form deep implicit functions for robust 3D human reconstruction from a single RGB image or multiple images.
@article{pamir2020,
author={Zerong Zheng and Tao Yu and Yebin Liu and Qionghai Dai},
journal={IEEE Transactions on Pattern Analysis and Machine Intelligence},
title={PaMIR: Parametric Model-Conditioned Implicit Representation for Image-based Human Reconstruction},
year={2021},
pages={1-1},
doi={10.1109/TPAMI.2021.3050505}
}
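A hypothetical sketch (not PaMIR's actual network) of how an occupancy head can fuse a pixel-aligned image feature with a feature sampled from a voxelized parametric body model:
import torch
import torch.nn as nn
import torch.nn.functional as F

def project(pts):
    """Placeholder orthographic projection: use (x, y) as normalized image
    coordinates and z as depth; a real system would apply the camera matrix."""
    return pts[..., :2], pts[..., 2:3]

class HybridImplicit(nn.Module):
    """Toy occupancy head conditioned on a pixel-aligned 2D image feature and a
    feature trilinearly sampled from a voxelized body-model prior; sizes are made up."""
    def __init__(self, img_feat=32, body_feat=8, hidden=128):
        super().__init__()
        self.mlp = nn.Sequential(nn.Linear(img_feat + body_feat + 1, hidden), nn.ReLU(),
                                 nn.Linear(hidden, hidden), nn.ReLU(),
                                 nn.Linear(hidden, 1))

    def forward(self, pts, img_feats, body_vol):
        # pts: (B, N, 3) query points in [-1, 1]; img_feats: (B, C, H, W); body_vol: (B, C2, D, H, W)
        uv, z = project(pts)
        f_img = F.grid_sample(img_feats, uv.unsqueeze(2),
                              align_corners=True)[..., 0].transpose(1, 2)          # (B, N, C)
        f_body = F.grid_sample(body_vol, pts.unsqueeze(2).unsqueeze(2),
                               align_corners=True)[..., 0, 0].transpose(1, 2)      # (B, N, C2)
        return torch.sigmoid(self.mlp(torch.cat([f_img, f_body, z], dim=-1)))      # (B, N, 1)

occ = HybridImplicit()(torch.rand(1, 500, 3) * 2 - 1,
                       torch.randn(1, 32, 128, 128), torch.randn(1, 8, 32, 32, 32))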
RobustFusion: Human Volumetric Capture with Data-driven Visual Cues using a RGBD Camera
We introduce a robust, template-less human volumetric capture system that incorporates various data-driven visual cues and significantly outperforms existing state-of-the-art approaches.
@InProceedings{robustfusion2020,
author={Su, Zhuo and Xu, Lan and Zheng, Zerong and Yu, Tao and Liu, Yebin and Fang, Lu},
editor={Vedaldi, Andrea and Bischof, Horst and Brox, Thomas and Frahm, Jan-Michael},
title={RobustFusion: Human Volumetric Capture with Data-Driven Visual Cues Using a RGBD Camera},
booktitle={European Conference on Computer Vision (ECCV)},
year={2020},
publisher={Springer International Publishing},
address={Cham},
pages={246--264},
isbn={978-3-030-58548-8}
}
Robust 3D Self-portraits in Seconds
We propose an efficient method for robust 3D self-portraits using a single RGBD camera. Our method can generate detailed 3D self-portraits in seconds and is able to handle challenging clothing topologies.
@InProceedings{Li2020portrait,
author = {Li, Zhe and Yu, Tao and Pan, Chuanyu and Zheng, Zerong and Liu, Yebin},
title = {Robust 3D Self-portraits in Seconds},
booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
month={June},
year={2020},
pages={1344-1353}
}
DeepHuman: 3D Human Reconstruction from a Single Image
We propose DeepHuman, a deep learning based framework for 3D human reconstruction from a single RGB image. We also contribute THuman, a 3D real-world human model dataset containing approximately 7000 models.
@InProceedings{Zheng2019DeepHuman,
author = {Zheng, Zerong and Yu, Tao and Wei, Yixuan and Dai, Qionghai and Liu, Yebin},
title = {DeepHuman: 3D Human Reconstruction From a Single Image},
booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
month = {October},
pages={7739-7749},
year = {2019}
}
SimulCap: Single-View Human Performance Capture with Cloth Simulation
This paper proposes a new method for live free-viewpoint human performance capture with dynamic details (e.g., cloth wrinkles) using a single RGBD camera. By incorporating cloth simulation into the performance capture pipeline, we can generate plausible cloth dynamics and cloth-body interactions.
@InProceedings{Yu2019SimulCap,
author = {Yu, Tao and Zheng, Zerong and Zhong, Yuan and Zhao, Jianhui and Dai, Qionghai and Pons-Moll, Gerard and Liu, Yebin},
title = {SimulCap : Single-View Human Performance Capture With Cloth Simulation},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
pages={5504-5514},
year = {2019}
}
HybridFusion: Real-time Performance Capture Using a Single Depth Sensor and Sparse IMUs
We propose a lightweight and highly robust real-time human performance capture method based on a single depth camera and sparse inertial measurement units (IMUs).
@InProceedings{Zheng2018HybridFusion,
author = {Zheng, Zerong and Yu, Tao and Li, Hao and Guo, Kaiwen and Dai, Qionghai and Fang, Lu and Liu, Yebin},
title = {HybridFusion: Real-time Performance Capture Using a Single Depth Sensor and Sparse IMUs},
booktitle = {European Conference on Computer Vision (ECCV)},
month={Sept},
year={2018},
}
DoubleFusion: Real-time Capture of Human Performances with Inner Body Shapes from a Single Depth Sensor
We propose DoubleFusion, a new real-time system that combines volumetric dynamic reconstruction with data-driven template fitting to simultaneously reconstruct detailed geometry, non-rigid motion, and the inner human body shape from a single depth camera.
@inproceedings{yu2018DoubleFusion,
title = {DoubleFusion: Real-time Capture of Human Performances with Inner Body Shapes from a Single Depth Sensor},
author = {Yu, Tao and Zheng, Zerong and Guo, Kaiwen and Zhao, Jianhui and Dai, Qionghai and Li, Hao and Pons-Moll, Gerard and Liu, Yebin},
booktitle = {{IEEE} Conference on Computer Vision and Pattern Recognition},
note = {{CVPR} Oral},
year = {2018}
}
Distinction
National Scholarship, Ministry of Education of China
Tsinghua-Hefei First Class Scholarship, Tsinghua University
Future Scholar Fellowship (×3 years), Tsinghua University
Excellent Bachelor Thesis Award, Tsinghua University
Academic Excellence Award, Tsinghua-GuangYao Scholarship, Tsinghua University
Excellence Award & Scholarship for Technological Innovation, Tsinghua University
Academic Excellence Award, Tsinghua-Hengda Scholarship, Tsinghua University
Excellence Award for Technological Innovation, Tsinghua University
Academic Excellence Award & Scholarship, Tsinghua University
Others
Skills
- C++ (OpenCV, OpenGL, CUDA, Eigen, PCL, Qt, ...)
- Python (TensorFlow/PyTorch)
- Matlab, C#
- LaTeX
Languages
- Chinese (native)
- English (TOEFL: 101; GRE: 152+170+4.0)