| Pengfei Wan I lead the Kling models and data team (aka the Kling Team) at Kuaishou Technology. I used to be the director of AI department (MT Lab) at Meitu, Inc. I did my Ph.D. at the ECE Department of HKUST, and B.E. at the EEIS Department of USTC.
Kling Team is building next-generation multimodal world models across video, audio, text, 3D, and beyond. Our work spans, but is not limited to: - Multimodal Understanding & Reasoning
- Multimodal Generation & Interaction
- Multimodal Data Systems & Algorithms
We are continuously seeking outstanding talents to join us. Feel free to reach out! Kling AI / Google Scholar / GitHub / Email | | Technology I'm interested in computer vision and graphics, generative AI, and multimodal machine learning. Below is a list of selected publications in recent years. | Flow-GRPO: Training Flow Matching Models via Online RL Jie Liu, Gongye Liu, Jiajun Liang, Yangguang Li, Jiaheng Liu, Xintao Wang, Pengfei Wan, Di Zhang, Wanli Ouyang NeurIPS, 2025 | OmniSync: Towards Universal Lip Synchronization via Diffusion Transformers Ziqiao Peng, Jiwen Liu, Haoxian Zhang, Xiaoqiang Liu, Songlin Tang, Pengfei Wan, Di Zhang, Hongyan Liu, Jun He NeurIPS Spotlight, 2025 | Context as Memory: Scene-consistent Interactive Long Video Generation with Memory Retrieval Jiwen Yu, Jianhong Bai, Yiran Qin, Quande Liu, Xintao Wang, Pengfei Wan, Di Zhang, Xihui Liu SIGGRAPH Asia, 2025 | FullDiT: Multi-Task Video Generative Foundation Model with Full Attention Xuan Ju, Weicai Ye, Quande Liu, Qiulin Wang, Xintao Wang, Pengfei Wan, Di Zhang, Kun Gai, Qiang Xu ICCV, 2025 | ReCamMaster: Camera-Controlled Generative Rendering from a Single Video Jianhong Bai, Menghan Xia, Xiao Fu, Xintao Wang, Lianrui Mu, Jinwen Cao, Zuozhu Liu, Haoji Hu, Xiang Bai, Pengfei Wan, Di Zhang ICCV Oral, Best Paper Award Finalist, 2025 | Towards Precise Scaling Laws for Video Diffusion Transformers Yuanyang Yin, Yaqi Zhao, Mingwu Zheng, Ke Lin, Jiarong Ou, Rui Chen, Victor Shea-Jay Huang, Jiahao Wang, Xin Tao, Pengfei Wan, Di Zhang, Baoqun Yin, Wentao Zhang, Kun Gai CVPR, 2025 | MODA: MOdular Duplex Attention for Multimodal Perception, Cognition, and Emotion Understanding Zhicheng Zhang, Wuyou Xia, Chenxi Zhao, Yan Zhou, Xiaoqiang Liu, Yongjie Zhu, Wenyu Qin, Pengfei Wan, Di Zhang, Jufeng Yang ICML Spotlight, 2025 | 3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation Xiao Fu, Xian Liu, Xintao Wang, Sida Peng, Menghan Xia, Xiaoyu Shi, Ziyang Yuan, Pengfei Wan, Di Zhang, Dahua Lin ICLR, 2025 | DVIS++: Improved Decoupled Framework for Universal Video Segmentation Tao Zhang, Xingye Tian, Yikang Zhou, Shunping Ji, Xuebo Wang, Xin Tao, Yuan Zhang, Pengfei Wan, Zhongyuan Wang, Yu Wu TPAMI, 2025 | Agent Attention: On the Integration of Softmax and Linear Attention Dongchen Han, Tianzhu Ye, Yizeng Han, Zhuofan Xia, Siyuan Pan, Pengfei Wan, Shiji Song, Gao Huang ECCV, 2024 | VideoTetris: Towards Compositional Text-to-Video Generation Ye Tian, Ling Yang, Haotian Yang, Yuan Gao, Yufan Deng, Jingmin Chen, Xintao Wang, Zhaochen Yu, Xin Tao, Pengfei Wan, Di Zhang, Bin Cui NeurIPS, 2024 | I2V-Adapter: A General Image-to-Video Adapter for Diffusion Models Xun Guo, Mingwu Zheng, Liang Hou, Yuan Gao, Yufan Deng, Pengfei Wan, Di Zhang, Yufan Liu, Weiming Hu, Zhengjun Zha, Haibin Huang, Chongyang Ma SIGGRAPH, 2024 | Augmentation-Aware Self-Supervision for Data-Efficient GAN Training Liang Hou, Qi Cao, Yige Yuan, Songtao Zhao, Chongyang Ma, Siyuan Pan, Pengfei Wan, Zhongyuan Wang, Huawei Shen, Xueqi Cheng NeurIPS, 2023 | Towards Practical Capture of High-fidelity Relightable Avatars Haotian Yang, Mingwu Zheng, Wanquan Feng, Haibin Huang, Yu-Kun Lai, Pengfei Wan, Zhongyuan Wang, Chongyang Ma SIGGRAPH Asia, 2023 | FEditNet: Few-Shot Editing of Latent Semantics in GAN Spaces Mengfei Xia, Yezhi Shu, Yuji Wang, Yu-Kun Lai, Qiang Li, Pengfei Wan, Zhongyuan Wang, Yong-Jin Liu AAAI Oral, 2023 | PMP-Net++: Point Cloud Completion by Transformer-Enhanced Multi-Step Point Moving Paths Xin Wen, Peng Xiang, Zhizhong Han, Yan-Pei Cao, Pengfei Wan, Wen Zheng, Yu-Shen Liu TPAMI, 2022 | Debiased Self-Training for Semi-Supervised Learning Baixu Chen, Junguang Jiang, Ximei Wang, Pengfei Wan, Jianmin Wang, Mingsheng Long NeurIPS Oral, 2022 | Snowflake Point Deconvolution for Point Cloud Completion and Generation with Skip-Transformer Peng Xiang, Xin Wen, Yu-Shen Liu, Yan-Pei Cao, Pengfei Wan, Wen Zheng, Zhizhong Han TPAMI, 2022 | | Talks | Presentation on "An Introduction to Kling and Our Research towards More Powerful Video Generation Models", Tutorial Session: From Video Generation to World Model, CVPR, Nashville, 2025
Virtual panel discussion on "Video Generation Models", Project Odyssey AI Film Gala, San Francisco, 2024
Roundtable forum on "The Innovations and Challenges of the Next-generation Artificial Intelligence Architecture", Plenary Session: Scientific Frontier, World Artificial Intelligence Conference (WAIC), Shanghai, 2024
Presentation on "Kling Video Generation Models" & roundtable forum on "Multimodality, AGI, On-device AI", BAAI Conference, Beijing, 2024
Presentation on "Multimodal Digital Human: Technological Innovations and Industrial Applications", Opening Plenary, China Digital Human Conference, Beijing, 2024 | | Services | Reviewer/Program Committee Member of CVPR, ICCV, NeurIPS, ICLR, AAAI, TIP, etc. | | | | This template is a modification of Jon Barron's website | |