Qi Mao is a Professor with the State Key Laboratory of Media Convergence and Communication and the School of Information and Communication Engineering, Communication University of China.
Communication University of China · Ph.D., Peking University · Principal Investigator, MIPG
About
I am currently a Professor at the School of Information and Communication Engineering and the State Key Laboratory of Media Convergence and Communication, Communication University of China.
I received my Ph.D. degree from Peking University in July 2021, where I was affiliated with the Institute of Digital Media and worked with Prof. Wen Gao and Prof. Siwei Ma.
Prior to that, I obtained both the B.E. degree in Digital Media Technology and the B.A. degree in Journalism from Communication University of China in 2016.
I was a visiting Ph.D. student at the Vision and Learning Lab, University of California, Merced, under the supervision of Prof. Ming-Hsuan Yang.
I was also a visiting scholar at the National University of Singapore (NUS), where I collaborated with Prof. Mike Zheng Shou.
My research interests include controllable image and video generation, image editing, multimedia intelligence, and generative image and video compression.
News
Feb 2026
Our paper Generative Neural Video Compression via Video Diffusion Prior was accepted to the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2026).
Nov 2025
UniVid: Unifying Vision Tasks with Pre-trained Video Generation Models was accepted to the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV 2026).
Oct 2025
The group welcomes undergraduate students with interests in generative AI, agents, and multimedia intelligence.
Sep 2025
Our work on ultra-low-bitrate image compression enabled by multimodal large foundation models appeared in IEEE Transactions on Image Processing.
Lab
I lead MIPG, the Multimedia Intelligent Processing Group at Communication University of China. The group conducts research in visual generation, image and video editing, multimedia intelligence, and generative compression.
F15, State Key Laboratory of Media Convergence and Communication, Communication University of China, Beijing 100024, China.
School of Information and Communication Engineering, Communication University of China, No. 1 Dingfuzhuang East Street, Chaoyang District, Beijing 100024, China.
The MIPG homepage provides additional information about group members, projects, publications, and opportunities for prospective students.
Research
Controllable generation, localized editing, semantic alignment, and instruction-aware synthesis for visual content creation.
Image and video compression with generative priors, multimodal knowledge, and human-machine collaborative perception.
Visual communication, representation learning, and intelligent multimedia systems that connect media understanding with generation.
Publications
Qi Mao, Hao Cheng, Tinghan Yang, Libiao Jin, Siwei Ma. Generative Neural Video Compression via Video Diffusion Prior. CVPR 2026.
Lan Chen, Yuchao Gu, Qi Mao. UniVid: Unifying Vision Tasks with Pre-trained Video Generation Models. WACV 2026.
Junlong Gao, Zhimeng Huang, Qi Mao(*), Siwei Ma, Chuanmin Jia. IEEE Transactions on Image Processing, 34:5904-5919.
Yuanhang Li, Qi Mao(*), Lan Chen, Zhen Fang, Lei Tian, Xinyan Xiao, Libiao Jin, Hua Wu. IEEE Transactions on Multimedia.
Lan Chen, Qi Mao, Yuchao Gu, Mike Zheng Shou. arXiv preprint.
Qi Mao, Lan Chen, Yuchao Gu, Mike Zheng Shou, Ming-Hsuan Yang. arXiv preprint.
Qi Mao, Lan Chen, Yuchao Gu, Zhen Fang, Mike Zheng Shou. ACM MM 2024.
Qi Mao, Tinghan Yang, Yinuo Zhang, Zijian Wang, Meng Wang, Shiqi Wang, Siwei Ma. DCC 2024.
Naifu Xue, Qi Mao, Zijian Wang, Yuan Zhang, Siwei Ma. ICME 2024.
Yuanhang Li, Qi Mao, Libiao Jin. ICMEW 2024.
Qi Mao, Chongyu Wang, Meng Wang, Shiqi Wang, Ruijie Chen, Libiao Jin, Siwei Ma. IEEE Transactions on Image Processing, 33:408-422.
Qi Mao, Siwei Ma. IEEE Transactions on Multimedia, 25:8511-8526.
Jianhui Chang, Zhenghui Zhao, Chuanmin Jia, Shiqi Wang, Lingbo Yang, Qi Mao, Jian Zhang, Siwei Ma. IEEE Transactions on Image Processing, 31:2809-2823.
Qi Mao, Hung-Yu Tseng, Hsin-Ying Lee, Jia-Bin Huang, Siwei Ma, Ming-Hsuan Yang. International Journal of Computer Vision.
Hsin-Ying Lee, Hung-Yu Tseng, Qi Mao, Jia-Bin Huang, Yu-Ding Lu, Maneesh Kumar Singh, Ming-Hsuan Yang. International Journal of Computer Vision, 128(10-11):2402-2417.
Qi Mao*, Hsin-Ying Lee*, Hung-Yu Tseng*, Siwei Ma, Ming-Hsuan Yang. CVPR 2019.
Students
I welcome inquiries from highly motivated students interested in controllable generation, image and video editing, generative compression, and multimedia intelligence.
Projects
Localized image editing in complex scenarios with mask-based attention-adjusted guidance.
Code · SAVI2I · Code release for continuous and diverse image-to-image translation via signed attribute vectors.
Code · MSGAN · Repository for mode-seeking generative adversarial networks for diverse image synthesis.
Code · MAG-Edit Code · Codebase and implementation details for localized image editing experiments.
Project · Edit Transfer · Project page for learning image editing via vision in-context relations.
Code · UnifyEdit · Code release for tuning-free image editing with improved fidelity and editability.
Code · UniVid · Repository for unifying vision tasks with pre-trained video generation models.
Code · VQGAN-Compression · Resources for extreme image compression and generation-compression research.
Resource · Conditional Image Generation Papers · A curated paper list on conditional image generation research.
Resource · Image-to-Image Papers · A curated reading list for image-to-image translation literature.
Contact