Peng-Tao Jiang

My name is Peng-Tao Jiang (姜鹏涛). I am currently a lead researcher and engineer at the Quality Enhancement Center of vivo. Before that, I was a postdoctoral researcher at Zhejiang University, working with Prof. Chunhua Shen. I received my PhD from Nankai University, advised by Prof. Ming-Ming Cheng. I have also completed internships at SenseTime and Tencent YouTu.

My recent research focuses on the following topics:

  • Universal scene parsing and understanding: SDMatte (ICCV 2025), DepthMaster (arXiv 2024), MLoRE (CVPR 2024), TaskDiffusion (ICLR 2025)
  • Mobile image enhancement: RRW (CVPR 2024), LAL (ICML 2025), ConsisSR (arXiv 2024)
  • Generative models and applications: B-TTDM (ECCV 2024), DDAEBM (ICML 2024), ADM (ACM MM 2024)
  • Photography editing: Any2Bokeh (arXiv 2025), MagicTryOn (arXiv 2025), PPC (NeurIPS 2025)

  Email / CV / Google Scholar / GitHub

    News
    [08.24,2025]: Three papers have been accepted by NeurIPS 2025.

    [08.24,2025]: One paper has been accepted by TPAMI.

    [06.26,2025]: Three papers have been accepted by ICCV 2025.
    Research
    * denotes equal contribution; # denotes corresponding author.
    Q-Ponder: A Unified Training Pipeline for Reasoning-based Visual Quality Assessment
    Zhuoxuan Cai, Jian Zhang, Xinbin Yuan, Peng-Tao Jiang, Wenxiang Chen, Bowen Tang, Lujian Yao, Qiyuan Wang, Jinwen Chen, Bo Li#
    arXiv, 2025
    paper / code / project
    Any-to-Bokeh: Arbitrary-Subject Video Refocusing with Video Diffusion Model
    Yang Yang*, Siming Zheng*, Jinwei Chen, Boxi Wu#, Xiaofei He, Deng Cai, Bo Li, Peng-Tao Jiang#
    arXiv, 2025
    paper / code / project
    MagicTryOn: Harnessing Diffusion Transformer for Garment-Preserving Video Virtual Try-on
    Guangyuan Li*, Siming Zheng*, Hao Zhang, Jinwei Chen, Junsheng Luan, Binkai Ou, Lei Zhao#, Bo Li, Peng-Tao Jiang#
    arXiv, 2025
    paper / code / project
    DepthMaster: Taming Diffusion Models for Monocular Depth Estimation
    Ziyang Song*, Zerong Wang*, Bo Li, Hao Zhang, Ruijie Zhu, Li Liu, Peng-Tao Jiang#, Tianzhu Zhang#
    arXiv, 2025
    paper / code / project
    Photography Perspective Composition: Towards Aesthetic Perspective Recommendation
    Lujian Yao*, Siming Zheng*, Xinbin Yuan, Zhuoxuan Cai, Pu Wu, Jinwei Chen, Bo Li, Peng-Tao Jiang#
    NeurIPS, 2025
    paper / code / project
    Enhancing Visual Grounding for GUI Agents via Self-Evolutionary Reinforcement Learning
    Xinbin Yuan, Jian Zhang, Kaixin Li, Zhuoxuan Cai, Lujian Yao, Jie Chen, Enguang Wang, Qibin Hou, Jinwei Chen, Peng-Tao Jiang, Bo Li
    NeurIPS, 2025
    paper
    Learning Differential Pyramid Representation for Tone Mapping
    Qirui Yang, Yinbo Li, Peng-Tao Jiang, Qihua Cheng, Biting Yu, Yihao Liu, Huanjing Yue, Jingyu Yang
    NeurIPS, 2025
    paper / demo
    Bidirectional Beta-Tuned Diffusion Model
    Tianyi Zheng, Jiayang Zou, Peng-Tao Jiang, Hao Zhang, Jinwei Chen, Jia Wang, Bo Li
    TPAMI, 2025
    paper
    DSDNet: Raw Domain Demoireing via Dual Color-Space Synergy
    Qirui Yang, Fangpu Zhang, Yeying Jin, Qihua Cheng, Peng-Tao Jiang, Huanjing Yue, Jingyu Yang
    ACM MM, 2025
    paper / demo
    SDMatte: Grafting Diffusion Models for Interactive Matting
    Longfei Huang, Yu Liang, Hao Zhang, Jinwei Chen, Wei Dong, Lunde Chen, Wanyu Liu, Bo Li, Peng-Tao Jiang#
    ICCV, 2025
    paper / code
    PGformer: Proxy-Bridged Game Transformer for Multi-Person Highly Interactive Extreme Motion Prediction
    Yanwen Fang, Jintai Chen, Peng-Tao Jiang, Chao Li, Yifeng Geng, Eddy K. F. Lam, Guodong Li
    ICCV, 2025
    paper / code
    MOERL: When Mixture-of-Experts Meet Reinforcement Learning for Adverse Weather Image Restoration
    Tao Wang, Peiwen Xia, Bo Li, Peng-Tao Jiang, Zhe Kong, Kaihao Zhang, Tong Lu, Wenhan Luo
    ICCV, 2025
    paper / code
    Learning Adaptive Lighting via Channel-Aware Guidance
    Qirui Yang*, Peng-Tao Jiang*#, Hao Zhang, Jinwei Chen, Bo Li, Huanjing Yue, Jingyu Yang#
    ICML, 2025
    paper / demo
    Multi-Task Dense Predictions via Unleashing the Power of Diffusion
    Yuqi Yang*, Peng-Tao Jiang*, Qibin Hou#, Hao Zhang, Jinwei Chen, Bo Li
    ICLR, 2025
    paper / code
    High-Precision Dichotomous Image Segmentation via Probing Diffusion Capacity
    Qian Yu*, Peng-Tao Jiang*#, Hao Zhang, Jinwei Chen, Bo Li, Lihe Zhang#, Huchuan Lu
    ICLR, 2025
    paper / code
    Advancing Comprehensive Aesthetic Insight with Multi-Scale Text-Guided Self-Supervised Learning
    Yuti Liu, Shice Liu, Junyuan Gao, Peng-Tao Jiang, Hao Zhang, Jinwei Chen, Bo Li#
    AAAI, 2025
    paper
    Boosting Vision State Space Model with Fractal Scanning
    Haoke Xiao*, Lv Tang*, Peng-Tao Jiang, Hao Zhang, Jinwei Chen, Bo Li#
    AAAI, 2025, ORAL
    paper
    Unsupervised Modality Adaptation with Text-to-Image Diffusion Models for Semantic Segmentation
    Ruihao Xia, Yu Liang, Peng-Tao Jiang, Hao Zhang, Bo Li#, Yang Tang#, Pan Zhou
    NeurIPS, 2024
    paper / code
    Chain of Visual Perception: Harnessing Multimodal Large Language Models for Zero-shot Camouflaged Object Detection
    Lv Tang, Peng-Tao Jiang, Zhihao Shen, Hao Zhang, Jinwei Chen, Bo Li
    ACM MM, 2024
    paper / code
    Non-uniform Timestep Sampling: Towards Faster Diffusion Model Training
    Tianyi Zheng, Cong Geng, Peng-Tao Jiang, Ben Wan, Hao Zhang, Jinwei Chen, Jia Wang#, Bo Li#
    ACM MM, 2024
    paper
    Beta-Tuned Timestep Diffusion Model
    Tianyi Zheng, Peng-Tao Jiang, Ben Wan, Hao Zhang, Jinwei Chen, Jia Wang#, Bo Li#
    ACM MM, 2024
    paper
    Towards Training-free Open-world Segmentation via Image Prompt Foundation Models
    Lv Tang*, Peng-Tao Jiang*, Haoke Xiao*, Bo Li#
    IJCV, 2024
    paper
    Improving Adversarial Energy-Based Model via Diffusion Process
    Cong Geng, Tian Han, Peng-Tao Jiang, Hao Zhang, Jinwei Chen, Søren Hauberg, Bo Li
    ICML, 2024
    paper
    Revisiting Single Image Reflection Removal In the Wild
    Yurui Zhu, Xueyang Fu, Peng-Tao Jiang, Hao Zhang, Qibin Sun, Jinwei Chen, Zheng-Jun Zha, Bo Li
    CVPR, 2024
    paper / code
    Multi-Task Dense Prediction via Mixture of Low-Rank Experts
    Yuqi Yang*, Peng-Tao Jiang*, Qibin Hou, Hao Zhang, Jinwei Chen, Bo Li
    CVPR, 2024
    paper / code
    Traffic Scene Parsing through the TSP6K Dataset
    Peng-Tao Jiang*, Yuqi Yang*, Yang Cao, Qibin Hou, Ming-Ming Cheng, Chunhua Shen
    CVPR, 2024
    paper / code / dataset [password: Wi9qFT]
    Looking Through the Glass: Neural Surface Reconstruction Against High Specular Reflections
    Jiaxiong Qiu, Peng-Tao Jiang, Yifan Zhu, Ze-Xin Yin, Ming-Ming Cheng, Bo Ren
    CVPR, 2023
    paper / code
    RDNeRF: Relative Depth Guided NeRF for Dense Free View Synthesis
    Jiaxiong Qiu*, Yifan Zhu*, Peng-Tao Jiang, Ming-Ming Cheng, Bo Ren
    TVC, 2023
    paper / code
    Deeply Explain CNN via Hierarchical Decomposition
    Ming-Ming Cheng*, Peng-Tao Jiang*, Ling-Hao Han, Liang Wang, Philip Torr
    IJCV, 2023
    paper / demo
    L2G: A Simple Local-to-Global Knowledge Transfer Framework for Weakly Supervised Semantic Segmentation
    Peng-Tao Jiang, Yuqi Yang, Qibin Hou, Yunchao Wei
    CVPR, 2022
    paper / code
    Attention mechanisms in computer vision: A survey
    Meng-Hao Guo, Tian-Xing Xu, Jiang-Jiang Liu, Zheng-Ning Liu, Peng-Tao Jiang, Tai-Jiang Mu, Song-Hai Zhang, Ralph R. Martin, Ming-Ming Cheng, Shi-Min Hu
    CVMJ, 2022
    paper / code / Best Paper Award
    Personalized Image Semantic Segmentation
    Yu Zhang, Chang-bin Zhang, Peng-Tao Jiang, Feng Mao, Ming-Ming Cheng
    ICCV, 2021
    paper / code
    Online Attention Accumulation for Weakly Supervised Semantic Segmentation
    Peng-Tao Jiang*, Ling-Hao Han*, Qibin Hou, Ming-Ming Cheng, Yunchao Wei
    TPAMI, 2021
    paper / code
    Delving Deep into Label Smoothing
    Chang-bin Zhang*, Peng-Tao Jiang*, Qibin Hou, Yunchao Wei, Qi Han, Zhen Li, Ming-Ming Cheng
    TIP, 2021
    paper / code
    LayerCAM: Exploring Hierarchical Class Activation Maps for Localization
    Peng-Tao Jiang*, Chang-bin Zhang*, Qibin Hou, Ming-Ming Cheng, Yunchao Wei
    TIP, 2021
    paper / code
    Integral Object Mining via Online Attention Accumulation
    Peng-Tao Jiang, Qibin Hou, Yang Cao, Ming-Ming Cheng, Yunchao Wei, Hongkai Xiong
    ICCV, 2019
    paper / code / project
    Self-Erasing Network for Integral Object Attention
    Qibin Hou, Peng-Tao Jiang, Yunchao Wei, Ming-Ming Cheng
    NeurIPS, 2018
    paper / code
    DEL: Deep Embedding Learning for Efficient Image Segmentation
    Yun Liu, Peng-Tao Jiang, Vahan Petrosyan, Shi-Jie Li, Jiawang Bian, Le Zhang, Ming-Ming Cheng
    IJCAI, 2018
    paper / code

    Thanks to Yang Cao and Jon Barron for the source code.