Diffusion for Natural Image Matting

Authors
Hu, Yihan [1, 2, 5]
Lin, Yiheng [1, 2]
Wang, Wei [1, 2]
Zhao, Yao [1, 2, 3]
Wei, Yunchao [1, 2, 3]
Shi, Humphrey [4, 5]
Affiliations
[1] Beijing Jiaotong Univ, Inst Informat Sci, Beijing, Peoples R China
[2] Minist Educ, Visual Intelligence X Int Joint Lab, Beijing, Peoples R China
[3] Pengcheng Lab, Shenzhen, Peoples R China
[4] Georgia Inst Technol, Atlanta, GA 30332 USA
[5] Picsart AI Res PAIR, Atlanta, GA USA
Source
COMPUTER VISION-ECCV 2024, PT LVII | 2025 / Vol. 15115
Keywords
Image matting; Diffusion process; Iterative refinement
DOI
10.1007/978-3-031-72998-0_11
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Existing natural image matting algorithms inevitably make errors on difficult cases, and their one-step prediction cannot correct these errors afterwards. In this paper, we investigate, for the first time, a multi-step iterative approach to the challenging natural image matting task, and achieve excellent performance by introducing a pixel-level denoising diffusion method (DiffMatte) for alpha matte refinement. To improve iteration efficiency, we design a lightweight diffusion decoder as the only iterative component, which directly denoises the alpha matte and avoids the large computational overhead of repeatedly encoding matting features. We also propose an ameliorated self-aligned strategy to consolidate the performance gains brought by the iterative diffusion process. It allows the model to adapt to various types of errors by aligning the noisy samples used in training and inference, mitigating the performance degradation caused by sampling drift. Extensive experimental results demonstrate that DiffMatte not only reaches the state-of-the-art level on the mainstream Composition-1k test set, surpassing the previous best methods by 8% and 15% in the SAD and MSE metrics respectively, but also shows stronger generalization ability on other benchmarks. Code is available at https://github.com/YihanHu-2022/DiffMatte.
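The "encode once, iterate only the decoder" idea in the abstract can be illustrated with a minimal sketch. It is not the DiffMatte implementation. `ToyMatter`, `refine`, the blending rule inside `decode`, and the cosine noise schedule are all hypothetical stand-ins chosen for illustration, and the self-aligned training strategy is not modeled.

```python
import math
import random


class ToyMatter:
    """Hypothetical stand-in for a matting model with a heavy encoder and a light decoder."""

    def __init__(self):
        self.encoder_calls = 0
        self.decoder_calls = 0

    def encode(self, image):
        # Assumed expensive step, executed once per image.
        self.encoder_calls += 1
        return [min(1.0, max(0.0, p)) for p in image]

    def decode(self, features, noisy_alpha, t):
        # Assumed cheap step: predict a clean alpha from the cached features and
        # the current noisy alpha. A real decoder would be a learned network.
        self.decoder_calls += 1
        w = 0.5 * (1.0 - t)  # trust the noisy input less at high noise levels
        return [min(1.0, max(0.0, (1.0 - w) * f + w * a))
                for f, a in zip(features, noisy_alpha)]


def refine(model, image, steps=5, seed=0):
    """Iteratively denoise an alpha matte while reusing the encoded features."""
    rng = random.Random(seed)
    features = model.encode(image)                # heavy part: run once
    alpha = [rng.gauss(0.0, 1.0) for _ in image]  # start from pure noise
    for i in range(steps, 0, -1):
        t, t_next = i / steps, (i - 1) / steps
        pred = model.decode(features, alpha, t)   # light part: run every step
        # Re-noise the prediction to the next, lower noise level
        # (illustrative cosine schedule; keep == 1 at the final step).
        keep = math.cos(0.5 * math.pi * t_next) ** 2
        sigma = math.sqrt(max(0.0, 1.0 - keep))
        alpha = [math.sqrt(keep) * p + sigma * rng.gauss(0.0, 1.0) for p in pred]
    return alpha


model = ToyMatter()
matte = refine(model, [0.0, 0.2, 0.9, 1.0], steps=5)
```

In this sketch the encoder runs once and the decoder runs once per step, so the cost of adding refinement steps grows with the decoder alone.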
Pages: 181 - 199
Page count: 19