Diffusion for Natural Image Matting

被引:0
|
作者
Hu, Yihan [1 ,2 ,5 ]
Lin, Yiheng [1 ,2 ]
Wang, Wei [1 ,2 ]
Zhao, Yao [1 ,2 ,3 ]
Wei, Yunchao [1 ,2 ,3 ]
Shi, Humphrey [4 ,5 ]
机构
[1] Beijing Jiaotong Univ, Inst Informat Sci, Beijing, Peoples R China
[2] Minist Educ, Visual Intelligence X Int Joint Lab, Beijing, Peoples R China
[3] Pengcheng Lab, Shenzhen, Peoples R China
[4] Georgia Inst Technol, Atlanta, GA 30332 USA
[5] Picsart AI Res PAIR, Atlanta, GA USA
来源
COMPUTER VISION-ECCV 2024, PT LVII | 2025年 / 15115卷
关键词
Image matting; Diffusion process; Iterative refinement;
D O I
10.1007/978-3-031-72998-0_11
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Existing natural image matting algorithms inevitably have flaws in their predictions on difficult cases, and their one-step prediction manner cannot further correct these errors. In this paper, we investigate a multi-step iterative approach for the first time to tackle the challenging natural image matting task, and achieve excellent performance by introducing a pixel-level denoising diffusion method (DiffMatte) for the alpha matte refinement. To improve iteration efficiency, we design a lightweight diffusion decoder as the only iterative component to directly denoise the alpha matte, saving the huge computational overhead of repeatedly encoding matting features. We also propose an ameliorated self-aligned strategy to consolidate the performance gains brought about by the iterative diffusion process. This allows the model to adapt to various types of errors by aligning the noisy samples used in training and inference, mitigating performance degradation caused by sampling drift. Extensive experimental results demonstrate that DiffMatte not only reaches the state-of-the-art level on the mainstream Composition-1k test set, surpassing the previous best methods by 8% and 15% in the SAD metric and MSE metric respectively, but also show stronger generalization ability in other benchmarks. The code will be open-sourced for the following research and applications. Code is available at https://github.com/YihanHu-2022/DiffMatte.
引用
收藏
页码:181 / 199
页数:19
相关论文
共 50 条
  • [21] A Study on Image Matting Techniques
    Parihar, Anil Singh
    2020 5TH IEEE INTERNATIONAL CONFERENCE ON RECENT ADVANCES AND INNOVATIONS IN ENGINEERING (IEEE - ICRAIE-2020), 2020,
  • [22] Hierarchical and Progressive Image Matting
    Qiao, Yu
    Liu, Yuhao
    Wei, Ziqi
    Wang, Yuxin
    Cai, Qiang
    Zhang, Guofeng
    Yang, Xin
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (02)
  • [23] Smart Scribbles for Image Matting
    Yang, Xin
    Qiao, Yu
    Chen, Shaozhe
    He, Shengfeng
    Yin, Baocai
    Zhang, Qiang
    Wei, Xiaopeng
    Lau, Rynson W. H.
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2021, 16 (04)
  • [24] Medical matting: Medical image segmentation with uncertainty from the matting perspective
    Wang, Lin
    Ye, Xiufen
    Ju, Lie
    He, Wanji
    Zhang, Donghao
    Wang, Xin
    Huang, Yelin
    Feng, Wei
    Song, Kaimin
    Ge, Zongyuan
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 158
  • [25] Automatic and Accurate Image Matting
    Hu, Wu-Chih
    Huang, Deng-Yuan
    Yang, Ching-Yu
    Jhu, Jia-Jie
    Lin, Cheng-Pin
    COMPUTATIONAL COLLECTIVE INTELLIGENCE: TECHNOLOGIES AND APPLICATIONS, PT III, 2010, 6423 : 11 - +
  • [26] Situational Perception Guided Image Matting
    Xu, Bo
    Xie, Jiake
    Huang, Han
    Li, Ziwen
    Lu, Cheng
    Tang, Yong
    Guo, Yandong
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 5283 - 5293
  • [27] A GPU-based matting Laplacian solver for high resolution image matting
    Mengcheng Huang
    Fang Liu
    Enhua Wu
    The Visual Computer, 2010, 26 : 943 - 950
  • [28] Flexible Interactive Guided Image Matting
    Cheng, Hang
    Xu, Shugong
    Guo, Fengjun
    IEEE ACCESS, 2023, 11 : 58808 - 58821
  • [29] Image matting through a Web browser
    Lin, Yen-Chun
    Wang, Hsiang-An
    Hsieh, Yi-Fang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2012, 61 (03) : 551 - 570
  • [30] Image matting through a Web browser
    Yen-Chun Lin
    Hsiang-An Wang
    Yi-Fang Hsieh
    Multimedia Tools and Applications, 2012, 61 : 551 - 570