Diffusion for Natural Image Matting

Cited by: 0
Authors
Hu, Yihan [1, 2, 5]
Lin, Yiheng [1, 2]
Wang, Wei [1, 2]
Zhao, Yao [1, 2, 3]
Wei, Yunchao [1, 2, 3]
Shi, Humphrey [4, 5]
Affiliations
[1] Beijing Jiaotong Univ, Inst Informat Sci, Beijing, Peoples R China
[2] Minist Educ, Visual Intelligence X Int Joint Lab, Beijing, Peoples R China
[3] Pengcheng Lab, Shenzhen, Peoples R China
[4] Georgia Inst Technol, Atlanta, GA 30332 USA
[5] Picsart AI Res PAIR, Atlanta, GA USA
Source
COMPUTER VISION-ECCV 2024, PT LVII | 2025 / Vol. 15115
Keywords
Image matting; Diffusion process; Iterative refinement;
DOI
10.1007/978-3-031-72998-0_11
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Existing natural image matting algorithms inevitably make errors on difficult cases, and because they predict in a single step they cannot correct those errors afterwards. In this paper, we investigate a multi-step iterative approach to the challenging natural image matting task for the first time, and achieve excellent performance by introducing a pixel-level denoising diffusion method (DiffMatte) for alpha matte refinement. To improve iteration efficiency, we design a lightweight diffusion decoder as the only iterative component; it denoises the alpha matte directly, which avoids the large computational overhead of repeatedly encoding matting features. We also propose an ameliorated self-aligned strategy to consolidate the performance gains brought about by the iterative diffusion process. This strategy aligns the noisy samples used in training and inference, which lets the model adapt to various types of errors and mitigates the performance degradation caused by sampling drift. Extensive experimental results demonstrate that DiffMatte not only reaches the state-of-the-art level on the mainstream Composition-1k test set, surpassing the previous best methods by 8% and 15% in the SAD and MSE metrics respectively, but also shows stronger generalization ability on other benchmarks. Code is available at https://github.com/YihanHu-2022/DiffMatte.
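The "encode once, decode iteratively" idea in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: `ToyEncoder`, `ToyDecoder`, `signal_level`, and `refine_alpha` are hypothetical names, the cosine schedule and the deterministic DDIM-style update are assumptions, and the "networks" are simple arithmetic stand-ins. It only shows the control flow in which the expensive encoder runs a single time while the lightweight decoder runs at every denoising step.

```python
import math
import random


def signal_level(t):
    """Cosine schedule (assumed): 1.0 at t=0 (clean), about 0.0 at t=1 (noise)."""
    return math.cos(0.5 * math.pi * t) ** 2


def clip01(v):
    """Clamp a value to the valid alpha range [0, 1]."""
    return min(1.0, max(0.0, v))


class ToyEncoder:
    """Hypothetical stand-in for the heavy matting encoder."""

    def __init__(self):
        self.calls = 0

    def __call__(self, image):
        self.calls += 1
        # Toy "features": the clipped pixel intensities themselves.
        return [clip01(p) for p in image]


class ToyDecoder:
    """Hypothetical stand-in for the lightweight diffusion decoder."""

    def __init__(self):
        self.calls = 0

    def __call__(self, features, noisy_alpha, t):
        self.calls += 1
        s = signal_level(t)
        # Lean on the features when the sample is mostly noise (small s),
        # and on the current sample when it is mostly signal (large s).
        return [clip01((1.0 - s) * f + s * x)
                for f, x in zip(features, noisy_alpha)]


def refine_alpha(image, encoder, decoder, steps=5, seed=0):
    """Denoise an alpha matte over `steps` iterations, encoding only once."""
    rng = random.Random(seed)
    features = encoder(image)                 # expensive part, run once
    x = [rng.gauss(0.0, 1.0) for _ in image]  # start from pure noise
    for i in range(steps, 0, -1):
        t, t_next = i / steps, (i - 1) / steps
        s, s_next = signal_level(t), signal_level(t_next)
        x0 = decoder(features, x, t)          # predicted clean alpha
        # Noise implied by the current sample and the prediction.
        eps = [(xi - math.sqrt(s) * ai) / math.sqrt(1.0 - s)
               for xi, ai in zip(x, x0)]
        # Deterministic (DDIM-style) move to the next, lower noise level.
        x = [math.sqrt(s_next) * ai + math.sqrt(1.0 - s_next) * e
             for ai, e in zip(x0, eps)]
    return x


encoder, decoder = ToyEncoder(), ToyDecoder()
alpha = refine_alpha([0.0, 0.3, 0.9, 1.0], encoder, decoder, steps=5)
print(encoder.calls, decoder.calls)  # prints: 1 5
```

In this sketch the cost of adding more refinement steps grows only with the decoder, which is the efficiency argument the abstract makes for keeping the decoder as the sole iterative component.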
Pages: 181-199
Number of pages: 19