TFMNet: Trimap-free Real-time Image Matting Algorithm Based On Deep Learning

被引:0
作者
Peng, Ge [1 ]
Yang, Jingzong [1 ]
机构
[1] Baoshan Univ, Sch Big Data, Baoshan 678000, Yunnan, Peoples R China
来源
JOURNAL OF APPLIED SCIENCE AND ENGINEERING | 2024年 / 27卷 / 04期
基金
芬兰科学院;
关键词
real-time image matting; image semantic segmentation; convolution neural network without pooling; image processing; GENERATION;
D O I
10.6180/jase.202404_27(04).0009
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
The conventional image matting algorithms needed priori manual Trimap information to produce excellent matting results which made real time matting impossible. To tackle the problem, a Trimap-free image matting network, TFMNet, is proposed in this paper. The proposed network consists of four modules, ConvNeXt backbone module for image features extraction, Trimap prediction module for normalized Trimap generation, glance matting module for rough matting results prediction, and post-processing module for exact matting results production. To further optimize the training process of the proposed model, an improved Loss function based on frequency domain information is proposed. In experiment, Sets of Experiments designed by variable controlling approach prove that the proposed TFMNet do well in real time image matting. The TFMNet model achieves 8.99, 0.011, 12.31, 11.15 in the accuracy metrics of SAD, MSE, GRAD, CONN, respectively, costs 51ms for one image averagely which meet the real-time requirements, and model size is 671M. Besides, further experiments conducted by comparing with five state-of-the-art models based on three typical matting databases demonstrate the superiority of the proposed algorithm.
引用
收藏
页码:2307 / 2318
页数:12
相关论文
共 30 条
[1]  
Aggarwal A., 2020, J. Comput. Sci., V16, P651
[2]  
Aghdam E.K., 2022, PREPRINT
[3]   IGWO-IVNet3: DL-Based Automatic Diagnosis of Lung Nodules Using an Improved Gray Wolf Optimization and InceptionNet-V3 [J].
Bilal, Anas ;
Shafiq, Muhammad ;
Fang, Fang ;
Waqar, Muhammad ;
Ullah, Inam ;
Ghadi, Yazeed Yasin ;
Long, Haixia ;
Zeng, Rao .
SENSORS, 2022, 22 (24)
[4]   AI-Based Automatic Detection and Classification of Diabetic Retinopathy Using U-Net and Deep Learning [J].
Bilal, Anas ;
Zhu, Liucun ;
Deng, Anan ;
Lu, Huihui ;
Wu, Ning .
SYMMETRY-BASEL, 2022, 14 (07)
[5]   A Transfer Learning and U-Net-based automatic detection of diabetic retinopathy from fundus images [J].
Bilal, Anas ;
Sun, Guangmin ;
Mazhar, Sarah ;
Imran, Azhar ;
Latif, Jahanzaib .
COMPUTER METHODS IN BIOMECHANICS AND BIOMEDICAL ENGINEERING-IMAGING AND VISUALIZATION, 2022, 10 (06) :663-674
[6]  
Brinkmann R., 2008, The art and science of digital compositing: Techniques for visual effects, animation and motion graphics, DOI 10.1016/B978-0-12-370638-6.X0001-6
[7]  
Cao Hu, 2023, Computer Vision - ECCV 2022 Workshops: Proceedings. Lecture Notes in Computer Science (13803), P205, DOI 10.1007/978-3-031-25066-8_9
[8]  
Chauhan S., 2021, Data Science and Data Analytics: Opportunities and Challenges, V1, DOI [10.1201/9781003111290, DOI 10.1201/9781003111290]
[9]  
Howard AG, 2017, Arxiv, DOI arXiv:1704.04861
[10]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778