Research and Application of Real-Time High-Resolution Video Matting Algorithm

被引:0
作者
Duan, Beibei [1 ]
Fan, Xinggang [2 ]
Lei, Yanjing [1 ]
Feng, Zehui [1 ]
Chan, Sixian [1 ]
机构
[1] Zhejiang Univ Technol, Coll Comp Sci & Technol, Hangzhou, Zhejiang, Peoples R China
[2] Zhejiang Univ Technol, Coll Zhijiang, Hangzhou, Zhejiang, Peoples R China
来源
2022 INTERNATIONAL CONFERENCE ON COMPUTERS AND ARTIFICIAL INTELLIGENCE TECHNOLOGIES, CAIT | 2022年
关键词
video matting; foreground segmentation; temporal information; high resolution;
D O I
10.1109/CAIT56099.2022.10072171
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Nowadays, many application tools provide the video matting function. The accuracy of the results of video matting is of great importance in practical applications. Existing video matting methods view video as multiple consecutive frames. The matting for video is also a continuous single-frame matting, and the synthesized video will have obvious flickering problems. We introduce a human video matting method that can address this problem well. We use the temporal information existing in the video to perform video matting. Our method uses a recurrent structure to exploit the temporal information in videos, resulting in improvements in both temporal coherence and matting quality. We train the segmentation and matting on the network at the same time, and take the results of semantic segmentation as input. The method does not require any auxiliary inputs, such as trimap or pre-captured background images, and can be widely applied to existing human matting applications. A large number of experimental results show that our model is superior to MODNet in terms of evaluation metrics, where the lift value is 2.73 on MAD(Mean Absolute Difference), 1.83 on MSE(Mean Squared Error), 0.46 on Grad( Spatial Gradient), 0.3 on Conn(Connectivity), and 0.49 on dtSSD. We also designed a simple, real-time, visual, user-friendly and understandable video matting system, which is convenient for users to achieve video matting.
引用
收藏
页码:85 / 92
页数:8
相关论文
共 36 条
[1]   Designing Effective Inter-Pixel Information Flow for Natural Image Matting [J].
Aksoy, Yagiz ;
Aydin, Tunc Ozan ;
Pollefeys, Marc .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :228-236
[2]  
[Anonymous], 27 INT C MACH LEARN
[3]  
Ballas N., 2016, INT C LEARNING REPRE
[4]   CaMap: Camera-based Map Manipulation on Mobile Devices [J].
Chen, Liang ;
Chen, Dongyi .
PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND APPLICATION ENGINEERING (CSAE2018), 2018,
[5]  
Chen LC, 2017, Arxiv, DOI [arXiv:1706.05587, DOI 10.48550/ARXIV.1706.05587]
[6]   KNN Matting [J].
Chen, Qifeng ;
Li, Dingzeyu ;
Tang, Chi-Keung .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (09) :2175-2188
[7]  
Cho KYHY, 2014, Arxiv, DOI [arXiv:1406.1078, DOI 10.48550/ARXIV.1406.1078]
[8]  
Chuang YY, 2001, PROC CVPR IEEE, P264
[9]  
Erofeev M., 2015, BMVC
[10]  
Forte Marco, 2020, ABS200307711 CORR