Ultrahigh Resolution Image/Video Matting with Spatio-Temporal Sparsity

被引：0

作者：

Sun, Yanan ^{[1
]}

Tang, Chi-Keung ^{[1
]}

Tai, Yu-Wing ^{[1
]}

机构：

[1] HKUST, Hong Kong, Peoples R China

来源：

2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2023年

关键词：

D O I：

10.1109/CVPR52729.2023.01356

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Commodity ultrahigh definition (UHD) displays are becoming more affordable which demand imaging in ultrahigh resolution (UHR). This paper proposes SparseMat, a computationally efficient approach for UHR image/video matting. Note that it is infeasible to directly process UHR images at full resolution in one shot using existing matting algorithms without running out of memory on consumer-level computational platforms, e.g., Nvidia 1080Ti with 11G memory, while patch-based approaches can introduce unsightly artifacts due to patch partitioning. Instead, our method resorts to spatial and temporal sparsity for addressing general UHR matting. When processing videos, huge computation redundancy can be reduced by exploiting spatial and temporal sparsity. In this paper, we show how to effectively detect spatio-temporal sparsity, which serves as a gate to activate input pixels for the matting model. Under the guidance of such sparsity, our method with sparse high-resolution module (SHM) can avoid patch-based inference while memory efficient for full-resolution matte refinement. Extensive experiments demonstrate that SparseMat can effectively and efficiently generate high-quality alpha matte for UHR images and videos at the original high resolution in a single pass. Project page is in https://github.com/nowsyn/SparseMat.git.

引用

页码：14112 / 14121

页数：10

共 50 条

[31] Video Segmentation with Spatio-Temporal Tubes
Trichet, Remi
Nevatia, Ramakant
2013 10TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS 2013), 2013, : 330 - 335
[32] Spatio-temporal querying in video databases
Köprülü, M
Çiçekli, NK
Yazici, A
FLEXIBLE QUERY ANSWERING SYSTEMS, PROCEEDINGS, 2002, 2522 : 251 - 262
[33] Spatio-Temporal Perturbations for Video Attribution
Li, Zhenqiang
Wang, Weimin
Li, Zuoyue
Huang, Yifei
Sato, Yoichi
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (04) : 2043 - 2056
[34] Lightweight video super-resolution based on hybrid spatio-temporal convolution
Xia, Zhenping
Chen, Hao
Zhang, Yuning
Cheng, Cheng
Hu, Fuyuan
Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2024, 32 (16): : 2564 - 2576
[35] SPATIO-TEMPORAL INTERACTION IN VISUAL RESOLUTION
RASHBASS, C
JOURNAL OF PHYSIOLOGY-LONDON, 1968, 196 (02): : P102 - &
[36] SPATIO-TEMPORAL VIDEO FILTERING FOR VIDEO SURVEILLANCE APPLICATIONS
Ben Hamida, Amal
Koubaa, Mohamed
Nicolas, Henri
Ben Amar, Chokri
ELECTRONIC PROCEEDINGS OF THE 2013 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS (ICMEW), 2013,
[37] Spatio-temporal conflict detection and resolution
Howarth R.J.
Tsang E.P.K.
Constraints, 1998, 3 (4) : 343 - 361
[38] Spatio-temporal conflict detection and resolution
Howarth, Richard J.
Tsang, Edward P. K.
Constraints, 1998, 3 (04) : 343 - 361
[39] Bidirectional spatio-temporal generative adversarial network for video super-resolution
Yang, Peng
Chen, Zhangquan
Sun, Yuankang
Hu, Zhongjian
Li, Bing
PATTERN ANALYSIS AND APPLICATIONS, 2025, 28 (01)
[40] Efficient Spatio-Temporal Network with Gated Fusion for Video Super-Resolution
Li, Changyu
Zhang, Dongyang
Xie, Ning
Shao, Jie
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2021, PT V, 2021, 12895 : 640 - 651

← 1 2 3 4 5 →