Ultrahigh Resolution Image/Video Matting with Spatio-Temporal Sparsity

被引:0
作者
Sun, Yanan [1 ]
Tang, Chi-Keung [1 ]
Tai, Yu-Wing [1 ]
机构
[1] HKUST, Hong Kong, Peoples R China
来源
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2023年
关键词
D O I
10.1109/CVPR52729.2023.01356
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Commodity ultrahigh definition (UHD) displays are becoming more affordable which demand imaging in ultrahigh resolution (UHR). This paper proposes SparseMat, a computationally efficient approach for UHR image/video matting. Note that it is infeasible to directly process UHR images at full resolution in one shot using existing matting algorithms without running out of memory on consumer-level computational platforms, e.g., Nvidia 1080Ti with 11G memory, while patch-based approaches can introduce unsightly artifacts due to patch partitioning. Instead, our method resorts to spatial and temporal sparsity for addressing general UHR matting. When processing videos, huge computation redundancy can be reduced by exploiting spatial and temporal sparsity. In this paper, we show how to effectively detect spatio-temporal sparsity, which serves as a gate to activate input pixels for the matting model. Under the guidance of such sparsity, our method with sparse high-resolution module (SHM) can avoid patch-based inference while memory efficient for full-resolution matte refinement. Extensive experiments demonstrate that SparseMat can effectively and efficiently generate high-quality alpha matte for UHR images and videos at the original high resolution in a single pass. Project page is in https://github.com/nowsyn/SparseMat.git.
引用
收藏
页码:14112 / 14121
页数:10
相关论文
共 50 条
  • [31] Video Segmentation with Spatio-Temporal Tubes
    Trichet, Remi
    Nevatia, Ramakant
    2013 10TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS 2013), 2013, : 330 - 335
  • [32] Spatio-temporal querying in video databases
    Köprülü, M
    Çiçekli, NK
    Yazici, A
    FLEXIBLE QUERY ANSWERING SYSTEMS, PROCEEDINGS, 2002, 2522 : 251 - 262
  • [33] Spatio-Temporal Perturbations for Video Attribution
    Li, Zhenqiang
    Wang, Weimin
    Li, Zuoyue
    Huang, Yifei
    Sato, Yoichi
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (04) : 2043 - 2056
  • [34] Lightweight video super-resolution based on hybrid spatio-temporal convolution
    Xia, Zhenping
    Chen, Hao
    Zhang, Yuning
    Cheng, Cheng
    Hu, Fuyuan
    Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2024, 32 (16): : 2564 - 2576
  • [35] SPATIO-TEMPORAL INTERACTION IN VISUAL RESOLUTION
    RASHBASS, C
    JOURNAL OF PHYSIOLOGY-LONDON, 1968, 196 (02): : P102 - &
  • [36] SPATIO-TEMPORAL VIDEO FILTERING FOR VIDEO SURVEILLANCE APPLICATIONS
    Ben Hamida, Amal
    Koubaa, Mohamed
    Nicolas, Henri
    Ben Amar, Chokri
    ELECTRONIC PROCEEDINGS OF THE 2013 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS (ICMEW), 2013,
  • [37] Spatio-temporal conflict detection and resolution
    Howarth R.J.
    Tsang E.P.K.
    Constraints, 1998, 3 (4) : 343 - 361
  • [38] Spatio-temporal conflict detection and resolution
    Howarth, Richard J.
    Tsang, Edward P. K.
    Constraints, 1998, 3 (04) : 343 - 361
  • [39] Bidirectional spatio-temporal generative adversarial network for video super-resolution
    Yang, Peng
    Chen, Zhangquan
    Sun, Yuankang
    Hu, Zhongjian
    Li, Bing
    PATTERN ANALYSIS AND APPLICATIONS, 2025, 28 (01)
  • [40] Efficient Spatio-Temporal Network with Gated Fusion for Video Super-Resolution
    Li, Changyu
    Zhang, Dongyang
    Xie, Ning
    Shao, Jie
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2021, PT V, 2021, 12895 : 640 - 651