A Survey on Deep Learning Technique for Video Segmentation

被引:88
作者
Zhou, Tianfei [1 ]
Porikli, Fatih [2 ]
Crandall, David J. [3 ]
Van Gool, Luc [1 ]
Wang, Wenguan [4 ]
机构
[1] Swiss Fed Inst Technol, CH-8092 Zurich, Switzerland
[2] Australian Natl Univ, Sch Comp Sci, Canberra, ACT 2601, Australia
[3] Indiana Univ, Luddy Sch Informat Comp & Engn, Bloomington, IN 47405 USA
[4] Univ Technol Sydney, Australian Artificial Intelligence Inst, ReLER Lab, Ultimo, NSW 2007, Australia
基金
澳大利亚研究理事会;
关键词
Object segmentation; Automobiles; Semantic segmentation; Task analysis; Motion segmentation; Deep learning; Roads; Video segmentation; video object segmentation; video semantic segmentation; deep learning; OBJECT SEGMENTATION; TRACKING; IMAGE; AGGREGATION; NETWORKS;
D O I
10.1109/TPAMI.2022.3225573
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video segmentation-partitioning video frames into multiple segments or objects-plays a critical role in a broad range of practical applications, from enhancing visual effects in movie, to understanding scenes in autonomous driving, to creating virtual background in video conferencing. Recently, with the renaissance of connectionism in computer vision, there has been an influx of deep learning based approaches for video segmentation that have delivered compelling performance. In this survey, we comprehensively review two basic lines of research - generic object segmentation (of unknown categories) in videos, and video semantic segmentation - by introducing their respective task settings, background concepts, perceived need, development history, and main challenges. We also offer a detailed overview of representative literature on both methods and datasets. We further benchmark the reviewed methods on several well-known datasets. Finally, we point out open issues in this field, and suggest opportunities for further research. We also provide a public website to continuously track developments in this fast advancing field: https://github.com/tfzhou/VS-Survey.
引用
收藏
页码:7099 / 7122
页数:24
相关论文
共 50 条
  • [41] Temporal video scene segmentation using deep-learning
    Trojahn, Tiago Henrique
    Goularte, Rudinei
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (12) : 17487 - 17513
  • [42] Video restoration based on deep learning: a comprehensive survey
    Claudio Rota
    Marco Buzzelli
    Simone Bianco
    Raimondo Schettini
    [J]. Artificial Intelligence Review, 2023, 56 : 5317 - 5364
  • [43] A Survey of Deep Learning Video Super-Resolution
    Baniya, Arbind Agrahari
    Lee, Tsz-Kwan
    Eklund, Peter W.
    Aryal, Sunil
    [J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 8 (04): : 2655 - 2676
  • [44] Video restoration based on deep learning: a comprehensive survey
    Rota, Claudio
    Buzzelli, Marco
    Bianco, Simone
    Schettini, Raimondo
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (06) : 5317 - 5364
  • [45] Video description: A comprehensive survey of deep learning approaches
    Ghazala Rafiq
    Muhammad Rafiq
    Gyu Sang Choi
    [J]. Artificial Intelligence Review, 2023, 56 : 13293 - 13372
  • [46] A brief survey on RGB-D semantic segmentation using deep learning*
    Wang, Changshuo
    Wang, Chen
    Li, Weijun
    Wang, Haining
    [J]. DISPLAYS, 2021, 70
  • [47] A survey on recent trends in deep learning for nucleus segmentation from histopathology images
    Basu, Anusua
    Senapati, Pradip
    Deb, Mainak
    Rai, Rebika
    Dhal, Krishna Gopal
    [J]. EVOLVING SYSTEMS, 2024, 15 (01) : 203 - 248
  • [48] Deep-Learning-Based Semantic Segmentation of Remote Sensing Images: A Survey
    Huang, Liwei
    Jiang, Bitao
    Lv, Shouye
    Liu, Yanbo
    Fu, Ying
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 8370 - 8396
  • [49] A Survey of Wound Image Analysis Using Deep Learning: Classification, Detection, and Segmentation
    Zhang, Ruyi
    Tian, Dingcheng
    Xu, Dechao
    Qian, Wei
    Yao, Yudong
    [J]. IEEE ACCESS, 2022, 10 : 79502 - 79515
  • [50] Fast body part segmentation and tracking of neonatal video data using deep learning
    Antink, Christoph Hoog
    Ferreira, Joana Carlos Mesquita
    Paul, Michael
    Lyra, Simon
    Heimann, Konrad
    Karthik, Srinivasa
    Joseph, Jayaraj
    Jayaraman, Kumutha
    Orlikowsky, Thorsten
    Sivaprakasam, Mohanasankar
    Leonhardt, Steffen
    [J]. MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2020, 58 (12) : 3049 - 3061