Exploiting recollection effects for memory-based video object segmentation

被引:0
|
作者
Cho E. [1 ]
Kim M. [1 ]
Kim H.-I. [2 ]
Moon J. [2 ]
Kim S.T. [1 ]
机构
[1] Department of Computer Science and Engineering, Kyung Hee University, Gyeonggi-do, Yongin-si
[2] Electronics and Telecommunications Research Institute (ETRI), Daejeon
基金
新加坡国家研究基金会;
关键词
Deep learning; Memory networks; Video object segmentation;
D O I
10.1016/j.imavis.2023.104866
中图分类号
学科分类号
摘要
Recent advances in deep learning have led to numerous studies on video object segmentation (VOS). Memory-based models, in particular, have demonstrated superior performance by leveraging the ability to store and recall information from previous frames. While extensive research efforts have been devoted to developing memory networks for effective VOS, only a few studies have investigated the quality of memory in terms of determining which information should be stored. In fact, in most recent memory-based VOS studies, the frame information is regularly stored in the memory without specific consideration. In other words, there is a lack of explicit criteria or guidelines for determining the essential information that should be retained in memory. In this study, we introduce a new method for evaluating the effect of storing the features, which can be used for various memory-based networks to improve performance in a plug-and-play manner. For this purpose, we introduce the concept of recollection effects, which refers to the stability of predictions based on the presence or absence of specific features in memory. By explicitly measuring the recollection effects, we establish a criterion for evaluating the relevance of information and determining whether features from a particular frame should be stored. This approach effectively encourages memory-based networks to construct memory that contains valuable cues. To validate the effectiveness of our method, we conduct comparative experiments. Experimental results demonstrate the effectiveness of our method to enhance the selection and retention of useful cues within the memory, leading to improving segmentation results. © 2023 Elsevier B.V.
引用
收藏
相关论文
共 50 条
  • [1] Learning Position and Target Consistency for Memory-based Video Object Segmentation
    Hu, Li
    Zhang, Peng
    Zhang, Bang
    Pan, Pan
    Xu, Yinghui
    Jin, Rong
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 4142 - 4152
  • [2] Memory-based moving object extraction for video indexing
    Wang, RRY
    Hong, PY
    Huang, T
    15TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 1, PROCEEDINGS: COMPUTER VISION AND IMAGE ANALYSIS, 2000, : 811 - 814
  • [3] Memory-based spatio-temporal real-time object segmentation for video surveillance
    Amer, A
    REAL-TIME IMAGING VII, 2003, 5012 : 10 - 21
  • [4] MIDFA: Memory-Based Instance Division and Feature Aggregation Network for Video Object Detection
    Chen, Qiaochuan
    Zhou, Min
    Yu, Hang
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2023, PT III, 2023, 13937 : 153 - 164
  • [5] Learning Video Object Segmentation with Visual Memory
    Tokmakov, Pavel
    Inria, Karteek Alahari
    Schmid, Cordelia
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 4491 - 4500
  • [6] Adaptive Memory Management for Video Object Segmentation
    Pourganjalikhan, Ali
    Poullis, Charalambos
    2022 19TH CONFERENCE ON ROBOTS AND VISION (CRV 2022), 2022, : 75 - 82
  • [7] Modulated Memory Network for Video Object Segmentation
    Lu, Hannan
    Guo, Zixian
    Zuo, Wangmeng
    MATHEMATICS, 2024, 12 (06)
  • [8] A video object segmentation algorithm based on temporal edge memory compensation
    Zhu, Shi-Ping
    Ma, Li
    Hou, Yang-Shuan
    Guangdianzi Jiguang/Journal of Optoelectronics Laser, 2010, 21 (08): : 1241 - 1246
  • [9] Video Object Segmentation using Point-based Memory Network
    Gao, Mingqi
    Han, Jungong
    Zheng, Feng
    Yu, James J. Q.
    Montana, Giovanni
    PATTERN RECOGNITION, 2023, 134
  • [10] MEMORY-BASED OBJECT DETECTION IN SURVEILLANCE SCENES
    Li, Xudong
    Ye, Mao
    Liu, Dan
    Zhang, Feng
    Tang, Song
    2016 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO (ICME), 2016,