Exploiting recollection effects for memory-based video object segmentation

被引:0
|
作者
Cho E. [1 ]
Kim M. [1 ]
Kim H.-I. [2 ]
Moon J. [2 ]
Kim S.T. [1 ]
机构
[1] Department of Computer Science and Engineering, Kyung Hee University, Gyeonggi-do, Yongin-si
[2] Electronics and Telecommunications Research Institute (ETRI), Daejeon
基金
新加坡国家研究基金会;
关键词
Deep learning; Memory networks; Video object segmentation;
D O I
10.1016/j.imavis.2023.104866
中图分类号
学科分类号
摘要
Recent advances in deep learning have led to numerous studies on video object segmentation (VOS). Memory-based models, in particular, have demonstrated superior performance by leveraging the ability to store and recall information from previous frames. While extensive research efforts have been devoted to developing memory networks for effective VOS, only a few studies have investigated the quality of memory in terms of determining which information should be stored. In fact, in most recent memory-based VOS studies, the frame information is regularly stored in the memory without specific consideration. In other words, there is a lack of explicit criteria or guidelines for determining the essential information that should be retained in memory. In this study, we introduce a new method for evaluating the effect of storing the features, which can be used for various memory-based networks to improve performance in a plug-and-play manner. For this purpose, we introduce the concept of recollection effects, which refers to the stability of predictions based on the presence or absence of specific features in memory. By explicitly measuring the recollection effects, we establish a criterion for evaluating the relevance of information and determining whether features from a particular frame should be stored. This approach effectively encourages memory-based networks to construct memory that contains valuable cues. To validate the effectiveness of our method, we conduct comparative experiments. Experimental results demonstrate the effectiveness of our method to enhance the selection and retention of useful cues within the memory, leading to improving segmentation results. © 2023 Elsevier B.V.
引用
收藏
相关论文
共 50 条
  • [31] ASDeM: Augmenting SAM With Decoupled Memory for Video Object Segmentation
    Liu, Xiaohu
    Luo, Yichuang
    Sun, Wei
    IEEE ACCESS, 2024, 12 : 73218 - 73227
  • [32] Dual Temporal Memory Network for Efficient Video Object Segmentation
    Zhang, Kaihua
    Wang, Long
    Liu, Dong
    Liu, Bo
    Liu, Qingshan
    Li, Zhu
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 1515 - 1523
  • [33] Unsupervised Video Object Segmentation via Prototype Memory Network
    Lee, Minhyeok
    Cho, Suhwan
    Lee, Seunghoon
    Park, Chaewon
    Lee, Sangyoun
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 5913 - 5923
  • [34] Temporally Consistent Referring Video Object Segmentation With Hybrid Memory
    Miao, Bo
    Bennamoun, Mohammed
    Gao, Yongsheng
    Shah, Mubarak
    Mian, Ajmal
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (11) : 11373 - 11385
  • [35] Memory-based cognitive modeling for robust object extraction and tracking
    Wang, Yanjiang
    Qi, Yujuan
    APPLIED INTELLIGENCE, 2013, 39 (03) : 614 - 629
  • [36] Memory-based state prediction in statistical visual object tracking
    Nakajima, T
    PROCEEDINGS OF THE IASTED INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE, 2005, : 444 - 449
  • [37] Memory-based cognitive modeling for robust object extraction and tracking
    Yanjiang Wang
    Yujuan Qi
    Applied Intelligence, 2013, 39 : 614 - 629
  • [38] Video object segmentation based on object enhancement and region merging
    Ryan, Ken
    Amer, Aishy
    Gagnon, Langis
    2006 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO - ICME 2006, VOLS 1-5, PROCEEDINGS, 2006, : 273 - +
  • [39] VideoMatch: Matching Based Video Object Segmentation
    Hu, Yuan-Ting
    Huang, Jia-Bin
    Schwing, Alexander G.
    COMPUTER VISION - ECCV 2018, PT VIII, 2018, 11212 : 56 - 73
  • [40] Video Object Segmentation Based on Superpixel Trajectories
    Abdelwahab, Mohamed A.
    Abdelwahab, Moataz M.
    Uchiyama, Hideaki
    Shimada, Atsushi
    Taniguchi, Rin-ichiro
    IMAGE ANALYSIS AND RECOGNITION (ICIAR 2016), 2016, 9730 : 191 - 197