Learning effective feature representation for video object segmentation via memory

被引:3
作者
Li, Jun [1 ]
Sun, Lijuan [2 ]
Ren, Hengyi [3 ]
Cao, Ying [1 ]
Li, Suya [1 ]
Xie, Xin [1 ]
机构
[1] Henan Univ, Jinming Campus,Jinming Ave, Kaifeng 475001, Peoples R China
[2] Nanjing Univ Posts & Telecommun, Xianlin Campus,9 Wenyuan Rd, Nanjing 210023, Peoples R China
[3] Nanjing Forestry Univ, 159 Longpan Rd, Nanjing 475001, Peoples R China
关键词
Video object segmentation; Effective feature representation; Per-object memory enhancement; Quality-aware weight assessment;
D O I
10.1016/j.knosys.2024.112020
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To solve the problem of target discrimination being ignored in developing the feature of the current frame, this paper proposes the effective feature representation via memory (EFRM) method to form the effective and discriminative feature representation of the current frame from global and local perspectives by fully benefiting from the rich information contained in the memorized frames. First, the global discriminative feature, representing the differences between the foreground and background, is generated through the space- time memory read (STMR) with the nonlocal matching scheme. Second, the local discriminative feature, which has feature differences among targets that appear in the current frame, is generated by the designed per -object memory enhancement (PoME) by relying only on the diverse representations of the targets shown in the memorized frames. Finally, the segmentation of the current frame is generated based on the effective feature representation formed by the concatenation of global and local discriminative features. Evaluations on DAVIS 16, 17 and YouTube-VOS 18, 19 demonstrate the competitive performance of the proposed method.
引用
收藏
页数:11
相关论文
共 49 条
[1]   State-Aware Tracker for Real-Time Video Object Segmentation [J].
Chen, Xi ;
Li, Zuoxin ;
Yuan, Ye ;
Yu, Gang ;
Shen, Jianxin ;
Qi, Donglian .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :9381-9390
[2]  
Cheng HK, 2021, ADV NEUR IN, V34
[3]   Modular Interactive Video Object Segmentation: Interaction-to-Mask, Propagation and Difference-Aware Fusion [J].
Cheng, Ho Kei ;
Tai, Yu-Wing ;
Tang, Chi-Keung .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :5555-5564
[4]   Fast and Accurate Online Video Object Segmentation via Tracking Parts [J].
Cheng, Jingchun ;
Tsai, Yi-Hsuan ;
Hung, Wei-Chih ;
Wang, Shengjin ;
Yang, Ming-Hsuan .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :7415-7424
[5]   Pixel-Level Bijective Matching for Video Object Segmentation [J].
Cho, Suhwan ;
Lee, Heansung ;
Kim, Minjung ;
Jang, Sungjun ;
Lee, Sangyoun .
2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, :1453-1462
[6]   CapsuleVOS: Semi-Supervised Video Object Segmentation Using Capsule Routing [J].
Duarte, Kevin ;
Rawat, Yogesh S. ;
Shah, Mubarak .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :8479-8488
[7]   SSTVOS: Sparse Spatiotemporal Transformers for Video Object Segmentation [J].
Duke, Brendan ;
Ahmed, Abdalla ;
Wolf, Christian ;
Aarabi, Parham ;
Taylor, Graham W. .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :5908-5917
[8]   Deep learning for video object segmentation: a review [J].
Gao, Mingqi ;
Zheng, Feng ;
Yu, James J. Q. ;
Shan, Caifeng ;
Ding, Guiguang ;
Han, Jungong .
ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (01) :457-531
[9]   Video Object Segmentation Using Global and Instance Embedding Learning [J].
Ge, Wenbin ;
Lu, Xiankai ;
Shen, Jianbing .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :16831-16840
[10]   Learning Position and Target Consistency for Memory-based Video Object Segmentation [J].
Hu, Li ;
Zhang, Peng ;
Zhang, Bang ;
Pan, Pan ;
Xu, Yinghui ;
Jin, Rong .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :4142-4152