Coherence-aware context aggregator for fast video object segmentation

Cited by: 23

Authors:

Lan, Meng [1]

Zhang, Jing [2]

Wang, Zengmao [1]

Affiliations:

[1] Wuhan Univ, Sch Comp Sci, Wuhan, Peoples R China

[2] Univ Sydney, Sch Comp Sci, Camperdown, Australia

Source:

PATTERN RECOGNITION | 2023 / Volume 136

Funding:

National Natural Science Foundation of China;

Keywords:

Video object segmentation; Semi-supervised learning; Spatio-temporal representation; Context;

DOI:

10.1016/j.patcog.2022.109214

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Semi-supervised video object segmentation (VOS) is a highly challenging problem that has attracted much research attention in recent years. Temporal context plays an important role in VOS by providing object clues from past frames. However, most prevailing methods directly use the predicted temporal results to guide the segmentation of the current frame and ignore the coherence of the temporal context, which may be misleading and degrade performance. In this paper, we propose a novel model named Coherence-aware Context Aggregator (CCA) for VOS, which consists of three modules. First, a coherence-aware module (CAM) evaluates the coherence of the predicted result of the current frame and then fuses the coherent features to update the temporal context. CAM can determine whether the prediction is accurate, thus guiding the update of the temporal context and avoiding the introduction of erroneous information. Second, we devise a spatio-temporal context aggregation (STCA) module that aggregates the temporal context with the spatial feature of the current frame to learn a robust and discriminative target representation in the decoder part. Third, we design a refinement module that refines the coarse feature generated by the STCA module for more precise segmentation. Additionally, CCA uses a cropping strategy and takes small-size images as input, which makes it computationally efficient and lets it run at real-time speed. Extensive experiments on four challenging benchmarks show that CCA achieves a better trade-off between efficiency and accuracy than state-of-the-art methods. The code will be made public. (c) 2022 Elsevier Ltd. All rights reserved.
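
The idea of gating the temporal-context update on a coherence check can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify how CAM scores coherence or fuses features, so the cosine-similarity score, the `threshold` and `momentum` parameters, and the function names below are all hypothetical stand-ins chosen only to show the control flow (score the new prediction, update the context only when it looks coherent).

```python
from math import sqrt


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length feature vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = sqrt(sum(x * x for x in a))
    norm_b = sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def update_context(context, frame_feature, threshold=0.5, momentum=0.9):
    """Hypothetical coherence-gated update of a temporal context vector.

    Assumption: coherence is approximated by cosine similarity between the
    stored context and the feature of the current prediction. The paper's
    CAM is a learned module; this stand-in only mirrors the gating idea.
    """
    score = cosine_similarity(context, frame_feature)
    if score < threshold:
        # Prediction looks incoherent: keep the old context unchanged.
        return list(context), score
    # Prediction looks coherent: blend it into the context (moving average).
    updated = [momentum * c + (1.0 - momentum) * f
               for c, f in zip(context, frame_feature)]
    return updated, score


if __name__ == "__main__":
    ctx = [1.0, 0.0]
    print(update_context(ctx, [2.0, 0.0]))  # coherent feature, context is updated
    print(update_context(ctx, [0.0, 1.0]))  # incoherent feature, context is kept
```

In this sketch an orthogonal feature leaves the context untouched, while an aligned feature is blended in; a learned coherence score would replace the fixed similarity and threshold.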

Pages: 12