FUSION TARGET ATTENTION MASK GENERATION NETWORK FOR VIDEO SEGMENTATION

被引：0

作者：

Li, Yunyi ^{[1
]}

Chen, Fangping ^{[2
]}

Yang, Fan ^{[2
]}

Li, Yuan ^{[2
]}

Jia, Huizhu ^{[2
]}

Xie, Xiaodong ^{[2
]}

机构：

[1] Peking Univ, Shenzhen Grad Sch, Beijing, Peoples R China

[2] Peking Univ, Natl Engn Lab Video Technol, Beijing, Peoples R China

来源：

2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) | 2020年

关键词：

video object segmentation; attention; optical flow; mask; loss function;

D O I：

10.1109/icip40778.2020.9190879

中图分类号：

TB8 [摄影技术];

学科分类号：

0804 ;

摘要：

Video segmentation aims to segment target objects in a video sequence, which remains a challenge due to the motion and deformation of objects. In this paper, we propose a novel attention-driven hybrid encoder-decoder network that generates object segmentation by fully leveraging spatial and temporal information. Firstly, a multi-branch network is designed to learn feature representation from object appearance, location and motion. Secondly, a target attention module is proposed to further exploit context information from learned representation. In addition, a novel edge loss is designed which constraints the model to generate salient edge features and accurate segmentation. The proposed model has been evaluated over two widely used public benchmarks, and experiments demonstrate its superior robustness and effectiveness as compared with the state of the arts.

引用

页码：2276 / 2280

页数：5

共 50 条

[1] Optical Flow-Guided Mask Generation Network For Video Segmentation
Li, Yunyi
Chen, Fangping
Yang, Fan
Ma, Cong
Li, Yuan
Jia, Huizhu
Xie, Xiaodong
2020 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2020,
[2] Asymmetric Attention Fusion for Unsupervised Video Object Segmentation
Jiang, Hongfan
Wu, Xiaojun
Xu, Tianyang
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT VI, 2024, 14430 : 170 - 182
[3] Dual Attention Based Network with Hierarchical ConvLSTM for Video Object Segmentation
Zhao, Zongji
Zhao, Sanyuan
PATTERN RECOGNITION AND COMPUTER VISION, PT IV, 2021, 13022 : 323 - 335
[4] Attention-Guided Network for Semantic Video Segmentation
Li, Jiangyun
Zhao, Yikai
Fu, Jun
Wu, Jiajia
Liu, Jing
IEEE ACCESS, 2019, 7 : 140680 - 140689
[5] Multi-Attention Network for Unsupervised Video Object Segmentation
Zhang, Guifang
Wong, Hon-Cheng
Lo, Sio-Long
IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 71 - 75
[6] MAIN: Multi-Attention Instance Network for video segmentation
Alcazar, Juan Leon
Bravo, Maria A.
Jeanneret, Guillaume
Thabet, Ali K.
Brox, Thomas
Arbelaez, Pablo
Ghanem, Bernard
COMPUTER VISION AND IMAGE UNDERSTANDING, 2021, 210
[7] SSFNET-VOS: Semantic segmentation and fusion network for video object segmentation
Sharma, Vipal Kumar
Mir, Roohie Naaz
PATTERN RECOGNITION LETTERS, 2020, 140 : 49 - 58
[8] Weakly Supervised Video Object Segmentation via Dual-attention Cross-branch Fusion
Wei, Lili
Lang, Congyan
Liang, Liqian
Feng, Songhe
Wang, Tao
Chen, Shidi
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2022, 13 (03)
[9] COMatchNet: Co-Attention Matching Network for Video Object Segmentation
Huang, Lufei
Sun, Fengming
Yuan, Xia
PATTERN RECOGNITION, ACPR 2021, PT I, 2022, 13188 : 271 - 284
[10] MAF-DeepLab: A Multiscale Attention Fusion Network for Semantic Segmentation
Chen, Ning
Chen, Yupeng
Wang, Qinfeng
Wu, Shaopeng
Zhang, Hongyi
TRAITEMENT DU SIGNAL, 2022, 39 (02) : 407 - 417

← 1 2 3 4 5 →