Visual Attention Guided Video Object Segmentation

被引：0

作者：

Liang, Hao ^{[1
]}

Tan, Yihua ^{[1
]}

机构：

[1] Huazhong Univ Sci & Technol, Natl Key Lab Sci & Technol Multispectral Informat, Sch Automat, Wuhan, Peoples R China

来源：

PROCEEDINGS OF THE 2019 14TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA 2019) | 2019年

关键词：

video object segmentation; visual attention; visual guide; spatial guide;

D O I：

10.1109/iciea.2019.8834292

中图分类号：

T [工业技术];

学科分类号：

08 ;

摘要：

Recently, video object segmentation (VOS) is a new challenging research direction from DAVIS competition. Carrying on with these researches, we propose a visual attention guided framework in video object segmentation, which includes four main components: segmentation network, visual encoder, spatial encoder and guide. The segmentation network predicts the object mask in the current video frame, and the visual guide force segmentation network to focus on the annotated object by visual information from visual encoder, and the spatial guide provide spatial location by spatial encoder from previous frame. Visual attention mechanism plays an important role in the model on capturing annotated object without online fine-tuning as previous models. This approach has an advantage over previous methods on accuracy and efficiency, especially avoid the online fine-tuning in those one-shot learning approaches.

引用

页码：345 / 349

页数：5

共 27 条

[1]

[Anonymous], 2018, IEEE C COMP VIS PATT

[2] One-Shot Video Object Segmentation [J].

Caelles, S. ;

Maninis, K. -K. ;

Pont-Tuset, J. ;

Leal-Taixe, L. ;

Cremers, D. ;

Van Gool, L. .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :5320-5329

[3] MegDet: A Large Mini-Batch Object Detector [J].

Peng, Chao ;

Xiao, Tete ;

Li, Zeming ;

Jiang, Yuning ;

Zhang, Xiangyu ;

Jia, Kai ;

Yu, Gang ;

Sun, Jian .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :6181-6189

[4]

Chen LC, 2014, ARXIV

[5] Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].

Chen, Liang-Chieh ;

Zhu, Yukun ;

Papandreou, George ;

Schroff, Florian ;

Adam, Hartwig .

COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851

[6] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].

Chen, Liang-Chieh ;

Papandreou, George ;

Kokkinos, Iasonas ;

Murphy, Kevin ;

Yuille, Alan L. .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848

[7] The Cityscapes Dataset for Semantic Urban Scene Understanding [J].

Cordts, Marius ;

Omran, Mohamed ;

Ramos, Sebastian ;

Rehfeld, Timo ;

Enzweiler, Markus ;

Benenson, Rodrigo ;

Franke, Uwe ;

Roth, Stefan ;

Schiele, Bernt .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3213-3223

[8]

De Vries El, 2017, ADV NEURAL INFORM PR, P6594

[9]

Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848

[10]

F Perazzi, 2016, P IEEE C COMP VIS PA, P724

← 1 2 3 →