FUSION TARGET ATTENTION MASK GENERATION NETWORK FOR VIDEO SEGMENTATION

被引:0
|
作者
Li, Yunyi [1 ]
Chen, Fangping [2 ]
Yang, Fan [2 ]
Li, Yuan [2 ]
Jia, Huizhu [2 ]
Xie, Xiaodong [2 ]
机构
[1] Peking Univ, Shenzhen Grad Sch, Beijing, Peoples R China
[2] Peking Univ, Natl Engn Lab Video Technol, Beijing, Peoples R China
来源
2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) | 2020年
关键词
video object segmentation; attention; optical flow; mask; loss function;
D O I
10.1109/icip40778.2020.9190879
中图分类号
TB8 [摄影技术];
学科分类号
0804 ;
摘要
Video segmentation aims to segment target objects in a video sequence, which remains a challenge due to the motion and deformation of objects. In this paper, we propose a novel attention-driven hybrid encoder-decoder network that generates object segmentation by fully leveraging spatial and temporal information. Firstly, a multi-branch network is designed to learn feature representation from object appearance, location and motion. Secondly, a target attention module is proposed to further exploit context information from learned representation. In addition, a novel edge loss is designed which constraints the model to generate salient edge features and accurate segmentation. The proposed model has been evaluated over two widely used public benchmarks, and experiments demonstrate its superior robustness and effectiveness as compared with the state of the arts.
引用
收藏
页码:2276 / 2280
页数:5
相关论文
共 50 条
  • [1] Optical Flow-Guided Mask Generation Network For Video Segmentation
    Li, Yunyi
    Chen, Fangping
    Yang, Fan
    Ma, Cong
    Li, Yuan
    Jia, Huizhu
    Xie, Xiaodong
    2020 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2020,
  • [2] Asymmetric Attention Fusion for Unsupervised Video Object Segmentation
    Jiang, Hongfan
    Wu, Xiaojun
    Xu, Tianyang
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT VI, 2024, 14430 : 170 - 182
  • [3] Dual Attention Based Network with Hierarchical ConvLSTM for Video Object Segmentation
    Zhao, Zongji
    Zhao, Sanyuan
    PATTERN RECOGNITION AND COMPUTER VISION, PT IV, 2021, 13022 : 323 - 335
  • [4] Attention-Guided Network for Semantic Video Segmentation
    Li, Jiangyun
    Zhao, Yikai
    Fu, Jun
    Wu, Jiajia
    Liu, Jing
    IEEE ACCESS, 2019, 7 : 140680 - 140689
  • [5] Multi-Attention Network for Unsupervised Video Object Segmentation
    Zhang, Guifang
    Wong, Hon-Cheng
    Lo, Sio-Long
    IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 71 - 75
  • [6] MAIN: Multi-Attention Instance Network for video segmentation
    Alcazar, Juan Leon
    Bravo, Maria A.
    Jeanneret, Guillaume
    Thabet, Ali K.
    Brox, Thomas
    Arbelaez, Pablo
    Ghanem, Bernard
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2021, 210
  • [7] SSFNET-VOS: Semantic segmentation and fusion network for video object segmentation
    Sharma, Vipal Kumar
    Mir, Roohie Naaz
    PATTERN RECOGNITION LETTERS, 2020, 140 : 49 - 58
  • [8] Weakly Supervised Video Object Segmentation via Dual-attention Cross-branch Fusion
    Wei, Lili
    Lang, Congyan
    Liang, Liqian
    Feng, Songhe
    Wang, Tao
    Chen, Shidi
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2022, 13 (03)
  • [9] COMatchNet: Co-Attention Matching Network for Video Object Segmentation
    Huang, Lufei
    Sun, Fengming
    Yuan, Xia
    PATTERN RECOGNITION, ACPR 2021, PT I, 2022, 13188 : 271 - 284
  • [10] MAF-DeepLab: A Multiscale Attention Fusion Network for Semantic Segmentation
    Chen, Ning
    Chen, Yupeng
    Wang, Qinfeng
    Wu, Shaopeng
    Zhang, Hongyi
    TRAITEMENT DU SIGNAL, 2022, 39 (02) : 407 - 417