FUSION TARGET ATTENTION MASK GENERATION NETWORK FOR VIDEO SEGMENTATION

被引:0
作者
Li, Yunyi [1 ]
Chen, Fangping [2 ]
Yang, Fan [2 ]
Li, Yuan [2 ]
Jia, Huizhu [2 ]
Xie, Xiaodong [2 ]
机构
[1] Peking Univ, Shenzhen Grad Sch, Beijing, Peoples R China
[2] Peking Univ, Natl Engn Lab Video Technol, Beijing, Peoples R China
来源
2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) | 2020年
关键词
video object segmentation; attention; optical flow; mask; loss function;
D O I
10.1109/icip40778.2020.9190879
中图分类号
TB8 [摄影技术];
学科分类号
0804 ;
摘要
Video segmentation aims to segment target objects in a video sequence, which remains a challenge due to the motion and deformation of objects. In this paper, we propose a novel attention-driven hybrid encoder-decoder network that generates object segmentation by fully leveraging spatial and temporal information. Firstly, a multi-branch network is designed to learn feature representation from object appearance, location and motion. Secondly, a target attention module is proposed to further exploit context information from learned representation. In addition, a novel edge loss is designed which constraints the model to generate salient edge features and accurate segmentation. The proposed model has been evaluated over two widely used public benchmarks, and experiments demonstrate its superior robustness and effectiveness as compared with the state of the arts.
引用
收藏
页码:2276 / 2280
页数:5
相关论文
共 50 条
[21]   An Instance Segmentation Method for Insulator Defects Based on an Attention Mechanism and Feature Fusion Network [J].
Wu, Junpeng ;
Deng, Qitong ;
Xian, Ran ;
Tao, Xinguang ;
Zhou, Zhi .
APPLIED SCIENCES-BASEL, 2024, 14 (09)
[22]   Triaxial modality attention fusion with top-down mask generation for enhanced multimodal sentiment analysis [J].
Feng, Cheng ;
Yang, Hai ;
Wang, Shuxian ;
Li, Xue .
JOURNAL OF SUPERCOMPUTING, 2025, 81 (08)
[23]   Visual Attention Guided Video Object Segmentation [J].
Liang, Hao ;
Tan, Yihua .
PROCEEDINGS OF THE 2019 14TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA 2019), 2019, :345-349
[24]   An Attention based Method for Video Semantic Segmentation [J].
Huang, Yuan ;
Huang, Qian ;
Huang, Shuai ;
Li, Yanping .
TWELFTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2020), 2020, 11519
[25]   A Ship Target Location and Mask Generation Algorithms Base on Mask RCNN [J].
Lin Shaodan ;
Feng Chen ;
Chen Zhide .
INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2019, 12 (02) :1134-1143
[26]   A Ship Target Location and Mask Generation Algorithms Base on Mask RCNN [J].
Lin Shaodan ;
Feng Chen ;
Chen Zhide .
International Journal of Computational Intelligence Systems, 2019, 12 :1134-1143
[27]   Hierarchical Co-Attention Propagation Network for Zero-Shot Video Object Segmentation [J].
Pei, Gensheng ;
Yao, Yazhou ;
Shen, Fumin ;
Huang, Dan ;
Huang, Xingguo ;
Shen, Heng-Tao .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 :2348-2359
[28]   Video Object Segmentation Using Multi-Scale Attention-Based Siamese Network [J].
Zhu, Zhiliang ;
Qiu, Leiningxin ;
Wang, Jiaxin ;
Xiong, Jinquan ;
Peng, Hua .
ELECTRONICS, 2023, 12 (13)
[29]   Gated attention unit and mask attention network for traffic flow forecasting [J].
Leng, Sen .
Neural Computing and Applications, 2025, 37 (20) :14889-14905
[30]   Attention-Based Abnormal-Aware Fusion Network for Radiology Report Generation [J].
Xie, Xiancheng ;
Xiong, Yun ;
Yu, Philip S. ;
Li, Kangan ;
Zhang, Suhua ;
Zhu, Yangyong .
DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2019, 11448 :448-452