Bidirectional Relationship Inferring Network for Referring Image Localization and Segmentation

被引:11
|
作者
Feng, Guang [1 ]
Hu, Zhiwei [1 ]
Zhang, Lihe [1 ]
Sun, Jiayu [1 ]
Lu, Huchuan [1 ]
机构
[1] Dalian Univ Technol, Sch Informat & Commun Engn, Dalian 116024, Peoples R China
基金
中国国家自然科学基金;
关键词
Image segmentation; Location awareness; Visualization; Task analysis; Linguistics; Semantics; Feature extraction; Language-guided visual attention; referring image localization and segmentation; segmentation-guided feature augmentation; vision-guided linguistic attention (VLAM);
D O I
10.1109/TNNLS.2021.3106153
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, referring image localization and segmentation has aroused widespread interest. However, the existing methods lack a clear description of the interdependence between language and vision. To this end, we present a bidirectional relationship inferring network (BRINet) to effectively address the challenging tasks. Specifically, we first employ a vision-guided linguistic attention module to perceive the keywords corresponding to each image region. Then, language-guided visual attention adopts the learned adaptive language to guide the update of the visual features. Together, they form a bidirectional cross-modal attention module (BCAM) to achieve the mutual guidance between language and vision. They can help the network align the cross-modal features better. Based on the vanilla language-guided visual attention, we further design an asymmetric language-guided visual attention, which significantly reduces the computational cost by modeling the relationship between each pixel and each pooled subregion. In addition, a segmentation-guided bottom-up augmentation module (SBAM) is utilized to selectively combine multilevel information flow for object localization. Experiments show that our method outperforms other state-of-the-art methods on three referring image localization datasets and four referring image segmentation datasets.
引用
收藏
页码:2246 / 2258
页数:13
相关论文
共 50 条
  • [21] Image Segmentation With Language Referring Expression and Comprehension
    Sun, Jiaxing
    Li, Yujie
    Cai, Jintong
    Lu, Huimin
    Serikawa, Seiichi
    IEEE SENSORS JOURNAL, 2022, 22 (18) : 17406 - 17413
  • [22] Video-Instrument Synergistic Network for Referring Video Instrument Segmentation in Robotic Surgery
    Wang, Hongqiu
    Yang, Guang
    Zhang, Shichen
    Qin, Jing
    Guo, Yike
    Xu, Bo
    Jin, Yueming
    Zhu, Lei
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2024, 43 (12) : 4457 - 4469
  • [23] Tri-Path Backbone Network for Image Manipulation Localization
    Song, H.
    Lin, Baichuan
    Ye, D.
    IEEE ACCESS, 2024, 12 : 83217 - 83227
  • [24] Multiscale Attention Network for Detection and Localization of Image Splicing Forgery
    Xu, Yanzhi
    Irfan, Muhammad
    Fang, Aiqing
    Zheng, Jiangbin
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [25] BiFNet: Bidirectional Fusion Network for Road Segmentation
    Li, Haoran
    Chen, Yaran
    Zhang, Qichao
    Zhao, Dongbin
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (09) : 8617 - 8628
  • [26] Multitask Semantic Boundary Awareness Network for Remote Sensing Image Segmentation
    Li, Aijin
    Jiao, Licheng
    Zhu, Hao
    Li, Lingling
    Liu, Fang
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [27] Boundary-Aware Gradient Operator Network for Medical Image Segmentation
    Yu, Li
    Min, Wenwen
    Wang, Shunfang
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2024, 28 (08) : 4711 - 4723
  • [28] Guided Filter Network for Semantic Image Segmentation
    Zhang, Xiang
    Zhao, Wanqing
    Zhang, Wei
    Peng, Jinye
    Fan, Jianping
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 2695 - 2709
  • [29] An Efficient and Rapid Medical Image Segmentation Network
    Su, Diwei
    Luo, Jianxu
    Fei, Cheng
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2024, 28 (05) : 2979 - 2990
  • [30] A Supervised Segmentation Network for Hyperspectral Image Classification
    Sun, Hao
    Zheng, Xiangtao
    Lu, Xiaoqiang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 2810 - 2825