Fine-grained multimodal named entity recognition with heterogeneous image-text similarity graphs

被引:1
|
作者
Wang, Yongpeng [1 ]
Jiang, Chunmao [1 ]
机构
[1] Fujian Univ Technol, Sch Comp Sci & Math, Fuzhou, Peoples R China
关键词
Image-text similarity graph; Semantic correlations; Graph convolutional networks; Gated aggregation modules; NEURAL MACHINE TRANSLATION;
D O I
10.1007/s13042-024-02398-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multimodal Named Entity Recognition (MNER) leverages semantic information from multiple modalities to enhance the identification and classification of named entities in text. Effective MNER requires a thorough understanding of the intricate semantic correlations across different modalities. However, existing MNER methods often overlook fine-grained correlation information between textual and visual modalities, resulting in a loss of crucial semantic details for accurate entity recognition. We propose a novel Similarity Multimodal Reinforcement Graph (SMRG) framework for MNER to address this issue. SMRG quantifies the relevance between words and grid-level image regions to establish a nuanced heterogeneous image-text similarity graph. By leveraging the feature propagation capabilities of graph convolutional networks, SMRG captures rich semantic relationships across modalities. Moreover, SMRG employs gated aggregation modules to selectively integrate visual semantics with corresponding textual representations, thereby enhancing the expressiveness of text features for MNER. Extensive experiments on two benchmark Twitter datasets demonstrate the superiority of SMRG over state-of-the-art methods in self-domain and cross-domain scenarios.
引用
收藏
页码:2401 / 2415
页数:15
相关论文
共 50 条
  • [21] Fine-grained Multimodal Entity Linking for Videos
    Zhao H.-Q.
    Wang X.-W.
    Li J.-L.
    Li Z.-X.
    Xiao Y.-H.
    Ruan Jian Xue Bao/Journal of Software, 2024, 35 (03): : 1140 - 1153
  • [22] Fine-Grained Named Entity Recognition with Distant Supervision in COVID-19 Literature
    Wang, Xuan
    Song, Xiangchen
    Li, Bangzheng
    Zhou, Kang
    Li, Qi
    Han, Jiawei
    2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 491 - 494
  • [23] Fine-grained Named Entity Recognition using Conditional Random Fields for Question Answering
    Lee, Changki
    Hwang, Yi-Gyu
    Oh, Hyo-Jung
    Lim, Soojong
    Heo, Jeong
    Lee, Chung-Hee
    Kim, Hyeon-Jin
    Wang, Ji-Hyun
    Jang, Myung-Gil
    INFORMATION RETRIEVAL TECHNOLOGY, PROCEEDINGS, 2006, 4182 : 581 - 587
  • [24] Text-Image Scene Graph Fusion for Multimodal Named Entity Recognition
    Cheng J.
    Long K.
    Zhang S.
    Zhang T.
    Ma L.
    Cheng S.
    Guo Y.
    IEEE Transactions on Artificial Intelligence, 2024, 5 (06): : 2828 - 2839
  • [25] Few-shot Named Entity Recognition Based on Fine-grained Prototypical Network
    Qi, Rong-Zhi
    Zhou, Jun-Yu
    Li, Shui-Yan
    Mao, Ying-Chi
    Ruan Jian Xue Bao/Journal of Software, 2024, 35 (10): : 4751 - 4765
  • [26] Learning Relationship-Enhanced Semantic Graph for Fine-Grained Image-Text Matching
    Liu, Xin
    He, Yi
    Cheung, Yiu-Ming
    Xu, Xing
    Wang, Nannan
    IEEE TRANSACTIONS ON CYBERNETICS, 2024, 54 (02) : 948 - 961
  • [27] Fine-grained Image-text Matching by Cross-modal Hard Aligning Network
    Pan, Zhengxin
    Wu, Fangyu
    Zhang, Bailing
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 19275 - 19284
  • [28] ALADIN: Distilling Fine-grained Alignment Scores for Efficient Image-Text Matching and Retrieval
    Messina, Nicola
    Stefanini, Matteo
    Cornia, Marcella
    Baraldi, Lorenzo
    Falchi, Fabrizio
    Amato, Giuseppe
    Cucchiara, Rita
    19TH INTERNATIONAL CONFERENCE ON CONTENT-BASED MULTIMEDIA INDEXING, CBMI 2022, 2022, : 64 - 70
  • [29] Fine-Grained Bidirectional Attention-Based Generative Networks for Image-Text Matching
    Li, Zhixin
    Zhu, Jianwei
    Wei, Jiahui
    Zeng, Yufei
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT III, 2023, 13715 : 390 - 406
  • [30] Fine-Grained Named Entity Classification with Wikipedia Article Vectors
    Suzuki, Masatoshi
    Matsuda, Koji
    Sekine, Satoshi
    Okazaki, Naoaki
    Inui, Kentaro
    2016 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2016), 2016, : 483 - 486