Fine-grained multimodal named entity recognition with heterogeneous image-text similarity graphs

被引:1
|
作者
Wang, Yongpeng [1 ]
Jiang, Chunmao [1 ]
机构
[1] Fujian Univ Technol, Sch Comp Sci & Math, Fuzhou, Peoples R China
关键词
Image-text similarity graph; Semantic correlations; Graph convolutional networks; Gated aggregation modules; NEURAL MACHINE TRANSLATION;
D O I
10.1007/s13042-024-02398-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multimodal Named Entity Recognition (MNER) leverages semantic information from multiple modalities to enhance the identification and classification of named entities in text. Effective MNER requires a thorough understanding of the intricate semantic correlations across different modalities. However, existing MNER methods often overlook fine-grained correlation information between textual and visual modalities, resulting in a loss of crucial semantic details for accurate entity recognition. We propose a novel Similarity Multimodal Reinforcement Graph (SMRG) framework for MNER to address this issue. SMRG quantifies the relevance between words and grid-level image regions to establish a nuanced heterogeneous image-text similarity graph. By leveraging the feature propagation capabilities of graph convolutional networks, SMRG captures rich semantic relationships across modalities. Moreover, SMRG employs gated aggregation modules to selectively integrate visual semantics with corresponding textual representations, thereby enhancing the expressiveness of text features for MNER. Extensive experiments on two benchmark Twitter datasets demonstrate the superiority of SMRG over state-of-the-art methods in self-domain and cross-domain scenarios.
引用
收藏
页码:2401 / 2415
页数:15
相关论文
共 50 条
  • [31] Fine-grained Named Entity Annotations for German Biographic Interviews
    Ruppenhofer, Josef
    Rehbein, Ines
    Flinz, Carolina
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 4605 - 4614
  • [32] Enhancing Low-resource Fine-grained Named Entity Recognition by Leveraging Coarse-grained Datasets
    Lee, Su Ah
    Oh, Seokjin
    Jung, Woohwan
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 3269 - 3279
  • [33] Medical Image Synthesis via Fine-Grained Image-Text Alignment and Anatomy-Pathology Prompting
    Chen, Wenting
    Wang, Pengyu
    Ren, Hui
    Sun, Lichao
    Li, Quanzheng
    Yuan, Yixuan
    Li, Xiang
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT XII, 2024, 15012 : 240 - 250
  • [34] Fine-Grained Grounding for Multimodal Speech Recognition
    Srinivasan, Tejas
    Sanabria, Ramon
    Metze, Florian
    Elliott, Desmond
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 2667 - 2677
  • [35] A German Corpus for Fine-Grained Named Entity Recognition and Relation Extraction of Traffic and Industry Events
    Schiersch, Martin
    Mironova, Veselina
    Schmitt, Maximilian
    Thomas, Philippe
    Gabryszak, Aleksandra
    Hennig, Leonhard
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 4437 - 4444
  • [36] Short Text Entity Linking with Fine-grained Topics
    Chen, Lihan
    Liang, Jiaqing
    Xie, Chenhao
    Xiao, Yanghua
    CIKM'18: PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2018, : 457 - 466
  • [37] Embeddings of Label Components for Sequence Labeling: A Case Study of Fine-grained Named Entity Recognition
    Kato, Takuma
    Abe, Kaori
    Ouchi, Hiroki
    Miyawaki, Shumpei
    Suzuki, Jun
    Inui, Kentaro
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020): STUDENT RESEARCH WORKSHOP, 2020, : 222 - 229
  • [38] CHEMNER: Fine-Grained Chemistry Named Entity Recognition with Ontology-Guided Distant Supervision
    Wang, Xuan
    Hu, Vivian
    Song, Xiangchen
    Garg, Shweta
    Xiao, Jinfeng
    Han, Jiawei
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 5227 - 5240
  • [39] Enhancing Legal Named Entity Recognition Using RoBERTa-GCN with CRF: A Nuanced Approach for Fine-Grained Entity Recognition
    Jain, Arihant
    Sharma, Raksha
    ADVANCES IN INFORMATION RETRIEVAL, ECIR 2024, PT III, 2024, 14610 : 261 - 267
  • [40] Towards Fast and Accurate Image-Text Retrieval With Self-Supervised Fine-Grained Alignment
    Zhuang, Jiamin
    Yu, Jing
    Ding, Yang
    Qu, Xiangyan
    Hu, Yue
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 1361 - 1372