Fine-grained multimodal named entity recognition with heterogeneous image-text similarity graphs

被引:1
|
作者
Wang, Yongpeng [1 ]
Jiang, Chunmao [1 ]
机构
[1] Fujian Univ Technol, Sch Comp Sci & Math, Fuzhou, Peoples R China
关键词
Image-text similarity graph; Semantic correlations; Graph convolutional networks; Gated aggregation modules; NEURAL MACHINE TRANSLATION;
D O I
10.1007/s13042-024-02398-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multimodal Named Entity Recognition (MNER) leverages semantic information from multiple modalities to enhance the identification and classification of named entities in text. Effective MNER requires a thorough understanding of the intricate semantic correlations across different modalities. However, existing MNER methods often overlook fine-grained correlation information between textual and visual modalities, resulting in a loss of crucial semantic details for accurate entity recognition. We propose a novel Similarity Multimodal Reinforcement Graph (SMRG) framework for MNER to address this issue. SMRG quantifies the relevance between words and grid-level image regions to establish a nuanced heterogeneous image-text similarity graph. By leveraging the feature propagation capabilities of graph convolutional networks, SMRG captures rich semantic relationships across modalities. Moreover, SMRG employs gated aggregation modules to selectively integrate visual semantics with corresponding textual representations, thereby enhancing the expressiveness of text features for MNER. Extensive experiments on two benchmark Twitter datasets demonstrate the superiority of SMRG over state-of-the-art methods in self-domain and cross-domain scenarios.
引用
收藏
页码:2401 / 2415
页数:15
相关论文
共 50 条
  • [1] Fine-Grained Multimodal Named Entity Recognition and Grounding with a Generative Framework
    Wang, Jieming
    Li, Ziyan
    Yu, Jianfei
    Yang, Li
    Xia, Rui
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 3934 - 3943
  • [2] Fine-Grained Named Entity Recognition for Sinhala
    Azeez, Rameela
    Ranathunga, Surangika
    MERCON 2020: 6TH INTERNATIONAL MULTIDISCIPLINARY MORATUWA ENGINEERING RESEARCH CONFERENCE (MERCON), 2020, : 295 - 300
  • [3] Fine-grained Named Entity Recognition for Turkish
    Khudoyberdieva, Lola
    Diri, Banu
    32ND IEEE SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU 2024, 2024,
  • [4] Multilingual Fine-Grained Named Entity Recognition
    Lupancu, Viorica-Camelia
    Iftene, Adrian
    COMPUTER SCIENCE JOURNAL OF MOLDOVA, 2023, 31 (03) : 321 - 339
  • [5] Fine-grained Dutch named entity recognition
    Desmet, Bart
    Hoste, Veronique
    LANGUAGE RESOURCES AND EVALUATION, 2014, 48 (02) : 307 - 343
  • [6] Fine-grained Dutch named entity recognition
    Bart Desmet
    Véronique Hoste
    Language Resources and Evaluation, 2014, 48 : 307 - 343
  • [7] Multimodal fine-grained grocery product recognition using image and OCR text
    Pettersson, Tobias
    Riveiro, Maria
    Lofstrom, Tuwe
    MACHINE VISION AND APPLICATIONS, 2024, 35 (04)
  • [8] Chinese Fine-Grained Geological Named Entity Recognition With Rules and FLAT
    Chen, Siying
    Hua, Weihua
    Liu, Xiuguo
    Deng, Xiaotong
    Zeng, Xinling
    Duan, Jianchao
    EARTH AND SPACE SCIENCE, 2022, 9 (12)
  • [9] Fine-Grained Multimodal DeepFake Classification via Heterogeneous Graphs
    Yin, Qilin
    Lu, Wei
    Cao, Xiaochun
    Luo, Xiangyang
    Zhou, Yicong
    Huang, Jiwu
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (11) : 5255 - 5269
  • [10] Collaborative fine-grained interaction learning for image-text sentiment analysis
    Xiao, Xingwang
    Pu, Yuanyuan
    Zhou, Dongming
    Cao, Jinde
    Gu, Jinjing
    Zhao, Zhengpeng
    Xu, Dan
    KNOWLEDGE-BASED SYSTEMS, 2023, 279