Fine-grained multimodal named entity recognition with heterogeneous image-text similarity graphs

被引:1
|
作者
Wang, Yongpeng [1 ]
Jiang, Chunmao [1 ]
机构
[1] Fujian Univ Technol, Sch Comp Sci & Math, Fuzhou, Peoples R China
关键词
Image-text similarity graph; Semantic correlations; Graph convolutional networks; Gated aggregation modules; NEURAL MACHINE TRANSLATION;
D O I
10.1007/s13042-024-02398-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multimodal Named Entity Recognition (MNER) leverages semantic information from multiple modalities to enhance the identification and classification of named entities in text. Effective MNER requires a thorough understanding of the intricate semantic correlations across different modalities. However, existing MNER methods often overlook fine-grained correlation information between textual and visual modalities, resulting in a loss of crucial semantic details for accurate entity recognition. We propose a novel Similarity Multimodal Reinforcement Graph (SMRG) framework for MNER to address this issue. SMRG quantifies the relevance between words and grid-level image regions to establish a nuanced heterogeneous image-text similarity graph. By leveraging the feature propagation capabilities of graph convolutional networks, SMRG captures rich semantic relationships across modalities. Moreover, SMRG employs gated aggregation modules to selectively integrate visual semantics with corresponding textual representations, thereby enhancing the expressiveness of text features for MNER. Extensive experiments on two benchmark Twitter datasets demonstrate the superiority of SMRG over state-of-the-art methods in self-domain and cross-domain scenarios.
引用
收藏
页码:2401 / 2415
页数:15
相关论文
共 50 条
  • [41] Memorize, Associate and Match: Embedding Enhancement via Fine-Grained Alignment for Image-Text Retrieval
    Li, Jiangtong
    Liu, Liu
    Niu, Li
    Zhang, Liqing
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 (30) : 9193 - 9207
  • [42] Multi-level network based on transformer encoder for fine-grained image-text matching
    Yang, Lei
    Feng, Yong
    Zhou, Mingliang
    Xiong, Xiancai
    Wang, Yongheng
    Qiang, Baohua
    MULTIMEDIA SYSTEMS, 2023, 29 (04) : 1981 - 1994
  • [43] VSR plus plus : Improving Visual Semantic Reasoning for Fine-Grained Image-Text Matching
    Yuan, Hui
    Huang, Yan
    Zhang, Dongbo
    Chen, Zerui
    Cheng, Wenlong
    Wang, Liang
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 3728 - 3735
  • [44] Multimodal Named Entity Recognition with Image Attributes and Image Knowledge
    Chen, Dawei
    Li, Zhixu
    Gu, Binbin
    Chen, Zhigang
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2021), PT II, 2021, 12682 : 186 - 201
  • [45] Towards Fast and Accurate Image-Text Retrieval with Self-Supervised Fine-Grained Alignment
    Zhuang, Jiamin
    Yu, Jing
    Ding, Yang
    Qu, Xiangyan
    Hu, Yue
    arXiv, 2023,
  • [46] Multimodal Fake News Analysis Based on Image-Text Similarity
    Zhang, Xichen
    Dadkhah, Sajjad
    Weismann, Alexander Gerald
    Kanaani, Mohammad Amin
    Ghorbani, Ali A.
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (01) : 959 - 972
  • [47] NumER: A Fine-Grained Numeral Entity Recognition Dataset
    Julavanich, Thanakrit
    Aizawa, Akiko
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS (NLDB 2021), 2021, 12801 : 64 - 75
  • [48] Enhancing fine-grained geographic named entity recognition by Multi-scale Siamese Reconstruction Network
    Huang, Guanhua
    Gao, Bofei
    Chen, Jiaze
    Zhang, Yuchen
    Yang, Zhouwang
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 148
  • [49] MultiCoNER v2: a Large Multilingual dataset for Fine-grained and Noisy Named Entity Recognition
    Fetahu, Besnik
    Chen, Zhiyu
    Kar, Sudipta
    Rokhlenko, Oleg
    Malmasi, Shervin
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023, : 2027 - 2051
  • [50] Distantly-Supervised Named Entity Recognition with Adaptive Teacher Learning and Fine-Grained Student Ensemble
    Qu, Xiaoye
    Zeng, Jun
    Liu, Daizong
    Wang, Zhefeng
    Huai, Baoxing
    Zhou, Pan
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 11, 2023, : 13501 - 13509