A Fine-Grained Network for Joint Multimodal Entity-Relation Extraction

被引:0
|
作者
Yuan, Li [1 ]
Cai, Yi [1 ]
Xu, Jingyu [1 ]
Li, Qing [2 ]
Wang, Tao [3 ]
机构
[1] South China Univ Technol, Minist Educ, Sch Software Engn, Key Lab Big Data & Intelligent Robot, Guangzhou 510641, Peoples R China
[2] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Peoples R China
[3] Kings Coll London, Dept Biostat & Hlth Informat, London WC2R 2LS, England
基金
中国国家自然科学基金;
关键词
Visualization; Data mining; Contrastive learning; Feature extraction; Brain modeling; Tagging; Semantics; Predictive models; Pipelines; Logic gates; entity-relation extraction; multimodal; pre-trained model; RECOGNITION;
D O I
10.1109/TKDE.2024.3485107
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Joint multimodal entity-relation extraction (JMERE) is a challenging task that involves two joint subtasks, i.e., named entity recognition and relation extraction, from multimodal data such as text sentences with associated images. Previous JMERE methods have primarily employed 1) pipeline models, which apply pre-trained unimodal models separately and ignore the interaction between tasks, or 2) word-pair relation tagging methods, which neglect neighboring word pairs. To address these limitations, we propose a fine-grained network for JMERE. Specifically, we introduce a fine-grained alignment module that utilizes a phrase-patch to establish connections between text phrases and visual objects. This module can learn consistent multimodal representations from multimodal data. Furthermore, we address the task-irrelevant image information issue by proposing a gate fusion module, which mitigates the impact of image noise and ensures a balanced representation between image objects and text representations. Furthermore, we design a multi-word decoder that enables ensemble prediction of tags for each word pair. This approach leverages the predicted results of neighboring word pairs, improving the ability to extract multi-word entities. Evaluation results from a series of experiments demonstrate the superiority of our proposed model over state-of-the-art models in JMERE.
引用
收藏
页码:1 / 14
页数:14
相关论文
共 50 条
  • [1] A knowledge-enhanced network for joint multimodal entity-relation extraction
    Huang, Shubin
    Cai, Yi
    Yuan, Li
    Wang, Jiexin
    INFORMATION PROCESSING & MANAGEMENT, 2025, 62 (03)
  • [2] Span-Based Fine-Grained Entity-Relation Extraction via Sub-Prompts Combination
    Yu, Ning
    Liu, Jianyi
    Shi, Yu
    APPLIED SCIENCES-BASEL, 2023, 13 (02):
  • [3] A region-based hypergraph network for joint entity-relation extraction
    Wan, Qian
    Wei, Luona
    Chen, Xinhai
    Liu, Jie
    KNOWLEDGE-BASED SYSTEMS, 2021, 228
  • [4] Fine-grained Multimodal Entity Linking for Videos
    Zhao H.-Q.
    Wang X.-W.
    Li J.-L.
    Li Z.-X.
    Xiao Y.-H.
    Ruan Jian Xue Bao/Journal of Software, 2024, 35 (03): : 1140 - 1153
  • [5] Joint multimodal entity-relation extraction based on temporal enhancement and similarity-gated attention
    Wang, Guoxiang
    Liu, Jin
    Xie, Jialong
    Zhu, Zhenwei
    Zhou, Fengyu
    KNOWLEDGE-BASED SYSTEMS, 2024, 304
  • [6] Joint Multimodal Entity-Relation Extraction Based on Edge-Enhanced Graph Alignment Network andWord-Pair Relation Tagging
    Yuan, Li
    Cai, Yi
    Wang, Jin
    Li, Qing
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 9, 2023, : 11051 - 11059
  • [7] A Two-Phase Paradigm for Joint Entity-Relation Extraction
    Ji, Bin
    Xu, Hao
    Yu, Jie
    Li, Shasha
    Ma, Jun
    Ji, Yuke
    Liu, Huijun
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 74 (01): : 1303 - 1318
  • [8] Fine-Grained Temporal Relation Extraction
    Vashishtha, Siddharth
    Van Durme, Benjamin
    White, Aaron Steven
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 2906 - 2919
  • [9] Utber: Utilizing Fine-Grained Entity Types to Relation Extraction with Distant Supervision
    Wu, Chengmin
    Chen, Lei
    2020 IEEE INTERNATIONAL CONFERENCE ON SMART DATA SERVICES (SMDS 2020), 2020, : 63 - 71
  • [10] Fine-Grained Relation Extraction for Drug Instructions Using Contrastive Entity Enhancement
    Gao, Feng
    Song, Xuren
    Gu, Jinguang
    Zhang, Lihua
    Liu, Yun
    Zhang, Xiaoliang
    Liu, Yu
    Jing, Shenqi
    IEEE ACCESS, 2023, 11 : 51777 - 51788