Multimodal weighted graph representation for information extraction from visually rich documents

Cited by: 2
Authors
Gbada, Hamza [1, 2]
Kalti, Karim [2, 3]
Mahjoub, Mohamed Ali [2 ]
Affiliations
[1] Univ Sousse, Higher Inst Informat & Commun Technol, Sousse, Tunisia
[2] Natl Engn Sch Sousse ENISo, Lab Adv Technol & Intelligent Syst LATIS, Sousse, Tunisia
[3] Univ Monastir, Fac Sci Monastir, Monastir, Tunisia
Keywords
Information extraction; Visually rich documents; Graph convolutional networks
DOI
10.1016/j.neucom.2023.127223
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper introduces a novel system for information extraction from visually rich documents (VRDs) using a weighted graph representation. The proposed method aims to improve information extraction performance by capturing the relationships between the components of a VRD. Each VRD is modeled as a weighted graph in which nodes encode the visual, textual, and spatial features of text regions and edges represent the relationships between neighboring text regions. Information extraction is then cast as a node classification task: the VRD graphs are fed into a graph convolutional network. The approach is evaluated on diverse documents, including invoices and receipts, and achieves results equal to or better than strong baselines.
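As a rough illustration of the pipeline the abstract describes, the sketch below builds a weighted graph over text regions and applies one graph-convolution step to score each node. It is not the authors' implementation. The nearest-neighbor rule, the inverse-distance edge weight, the toy feature vectors, the weight matrix, and all function names are assumptions made for this example; the abstract does not say how edges are weighted or how features are computed.

```python
import math


def build_weighted_adjacency(centers, k=2):
    """Connect each text region to its k nearest neighbors.

    Edge weight = 1 / (1 + Euclidean distance). This weighting is an
    assumption for illustration, not the paper's formula.
    """
    n = len(centers)
    adj = [[0.0] * n for _ in range(n)]
    for i in range(n):
        dists = sorted(
            (math.dist(centers[i], centers[j]), j) for j in range(n) if j != i
        )
        for d, j in dists[:k]:
            w = 1.0 / (1.0 + d)
            adj[i][j] = w
            adj[j][i] = w  # keep the graph undirected
    return adj


def normalize(adj):
    """Symmetric normalization D^-1/2 (A + I) D^-1/2, as in a standard GCN."""
    n = len(adj)
    a_hat = [
        [adj[i][j] + (1.0 if i == j else 0.0) for j in range(n)]
        for i in range(n)
    ]
    deg = [sum(row) for row in a_hat]
    return [
        [a_hat[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(n)]
        for i in range(n)
    ]


def matmul(a, b):
    """Plain-Python matrix product of two list-of-lists matrices."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def gcn_layer(norm_adj, features, weights):
    """One propagation step: ReLU(A_norm @ X @ W)."""
    out = matmul(matmul(norm_adj, features), weights)
    return [[max(0.0, v) for v in row] for row in out]


# Toy document: three text regions given by their center coordinates.
centers = [(0.0, 0.0), (1.0, 0.0), (5.0, 5.0)]
# Hypothetical 2-dimensional node features (stand-ins for fused
# visual/textual/spatial descriptors).
features = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
# Hypothetical, untrained weights mapping 2 features to 2 class scores.
weights = [[0.5, -0.2], [0.1, 0.3]]

adj = build_weighted_adjacency(centers, k=1)
scores = gcn_layer(normalize(adj), features, weights)
predicted = [row.index(max(row)) for row in scores]  # one label per text region
```

In a real system the weights would be learned from labeled documents and several layers would typically be stacked; the single untrained layer here only shows how weighted edges mix information between neighboring regions before each node is classified.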
Pages: 9