Multimodal weighted graph representation for information extraction from visually rich documents

被引:5
作者
Gbada, Hamza [1 ,2 ]
Kalti, Karim [2 ,3 ]
Mahjoub, Mohamed Ali [2 ]
机构
[1] Univ Sousse, Higher Inst Informat & Commun Technol, Sousse, Tunisia
[2] Natl Engn Sch Sousse ENISo, Lab Adv Technol & Intelligent Syst LATIS, Sousse, Tunisia
[3] Univ Monastir, Fac Sci Monastir, Monastir, Tunisia
关键词
Information extraction; Visually rich documents; Graph convolutional net works;
D O I
10.1016/j.neucom.2023.127223
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper introduces a novel system for information extraction from visually rich documents (VRD) using a weighted graph representation. The proposed method aims to improve the performance of the information extraction task by capturing the relationships between various VRD components. The VRD is modeled as a weighted graph, in which visual, textual, and spatial features of text regions are encoded in nodes and edges representing the relationships between neighboring text regions. The information extraction task from VRD is performed as a node classification task through the use of a graph convolutional networks, where the VRD graphs are fed into the network. The approach is evaluated across diverse documents, encompassing invoices and receipts, revealing achievement levels equal to or surpassing robust baselines.
引用
收藏
页数:9
相关论文
共 50 条
[31]   Towards a System for Ontology-Based Information Extraction from PDF Documents [J].
Oro, Ermelinda ;
Ruffolo, Massimo .
ON THE MOVE TO MEANINGFUL INTERNET SYSTEMS: OTM 2008, PT II, PROCEEDINGS, 2008, 5332 :1482-1499
[32]   Information Extraction from Scanned Invoice Documents Using Deep Learning Methods [J].
Avci, Ufuk Ilke ;
Goularas, Dionysis ;
Korkmaz, Emin Erkan ;
Deveci, Baris .
2024 IEEE THIRTEENTH INTERNATIONAL CONFERENCE ON IMAGE PROCESSING THEORY, TOOLS AND APPLICATIONS, IPTA 2024, 2024,
[33]   Information Extraction: Evaluating Named Entity Recognition from Classical Malay Documents [J].
Sazali, Siti Syakirah ;
Rahman, Nurazzah Abdul ;
Abu Bakar, Zainab .
2016 THIRD INTERNATIONAL CONFERENCE ON INFORMATION RETRIEVAL AND KNOWLEDGE MANAGEMENT (CAMP), 2016, :48-53
[34]   Information Extraction from Clinical Documents: Towards Disease/Disorder Template Filling [J].
Chikka, Veera Raghavendra ;
Mariyasagayam, Nestor ;
Niwa, Yoshiki ;
Karlapalem, Kamalakar .
EXPERIMENTAL IR MEETS MULTILINGUALITY, MULTIMODALITY, AND INTERACTION, 2015, 9283 :389-401
[35]   Subgraph-Induced Extraction Technique for Information (SETI) from Administrative Documents [J].
Kafle, Dipendra Sharma ;
Thomas, Eliott ;
Coustaty, Mickael ;
Joseph, Aurelie ;
Doucet, Antoine ;
DAndecy, Vincent Poulain .
DOCUMENT ANALYSIS AND RECOGNITION - ICDAR 2023 WORKSHOPS, PT II, 2023, 14194 :108-122
[36]   OmEGa(Ω): Ontology-based information extraction framework for constructing task-centric knowledge graph from manufacturing documents with large language model [J].
Shim, Midan ;
Choi, Hyojun ;
Koo, Heeyeon ;
Um, Kaehyun ;
Lee, Kyong-Ho ;
Lee, Sanghyun .
ADVANCED ENGINEERING INFORMATICS, 2025, 64
[37]   Information extraction from electronic medical documents: state of the art and future research directions [J].
Landolsi, Mohamed Yassine ;
Hlaoua, Lobna ;
Ben Romdhane, Lotfi .
KNOWLEDGE AND INFORMATION SYSTEMS, 2023, 65 (02) :463-516
[38]   Developing a mountaineering plan sharing system based on information extraction from unstructured documents [J].
Nohara, Akihiro ;
Shiramatsu, Shun ;
Ozono, Tadachika ;
Shintani, Toramatsu .
IEEJ Transactions on Electronics, Information and Systems, 2015, 135 (12) :1470-1480
[39]   Autonomous Deblurring Images and Information Extraction from Documents Using CycleGAN and Mask RCNN [J].
Hoque, Oishee Bintey ;
Rashid, Maisha Binte ;
Jawad, K. M. Tawsik .
2020 23RD INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY (ICCIT 2020), 2020,
[40]   Information extraction from electronic medical documents: state of the art and future research directions [J].
Mohamed Yassine Landolsi ;
Lobna Hlaoua ;
Lotfi Ben Romdhane .
Knowledge and Information Systems, 2023, 65 :463-516