A French Corpus and Annotation Schema for Named Entity Recognition and Relation Extraction of Financial News

被引:0
作者
Jabbari, Ali [1 ]
Sauvage, Olivier [1 ]
Zeine, Hamada [1 ]
Chergui, Hamza [1 ]
机构
[1] Skaizen Grp, Paris, France
来源
PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020) | 2020年
关键词
Annotation Schema; Named Entity Recognition; Relation Extraction;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In financial services industry, compliance involves a series of practices and controls in order to meet key regulatory standards which aim to reduce financial risk and crime, e.g. money laundering and financing of terrorism. Faced with the growing risks, it is imperative for financial institutions to seek automated information extraction techniques for monitoring financial activities of their customers. This work describes an ontology of compliance-related concepts and relationships along with a corpus annotated according to it. The presented corpus consists of financial news articles in French and allows for training and evaluating domain-specific named entity recognition and relation extraction algorithms. We present some of our experimental results on named entity recognition and relation extraction using our annotated corpus. We aim to furthermore use the the proposed ontology towards construction of a knowledge base of financial relations.
引用
收藏
页码:2293 / 2299
页数:7
相关论文
共 50 条
[41]   A Method for Building a Labeled Named Entity Recognition Corpus Using Ontologies [J].
Ngoc-Trinh Vu ;
Van-Hien Tran ;
Thi-Huyen-Trang Doan ;
Hoang-Quynh Le ;
Mai-Vu Tran .
ADVANCED COMPUTATIONAL METHODS FOR KNOWLEDGE ENGINEERING, 2015, 358 :141-149
[42]   Enhanced Named Entity Recognition algorithm for financial document verification [J].
Ahmet Toprak ;
Metin Turan .
The Journal of Supercomputing, 2023, 79 :19431-19451
[43]   Wojood: Nested Arabic Named Entity Corpus and Recognition using BERT [J].
Jarrar, Mustafa ;
Khalilia, Mohammed ;
Ghanem, Sana .
LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, :3626-3636
[44]   Fine-tuned encoder models with data augmentation beat ChatGPT in agricultural named entity recognition and relation extraction [J].
De, Sayan ;
Sanyal, Debarshi Kumar ;
Mukherjee, Imon .
EXPERT SYSTEMS WITH APPLICATIONS, 2025, 277
[45]   An Arabic Dataset for Disease Named Entity Recognition with Multi-Annotation Schemes [J].
Alshammari, Nasser ;
Alanazi, Saad .
DATA, 2020, 5 (03) :1-8
[46]   Named entity recognition for Psychological domain: challenges in document annotation for the Arabic Language [J].
Lakel, Kheira ;
Bendella, Fatima ;
Benkhadda, Samira .
PROCEEDINGS OF 2017 FIRST INTERNATIONAL CONFERENCE ON EMBEDDED & DISTRIBUTED SYSTEMS (EDIS 2017), 2017, :39-43
[47]   MER: a shell script and annotation server for minimal named entity recognition and linking [J].
Couto, Francisco M. ;
Lamurias, Andre .
JOURNAL OF CHEMINFORMATICS, 2018, 10
[48]   Optimizing Relation Extraction Based on the Type Tag of Named Entity [J].
Zhang, Yixing ;
Zhang, Yangsen ;
Huang, Gaijuan ;
Guo, Zhengbin .
CHINESE LEXICAL SEMANTICS, CLSW 2018, 2018, 11173 :532-541
[49]   Named Entity Recognition via Unified Information Extraction Framework [J].
Chen, Xinyue ;
Zhang, Zhenguo ;
Lu, Xinghua .
2024 4TH INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND ARTIFICIAL INTELLIGENCE, CCAI 2024, 2024, :308-313
[50]   MER: a shell script and annotation server for minimal named entity recognition and linking [J].
Francisco M. Couto ;
Andre Lamurias .
Journal of Cheminformatics, 10