A Blackboard Model for Flexible and Parallel Text Annotation

被引:0
作者
Ocana, Marc Gallofre [1 ]
Opdahl, Andreas L. [1 ]
机构
[1] Univ Bergen, Dept Informat Sci & Media Studies, N-5004 Bergen, Vestland, Norway
关键词
Annotations; Task analysis; Vocabulary; Unified modeling language; Transformers; Knowledge graphs; Big Data; Deep learning; Semantics; Natural language processing; Information retrieval; Blackboard model; knowledge graph; big data; deep learning; semantic technologies; natural language processing; information extraction; KNOWLEDGE;
D O I
10.1109/ACCESS.2024.3369409
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Creating rich semantic text annotations is a complex process that involves combining multiple natural-language annotation approaches. This annotation process is often approached sequentially and includes pre-processing steps and techniques that build on the outputs of others. However, combining them is not trivial, because some annotation approaches comprise chains of steps or build on other already pre-existing annotations, some pre-processing steps may be common to several techniques, and many newer techniques are even end-to-end which have diluted the need for specific pre-processing steps. Yet it can be beneficial to combine the different approaches because they solve different annotation problems and, even when they solve the same problem, they may have complementary strengths. Whereas existing works often approach the annotation process sequentially, we argue that it can instead be implemented as a partly sequential, partly parallel and concurrent collaboration between independent components. The Blackboard Model is a long-established problem-solving paradigm that deals with complex problems where multiple knowledge sources contribute independently towards the solution. In this work, we study the feasibility of the Blackboard Model for creating rich semantic annotations from text as part of a larger big-data-ready AI system for supporting journalists and newsrooms.
引用
收藏
页码:30507 / 30517
页数:11
相关论文
共 50 条
[31]   SHORT TEXT SEMANTIC MATCHING MODEL BASED ON BERT AND ADVERSARIAL NETWORK [J].
Zhang, Tao ;
Zhang, Zhe ;
Li, Xiang ;
Wu, Yulin ;
Peng, Bo ;
Qian, Yurong ;
Ma, Mengnan ;
Leng, Hongyong .
2022 INTERNATIONAL CONFERENCE ON NETWORKING AND NETWORK APPLICATIONS, NANA, 2022, :397-401
[32]   Automated Social Text Annotation With Joint Multilabel Attention Networks [J].
Dong, Hang ;
Wang, Wei ;
Huang, Kaizhu ;
Coenen, Frans .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (05) :2224-2238
[33]   A survey of text classification based on pre-trained language model [J].
Wu, Yujia ;
Wan, Jun .
NEUROCOMPUTING, 2025, 616
[34]   Ensemble-NQG-T5: Ensemble Neural Question Generation Model Based on Text-to-Text Transfer Transformer [J].
Hwang, Myeong-Ha ;
Shin, Jikang ;
Seo, Hojin ;
Im, Jeong-Seon ;
Cho, Hee ;
Lee, Chun-Kwon .
APPLIED SCIENCES-BASEL, 2023, 13 (02)
[35]   Academic News Text Classification Model Based on Attention Mechanism and RCNN [J].
Lin, Ronghua ;
Fu, Chengzhou ;
Mao, Chengjie ;
Wei, Jingmin ;
Li, Jianguo .
COMPUTER SUPPORTED COOPERATIVE WORK AND SOCIAL COMPUTING, CHINESECSCW 2018, 2019, 917 :507-516
[36]   A Novel Deep Learning-Based Multilevel Parallel Attention Neural (MPAN) Model for Multidomain Arabic Sentiment Analysis [J].
El-Affendi, Mohammed A. ;
Alrajhi, Khawla ;
Hussain, Amir .
IEEE ACCESS, 2021, 9 :7508-7518
[37]   A Methodology for Generated Text Annotation for High Quality Speech Synthesis [J].
Spiliotopoulos, Dimitris ;
Vassilakis, Costas ;
Margaris, Dionisis ;
Kotis, Kostantinos .
2019 10TH INTERNATIONAL CONFERENCE ON INFORMATION, INTELLIGENCE, SYSTEMS AND APPLICATIONS (IISA), 2019, :371-378
[38]   PATC: Parallel Arabic Text Classifier [J].
Alshahrani, Mona ;
Alkhalifa, Shurug .
2018 21ST SAUDI COMPUTER SOCIETY NATIONAL COMPUTER CONFERENCE (NCC), 2018,
[39]   A Parallel Platform for Web Text Mining [J].
Ping Lu ;
Zhenjiang Dong ;
Shengmei Luo ;
Lixia Liu ;
Shanshan Guan ;
Shengyu Liu ;
Qingcai Chen .
ZTECommunications, 2013, 11 (03) :56-61
[40]   Collaborative text-annotation resource for disease-centered relation extraction from biomedical text [J].
Cano, C. ;
Monaghan, T. ;
Blanco, A. ;
Wall, D. P. ;
Peshkin, L. .
JOURNAL OF BIOMEDICAL INFORMATICS, 2009, 42 (05) :967-977