A Blackboard Model for Flexible and Parallel Text Annotation

被引:0
作者
Ocana, Marc Gallofre [1 ]
Opdahl, Andreas L. [1 ]
机构
[1] Univ Bergen, Dept Informat Sci & Media Studies, N-5004 Bergen, Vestland, Norway
关键词
Annotations; Task analysis; Vocabulary; Unified modeling language; Transformers; Knowledge graphs; Big Data; Deep learning; Semantics; Natural language processing; Information retrieval; Blackboard model; knowledge graph; big data; deep learning; semantic technologies; natural language processing; information extraction; KNOWLEDGE;
D O I
10.1109/ACCESS.2024.3369409
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Creating rich semantic text annotations is a complex process that involves combining multiple natural-language annotation approaches. This annotation process is often approached sequentially and includes pre-processing steps and techniques that build on the outputs of others. However, combining them is not trivial, because some annotation approaches comprise chains of steps or build on other already pre-existing annotations, some pre-processing steps may be common to several techniques, and many newer techniques are even end-to-end which have diluted the need for specific pre-processing steps. Yet it can be beneficial to combine the different approaches because they solve different annotation problems and, even when they solve the same problem, they may have complementary strengths. Whereas existing works often approach the annotation process sequentially, we argue that it can instead be implemented as a partly sequential, partly parallel and concurrent collaboration between independent components. The Blackboard Model is a long-established problem-solving paradigm that deals with complex problems where multiple knowledge sources contribute independently towards the solution. In this work, we study the feasibility of the Blackboard Model for creating rich semantic annotations from text as part of a larger big-data-ready AI system for supporting journalists and newsrooms.
引用
收藏
页码:30507 / 30517
页数:11
相关论文
共 50 条
[21]   Cyberbullying Detection Model for Arabic Text Using Deep Learning [J].
Albayari, Reem ;
Abdallah, Sherief ;
Shaalan, Khaled .
JOURNAL OF INFORMATION & KNOWLEDGE MANAGEMENT, 2025, 24 (03)
[22]   Chinese Text Sentiment Analysis Model Based on BERT and BiTCN [J].
Chen, Jinlan ;
Zhang, Jian ;
Zhang, Jiajing ;
Chen, Shufeng .
2024 17TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER THEORY AND ENGINEERING, ICACTE, 2024, :127-133
[23]   sDTM: A Supervised Bayesian Deep Topic Model for Text Analytics [J].
Yang, Yi ;
Zhang, Kunpeng ;
Fan, Yangyang .
INFORMATION SYSTEMS RESEARCH, 2023, 34 (01) :137-156
[25]   Global-Local Mutual Attention Model for Text Classification [J].
Ma, Qianli ;
Yu, Liuhong ;
Tian, Shuai ;
Chen, Enhuan ;
Ng, Wing W. Y. .
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (12) :2127-2139
[26]   Enhancing Language Model Performance with a Novel Text Preprocessing Method [J].
Jalili, A. ;
Tabrizchi, H. ;
Mosavi, A. ;
Varkonyi-Koczy, A. R. .
ACTA PHYSICA POLONICA A, 2024, 146 (04) :542-552
[27]   Transformer Fault Recognition Based on Kbert Text Clustering Model [J].
Jiang C. ;
Wang Y. ;
Chen M. ;
Li C. ;
Wang Y. ;
Ma G. .
Gaodianya Jishu/High Voltage Engineering, 2022, 48 (08) :2991-3000
[28]   Stemming Algorithm for Arabic Text Using a Parallel Data Processing [J].
Bougar, Marieme ;
Ziyati, El Houssaine .
THIRD INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, 2019, 797 :261-268
[29]   Text data augmentation and pre-trained Language Model for enhancing text classification of low-resource languages [J].
Ziyaden, Atabay ;
Yelenov, Amir ;
Hajiyev, Fuad ;
Rustamov, Samir ;
Pak, Alexandr .
PEERJ COMPUTER SCIENCE, 2024, 10
[30]   An Annotation Schema for the Detection of Social Bias in Legal Text Corpora [J].
Gumusel, Ece ;
Malic, Vincent Quirante ;
Donaldson, Devan Ray ;
Ashley, Kevin ;
Liu, Xiaozhong .
INFORMATION FOR A BETTER WORLD: SHAPING THE GLOBAL FUTURE, PT I, 2022, 13192 :185-194