A Blackboard Model for Flexible and Parallel Text Annotation

被引:0
作者
Ocana, Marc Gallofre [1 ]
Opdahl, Andreas L. [1 ]
机构
[1] Univ Bergen, Dept Informat Sci & Media Studies, N-5004 Bergen, Vestland, Norway
关键词
Annotations; Task analysis; Vocabulary; Unified modeling language; Transformers; Knowledge graphs; Big Data; Deep learning; Semantics; Natural language processing; Information retrieval; Blackboard model; knowledge graph; big data; deep learning; semantic technologies; natural language processing; information extraction; KNOWLEDGE;
D O I
10.1109/ACCESS.2024.3369409
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Creating rich semantic text annotations is a complex process that involves combining multiple natural-language annotation approaches. This annotation process is often approached sequentially and includes pre-processing steps and techniques that build on the outputs of others. However, combining them is not trivial, because some annotation approaches comprise chains of steps or build on other already pre-existing annotations, some pre-processing steps may be common to several techniques, and many newer techniques are even end-to-end which have diluted the need for specific pre-processing steps. Yet it can be beneficial to combine the different approaches because they solve different annotation problems and, even when they solve the same problem, they may have complementary strengths. Whereas existing works often approach the annotation process sequentially, we argue that it can instead be implemented as a partly sequential, partly parallel and concurrent collaboration between independent components. The Blackboard Model is a long-established problem-solving paradigm that deals with complex problems where multiple knowledge sources contribute independently towards the solution. In this work, we study the feasibility of the Blackboard Model for creating rich semantic annotations from text as part of a larger big-data-ready AI system for supporting journalists and newsrooms.
引用
收藏
页码:30507 / 30517
页数:11
相关论文
共 50 条
[41]   A Parallel Platform for Web Text Mining [J].
Ping Lu ;
Zhenjiang Dong ;
Shengmei Luo ;
Lixia Liu ;
Shanshan Guan ;
Shengyu Liu ;
Qingcai Chen .
ZTE Communications, 2013, 11 (03) :56-61
[42]   Collaborative text-annotation resource for disease-centered relation extraction from biomedical text [J].
Cano, C. ;
Monaghan, T. ;
Blanco, A. ;
Wall, D. P. ;
Peshkin, L. .
JOURNAL OF BIOMEDICAL INFORMATICS, 2009, 42 (05) :967-977
[43]   A flood hazard cause classification model for substation flood prevention case text [J].
Ke, Xuanhua ;
Lou, Lan ;
Xu, Ruiwen ;
Peng, Jia .
ELECTRIC POWER SYSTEMS RESEARCH, 2025, 249
[44]   Text-image multimodal fusion model for enhanced fake news detection [J].
Lin, Szu-Yin ;
Chen, Yen-Chiu ;
Chang, Yu-Han ;
Lo, Shih-Hsin ;
Chao, Kuo-Ming .
SCIENCE PROGRESS, 2024, 107 (04)
[45]   Chinese Text Error Correction Based on PE-T5 Model [J].
Deng, Hua ;
Xu, Kang ;
Li, Rongsheng ;
Qi, Yifei .
2024 5TH INTERNATIONAL CONFERENCE ON COMPUTING, NETWORKS AND INTERNET OF THINGS, CNIOT 2024, 2024, :223-227
[46]   Sentiment analysis of Nepali social media text with a hybrid deep learning model [J].
Sitoula, Sameer ;
Shahi, Tej Bahadur ;
Wibowo, Santoso ;
Neupane, Arjun .
SOCIAL NETWORK ANALYSIS AND MINING, 2025, 15 (01)
[47]   Investigation on text generative model based on deep learning in natural language processing [J].
Zheng, Xianqiu ;
Zhang, Zhidong ;
Wang, Liqin ;
Wu, Jinhua ;
Dong, Zuofeng .
JOURNAL OF COMPUTATIONAL METHODS IN SCIENCES AND ENGINEERING, 2024, 24 (06) :4089-4100
[48]   Research on news text classification based on improved BERT-UNet model [J].
Li, Zeqin ;
Liu, Jianwen ;
Lin, Jin ;
Tan, Deli ;
Gong, Ruyue ;
Wang, Linglin .
PROCEEDINGS OF INTERNATIONAL CONFERENCE ON MODELING, NATURAL LANGUAGE PROCESSING AND MACHINE LEARNING, CMNM 2024, 2024, :1-7
[49]   Dual Triggered Correspondence Topic (DTCT)model for MeSH annotation [J].
Kim, Seonho ;
Yoon, Juntae .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (02) :899-911
[50]   A Model for Processing Arabic Text on Twitter [J].
Hegazi, Mohamed Osman ;
Al-Dossari, Yasser ;
Al-Yahya, Abdullah ;
Al-Sumaril, Abdulaziz ;
Hilal, Anwer .
INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2020, 20 (05) :150-157