A Blackboard Model for Flexible and Parallel Text Annotation

Cited by: 0

Authors:

Ocana, Marc Gallofre ^{[1]}

Opdahl, Andreas L. ^{[1]}

Affiliations:

[1] Univ Bergen, Dept Informat Sci & Media Studies, N-5004 Bergen, Vestland, Norway

Source:

IEEE ACCESS | 2024, Vol. 12

Keywords:

Annotations; Task analysis; Vocabulary; Unified modeling language; Transformers; Knowledge graphs; Big Data; Deep learning; Semantics; Natural language processing; Information retrieval; Blackboard model; knowledge graph; big data; deep learning; semantic technologies; natural language processing; information extraction; KNOWLEDGE;

DOI:

10.1109/ACCESS.2024.3369409

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

Creating rich semantic text annotations is a complex process that involves combining multiple natural-language annotation approaches. This process is often approached sequentially and includes pre-processing steps and techniques that build on the outputs of others. However, combining them is not trivial: some annotation approaches comprise chains of steps or build on pre-existing annotations, some pre-processing steps may be common to several techniques, and many newer techniques are end-to-end, which has reduced the need for specific pre-processing steps. Yet it can be beneficial to combine the different approaches because they solve different annotation problems and, even when they solve the same problem, they may have complementary strengths. Whereas existing works often approach the annotation process sequentially, we argue that it can instead be implemented as a partly sequential, partly parallel and concurrent collaboration between independent components. The Blackboard Model is a long-established problem-solving paradigm for complex problems in which multiple knowledge sources contribute independently towards the solution. In this work, we study the feasibility of the Blackboard Model for creating rich semantic annotations from text as part of a larger big-data-ready AI system for supporting journalists and newsrooms.
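
As an illustration of the general Blackboard Model idea described in the abstract, the following is a minimal Python sketch, not the authors' implementation. All names (`Blackboard`, `KnowledgeSource`, `control_loop`, and the three toy annotators) are hypothetical and chosen only for this example. Each knowledge source declares which annotation layers it needs; sources whose prerequisites are met in the same round are independent of each other and run in parallel, while dependent sources wait for a later round.

```python
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass, field
from threading import Lock
from typing import Callable, Tuple


@dataclass
class Blackboard:
    """Shared store that every knowledge source reads from and writes to."""
    text: str
    annotations: dict = field(default_factory=dict)
    _lock: Lock = field(default_factory=Lock, repr=False)

    def post(self, layer, value):
        with self._lock:
            self.annotations[layer] = value

    def get(self, layer):
        with self._lock:
            return self.annotations[layer]

    def has(self, *layers):
        with self._lock:
            return all(layer in self.annotations for layer in layers)


@dataclass
class KnowledgeSource:
    name: str                 # annotation layer this source produces
    needs: Tuple[str, ...]    # layers that must already be on the board
    run: Callable             # function(board) -> annotation value

    def ready(self, board):
        return board.has(*self.needs) and not board.has(self.name)


def control_loop(board, sources):
    """Repeatedly run every source whose prerequisites are on the board."""
    with ThreadPoolExecutor() as pool:
        while True:
            ready = [s for s in sources if s.ready(board)]
            if not ready:
                break  # nothing left that can contribute
            # Sources ready in the same round do not depend on one another,
            # so they can be executed concurrently.
            futures = {s.name: pool.submit(s.run, board) for s in ready}
            for name, future in futures.items():
                board.post(name, future.result())
    return board.annotations


# Toy annotators (assumptions for illustration only).
def tokenize(board):
    return board.text.replace(".", " ").split()


def split_sentences(board):
    return [s.strip() for s in board.text.split(".") if s.strip()]


def spot_capitalized(board):
    return [t for t in board.get("tokens") if t[0].isupper()]


SOURCES = [
    KnowledgeSource("capitalized", ("tokens",), spot_capitalized),
    KnowledgeSource("tokens", (), tokenize),
    KnowledgeSource("sentences", (), split_sentences),
]

if __name__ == "__main__":
    print(control_loop(Blackboard("Bergen is rainy. Oslo is cold."), SOURCES))
```

In this sketch the tokenizer and sentence splitter run in the first round in parallel, and the capitalized-token spotter runs in the second round once the `tokens` layer exists. The order in which sources are listed does not matter, which is the partly sequential, partly parallel behaviour the abstract argues for.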


Pages: 30507-30517

Page count: 11
