Automated Semantic Annotation Deploying Machine Learning Approaches: A Systematic Review

被引:0
作者
Chang W.C. [1 ]
Sangodiah A. [2 ]
机构
[1] Faculty of Technology and Applied Science, Open University Malaysia
[2] School of Computing, Faculty of Computing and Engineering, Quest International University
关键词
Machine Learning; Quality Metrics; Semantic Annotation Automation; Semantic Web; Systematic Review;
D O I
10.13164/mendel.2023.2.111
中图分类号
学科分类号
摘要
Semantic Web is the vision to make Internet data machine-readable to achieve information retrieval with higher granularity and personalisation. Semantic annotation is the process that binds machine-understandable descriptions into Web resources such as text and images. Hence, the success of Semantic Web depends on the wide availability of semantically annotated Web resources. However, there remains a huge amount of unannotated Web resources due to the limited annotation capability available. In order to address this, machine learning approaches have been used to improve the automation process. This Systematic Review aims to summarise the existing state-of-the-art literature to answer five Research Questions focusing on machine learning driven semantic annotation automation. The analysis of 40 selected primary studies reveals that the use of unitary and combination of machine learning algorithms are both the current directions. Support Vector Machine (SVM) is the most-used algorithm, and supervised learning is the predominant machine learning type. Both semi-automated and fully automated annotation are almost nearly achieved. Meanwhile, text is the most annotated Web resource; and the availability of third-party annotation tools is in-line with this. While Precision, Recall, F-Measure and Accuracy are the most deployed quality metrics, not all the studies measured the quality of the annotated results. In the future, standardising quality measures is the direction for research. © 2023, Brno University of Technology. All rights reserved.
引用
收藏
页码:111 / 130
页数:19
相关论文
共 89 条
  • [1] Oxford learner’s dictionaries, (2022)
  • [2] Achimugu P., Selamat A., Ibrahim R., Mahrin M. N., A systematic literature review of software requirements prioritization research, Information and Software Technology, 56, pp. 568-585, (2014)
  • [3] Adebugbe O., Development and evaluation of a holistic, cloud-driven and microservices-based architecture for automated semantic annotation of web documents, (2019)
  • [4] Ahmed S., Frikha M., Hussein T., Rahebi J., Harris hawks optimization systems, 2022 International Congress on Human-Computer Interaction, Optimization and Robotic Applications (HORA), pp. 1-6, (2022)
  • [5] Al-Bukhitan S., Alnazer A., Helmy T., Semantic annotation arabic web documents using deep learning, Procedia Computer Science, 130, pp. 589-596, (2018)
  • [6] Al-Bukhitan S., Alnazer A., Helmy T., Semantic web annotation using deep learning with arabic morphology, Procedia Computer Science, 151, pp. 385-392, (2019)
  • [7] Al-Bukhitan S., Helmy T., Al- Mulhem M., Semantic annotation tool for annotating arabic web documents, Procedia Computer Science, 32, pp. 429-436, (2014)
  • [8] Andrade G., Semantic enrichment of american english corpora through automatic semantic annotation based on top-level ontologies using the crf classification model, (2018)
  • [9] Arcan M., Buitelaar P., Machine tranlsation of domain-specific expressions within ontologies and documents, (2017)
  • [10] Bastos E., Barcellos M., de Almeida Falbo R., Using semantic documentation to support software project management, Journal on Data Semantics, 7, pp. 107-132, (2018)