Deriving semantic validation rules from industrial standards: An OPC UA study

被引:4
作者
Bareedu, Yashoda Saisree [1 ]
Fruehwirth, Thomas [2 ]
Niedermeier, Christoph [1 ]
Sabou, Marta [3 ,4 ]
Steindl, Gernot [2 ]
Thuluva, Aparna Saisree [1 ,4 ]
Tsaneva, Stefani [3 ]
Ozkaya, Nilay Tufek [1 ]
机构
[1] Siemens AG, Corp Technol, Munich, Germany
[2] TU Wien, Inst Comp Engn, Vienna, Austria
[3] TU Wien, Inst Informat Syst Engn, Vienna, Austria
[4] Vienna Univ Econ & Business, Inst Data Proc & Knowledge Management, Vienna, Austria
关键词
Semantic validation; information extraction; natural language processing; human-in-the-loop; OPC UA; INFORMATION EXTRACTION; WEB; INTEROPERABILITY; CHALLENGES;
D O I
10.3233/SW-233342
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Industrial standards provide guidelines for data modeling to ensure interoperability between stakeholders of an industry branch (e.g., robotics). Most frequently, such guidelines are provided in an unstructured format (e.g., pdf documents) which hampers the automated validations of information objects (e.g., data models) that rely on such standards in terms of their compliance with the modeling constraints prescribed by the guidelines. This raises the risk of costly interoperability errors induced by the incorrect use of the standards. There is, therefore, an increased interest in automatic semantic validation of information objects based on industrial standards. In this paper we focus on an approach to semantic validation by formally representing the modeling constraints from unstructured documents as explicit, machine-actionable rules (to be then used for semantic validation) and (semi-)automatically extracting such rules from pdf documents. While our approach aims to be generically applicable, we exemplify an adaptation of the approach in the concrete context of the OPC UA industrial standard, given its large-scale adoption among important industrial stakeholders and the OPC UA internal efforts towards semantic validation. We conclude that (i) it is feasible to represent modeling constraints from the standard specifications as rules, which can be organized in a taxonomy and represented using Semantic Web technologies such as OWL and SPARQL; (ii) we could automatically identify modeling constraints in the specification documents by inspecting the tables (P = 87%) and text of these documents (F1 up to 94%); (iii) the translation of the modeling constraints into formal rules could be fully automated when constraints were extracted from tables and required a Human-in-the-loop approach for constraints extracted from text.
引用
收藏
页码:517 / 554
页数:38
相关论文
共 42 条
  • [1] Aydin M., 2019, Eurasian BIM Forum, P101
  • [2] Biffl S., 2016, Semantic web technologies for intelligent engineering applications, DOI [10.1007/978-3-319-41490-4, DOI 10.1007/978-3-319-41490-4]
  • [3] Rule extraction from scientific texts: Evaluation in the specialty of gynecology
    Boufrida, Amina
    Boufaida, Zizette
    [J]. JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (04) : 1150 - 1160
  • [4] Butzin B, 2017, IEEE IND ELEC, P8615, DOI 10.1109/IECON.2017.8217514
  • [5] Smart Cities
    Celino, Irene
    Kotoulas, Spyros
    [J]. IEEE INTERNET COMPUTING, 2013, 17 (06) : 8 - 11
  • [6] Five challenges for the Semantic Sensor Web
    Corcho, Oscar
    Garcia-Castro, Raul
    [J]. SEMANTIC WEB, 2010, 1 (1-2) : 121 - 125
  • [7] Natural language processing and information extraction: Qualitative analysis of financial news articles.
    Costantino, M
    Morgan, RG
    Collingham, RJ
    Garigliano, R
    [J]. PROCEEDINGS OF THE IEEE/IAFE 1997 COMPUTATIONAL INTELLIGENCE FOR FINANCIAL ENGINEERING (CIFER), 1997, : 116 - 122
  • [8] Information extraction
    Cowie, J
    Lehnert, W
    [J]. COMMUNICATIONS OF THE ACM, 1996, 39 (01) : 80 - 91
  • [9] Cunningham Hamish., 2005, ENCY LANGUAGE LINGUI
  • [10] da Rocha H, 2020, IEEE IND ELEC, P5243, DOI [10.1109/iecon43393.2020.9254274, 10.1109/IECON43393.2020.9254274]