Semantic Annotation of Data Processing Pipelines in Scientific Publications

被引:11
|
作者
Mesbah, Sepideh [1 ]
Fragkeskos, Kyriakos [1 ]
Lofi, Christoph [1 ]
Bozzon, Alessandro [1 ]
Houben, Geert-Jan [1 ]
机构
[1] Delft Univ Technol, Mekelweg 4, NL-2628 CD Delft, Netherlands
来源
SEMANTIC WEB ( ESWC 2017), PT I | 2017年 / 10249卷
关键词
D O I
10.1007/978-3-319-58068-5_20
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data processing pipelines are a core object of interest for data scientist and practitioners operating in a variety of data-related application domains. To effectively capitalise on the experience gained in the creation and adoption of such pipelines, the need arises for mechanisms able to capture knowledge about datasets of interest, data processing methods designed to achieve a given goal, and the performance achieved when applying such methods to the considered datasets. However, due to its distributed and often unstructured nature, this knowledge is not easily accessible. In this paper, we use (scientific) publications as source of knowledge about Data Processing Pipelines. We describe a method designed to classify sentences according to the nature of the contained information (i.e. scientific objective, dataset, method, software, result), and to extract relevant named entities. The extracted information is then semantically annotated and published as linked data in open knowledge repositories according to the DMS ontology for data processing metadata. To demonstrate the effectiveness and performance of our approach, we present the results of a quantitative and qualitative analysis performed on four different conference series.
引用
收藏
页码:321 / 336
页数:16
相关论文
共 50 条
  • [1] A semantic annotation framework for scientific publications
    Yuchul Jung
    Quality & Quantity, 2017, 51 : 1009 - 1025
  • [2] A semantic annotation framework for scientific publications
    Jung, Yuchul
    QUALITY & QUANTITY, 2017, 51 (03) : 1009 - 1025
  • [3] Open semantic annotation of scientific publications using DOMEO
    Paolo Ciccarese
    Marco Ocana
    Tim Clark
    Journal of Biomedical Semantics, 3 (Suppl 1)
  • [4] Semantic Annotation of Scientific Publications Based on Integration of Concept Knowledge
    Phyo, Shwe Sin
    Myo, Nyein Nyein
    EMERGING TRENDS IN INTELLIGENT COMPUTING AND INFORMATICS: DATA SCIENCE, INTELLIGENT INFORMATION SYSTEMS AND SMART COMPUTING, 2020, 1073 : 98 - 109
  • [5] Annotation and semantic search of scientific documents in Big Data environments
    Portilla Herrera, Nelson Alejandro
    Solarte Pabon, Oswaldo
    Lopez Gomez, Federico
    Bucheli Guerrero, Victor Andres
    OBRAS COLECTIVAS EN CIENCIAS DE LA COMPUTACION, 2018, : 499 - 503
  • [6] Semantic Representation of Scientific Publications
    Vahdati, Sahar
    Fathalla, Said
    Auer, Soeren
    Lange, Christoph
    Vidal, Maria-Esther
    DIGITAL LIBRARIES FOR OPEN KNOWLEDGE, TPDL 2019, 2019, 11799 : 375 - 379
  • [7] SEMANTIC DIGITAL LIBRARIES FOR SCIENTIFIC PUBLICATIONS
    Kalemi, E.
    Toti, L.
    EDULEARN15: 7TH INTERNATIONAL CONFERENCE ON EDUCATION AND NEW LEARNING TECHNOLOGIES, 2015, : 8396 - 8402
  • [8] Semantic annotation of summarized sensor data stream for effective query processing
    Shobharani Pacha
    Suresh Ramalingam Murugan
    R. Sethukarasi
    The Journal of Supercomputing, 2020, 76 : 4017 - 4039
  • [9] Semantic annotation of summarized sensor data stream for effective query processing
    Pacha, Shobharani
    Murugan, Suresh Ramalingam
    Sethukarasi, R.
    JOURNAL OF SUPERCOMPUTING, 2020, 76 (06): : 4017 - 4039
  • [10] An annotation scheme for references to research artefacts in scientific publications
    Schindler, David
    Yordanova, Kristina
    Krueger, Frank
    2019 IEEE INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND COMMUNICATIONS WORKSHOPS (PERCOM WORKSHOPS), 2019, : 52 - 57