Semantic Annotation of Data Processing Pipelines in Scientific Publications

Cited by: 11
Authors
Mesbah, Sepideh [1]
Fragkeskos, Kyriakos [1]
Lofi, Christoph [1]
Bozzon, Alessandro [1]
Houben, Geert-Jan [1]
Affiliations
[1] Delft Univ Technol, Mekelweg 4, NL-2628 CD Delft, Netherlands
Source
SEMANTIC WEB (ESWC 2017), PT I | 2017 / Vol. 10249
DOI
10.1007/978-3-319-58068-5_20
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Data processing pipelines are a core object of interest for data scientists and practitioners operating in a variety of data-related application domains. To effectively capitalise on the experience gained in the creation and adoption of such pipelines, the need arises for mechanisms able to capture knowledge about datasets of interest, data processing methods designed to achieve a given goal, and the performance achieved when applying such methods to the considered datasets. However, due to its distributed and often unstructured nature, this knowledge is not easily accessible. In this paper, we use (scientific) publications as a source of knowledge about data processing pipelines. We describe a method designed to classify sentences according to the nature of the contained information (i.e., scientific objective, dataset, method, software, result), and to extract relevant named entities. The extracted information is then semantically annotated and published as linked data in open knowledge repositories according to the DMS ontology for data processing metadata. To demonstrate the effectiveness and performance of our approach, we present the results of a quantitative and qualitative analysis performed on four different conference series.
Pages: 321 - 336
Number of pages: 16
Related papers
50 items in total
  • [31] Ontology engineering for the semantic annotation of medical data
    Bontas, EP
    Schlangen, D
    Niepage, S
    [J]. SIXTEENTH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2005, : 567 - 571
  • [32] Semantic Annotation and Publication of Linked Open Data
    Sorrentino, Serena
    Bergamaschi, Sonia
    Fusari, Elisa
    Beneventano, Domenico
    [J]. COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2013, PT V, 2013, 7975 : 462 - 474
  • [33] SemMobi: A Semantic Annotation System for Mobility Data
    Wu, Fei
    Wang, Hongjian
    Li, Zhenhui
    Lee, Wang-Chien
    Huang, Zhuojie
    [J]. WWW'15 COMPANION: PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2015, : 255 - 258
  • [34] Annotation of multimedia data using semantic metadata
    An, Hyoung-Keun
    Koh, Jae-Jin
    [J]. IFOST 2006: 1ST INTERNATIONAL FORUM ON STRATEGIC TECHNOLOGY, PROCEEDINGS: E-VEHICLE TECHNOLOGY, 2006, : 269 - +
  • [35] Semantic Trajectories: Mobility Data Computation and Annotation
    Yan, Zhixian
    Chakraborty, Dipanjan
    Parent, Christine
    Spaccapietra, Stefano
    Aberer, Karl
    [J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2013, 4 (03)
  • [36] A Linked Data prototype for semantic cataloguing of publications
    Freitas Junior, Nilton
    de Azevedo Jacynto, Mark Douglas
    [J]. PERSPECTIVAS EM CIENCIA DA INFORMACAO, 2016, 21 (04) : 48 - 65
  • [37] SEMANTIC SEARCH IN OFFSHORE ENGINEERING WITH LINGUISTICS AND NEURAL PROCESSING PIPELINES
    Pol Goncalves, Flavio Jaime
    de Oliveira Carmo, Vinicius Cleves
    de Melo, Vinicius Toquetti
    Cunha, Rodrigo da Silva
    Santos, Ismael H. F.
    Barreira, Rodrigo Augusto
    Cugnasca, Carlos Eduardo
    Cozman, Fabio Gagliardi
    Gomi, Edson Satoshi
    [J]. PROCEEDINGS OF ASME 2021 40TH INTERNATIONAL CONFERENCE ON OCEAN, OFFSHORE AND ARCTIC ENGINEERING (OMAE2021), VOL 1, 2021,
  • [38] Toward the modeling of data provenance in scientific publications
    Mahmood, Tariq
    Jami, Syed Imran
    Shaikh, Zubair Ahmed
    Mughal, Muhammad Hussain
    [J]. COMPUTER STANDARDS & INTERFACES, 2013, 35 (01) : 6 - 29
  • [39] CITATION HISTORIES OF SCIENTIFIC PUBLICATIONS - THE DATA SOURCES
    VLACHY, J
    [J]. SCIENTOMETRICS, 1985, 7 (3-6) : 505 - 528
  • [40] Semantic Annotation of the ACL Anthology Corpus for the Automatic Analysis of Scientific Literature
    Gabor, Kata
    Zargayouna, Haifa
    Buscaldi, Davide
    Tellier, Isabelle
    Charnois, Thierry
    [J]. LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, : 3694 - 3701