PoliViews: A comprehensive and modular approach to the conceptual modeling of genomic data

被引:0
作者
Bernasconi, Anna [1 ]
Garcia, S. Alberto [2 ]
Ceri, Stefano [1 ]
Pastor, Oscar [2 ]
机构
[1] Politecn Milan, Dept Elect Informat & Bioengn, Milan, Italy
[2] Univ Politecn Valencia, VRAIN Res Inst, PROS Res Ctr, Valencia, Spain
关键词
Conceptual modeling; Data repositories; Data integration; Biological datasets; Genomics; Scientific databases; GENE-EXPRESSION; INTEGRATIVE ANALYSIS; DNA ELEMENTS; ENCYCLOPEDIA; ENVIRONMENT; ATLAS;
D O I
10.1016/j.datak.2023.102201
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The human genome complexity is captured by many signals, representing for instance DNA variations, the expression of gene activity, or DNA's structural rearrangements; a rich set of data types and formats is used to record these signals. Conceptual models can support the description and explanation of the genome's elaborate structure and behavior. Among others, the Conceptual Schema of the Human Genome (CSG) provides a concept-oriented, top-down representation of the genome behavior, which is independent of data formats. The Genomic Conceptual Model (GCM) provides instead a data-oriented, bottom-up representation, targeting a well-organized, unified description of these formats. In this research, we join the two approaches to achieve PoliViews, a comprehensive model that links (1) a concepts layer, describing genome elements and their conceptual connections, with (2) a data layer, describing datasets derived from genome sequencing with specific technologies. Their dynamic connection is established when specific genomic data types are chosen in the data layer, thereby triggering the selection of a view in the concepts layer. The benefit is mutual: data records can be semantically described by high-level concepts exploiting their links and, in turn, the continuously evolving abstract model can be extended thanks to the input provided by real datasets. PoliViews enables expressing queries that employ a holistic conceptual perspective on the genome, directly translated onto data-oriented terms and organization. Here, we demonstrate the approach by linking two major genomic data types, namely DNA variation and gene expression. For each type, we consider different eminent data sources; we describe their mapping with the corresponding view in the concepts layer, enabling an intra-data-type integration. Then, leveraging on the connections available in the concepts layer, we show how the distinct data types can be interoperated, enabling an inter-data-type integration. The PoliViews approach is shown through several examples of biological interest and can be further extended to any kind of genomic information.
引用
收藏
页数:17
相关论文
共 50 条
  • [31] A Linguistic Approach to Conceptual Modeling with Semantic Types and Onto UML
    Castro, Lucia
    Baiao, Fernanda
    Guizzardi, Giancarlo
    2010 14TH IEEE INTERNATIONAL ENTERPRISE DISTRIBUTED OBJECT COMPUTING CONFERENCE WORKSHOPS (EDOCW 2010), 2010, : 215 - 224
  • [32] Business-driven data analytics: A conceptual modeling framework
    Nalchigar, Soroosh
    Yu, Eric
    DATA & KNOWLEDGE ENGINEERING, 2018, 117 : 359 - 372
  • [33] Towards a Combination of Three Representation Techniques for Conceptual Data Modeling
    Kop, Christian
    2009 FIRST INTERNATIONAL CONFERENCE ON ADVANCES IN DATABASES, KNOWLEDGE, AND DATA APPLICATIONS, 2009, : 95 - 100
  • [34] Conceptual modeling of data intensive and information intensive web applications
    Bochicchio, M
    Longo, A
    10TH INTERNATIONAL MULTIMEDIA MODELLING CONFERENCE, PROCEEDINGS, 2004, : 292 - 299
  • [35] A Conceptual Model-Based Approach to Improve the Representation and Management of Omics Data in Precision Medicine
    Garcia, S. Alberto
    Palacio, Ana Leon
    Roman, Jose Fabian Reyes
    Casamayor, Juan Carlos
    Pastor, Oscar
    IEEE ACCESS, 2021, 9 : 154071 - 154085
  • [36] Modeling Discrete Survival Time Using Genomic Feature Data
    Ferber, Kyle
    Archer, Kellie J.
    CANCER INFORMATICS, 2015, 14 : 37 - 43
  • [37] Conceptual Modeling of the Organisational Aspects for Distributed Applications: The Semantic Lifting Approach
    Hrgovcic, Vedran
    Karagiannis, Dimitris
    Woitsch, Robert
    2013 IEEE 37TH ANNUAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE WORKSHOPS (COMPSACW), 2013, : 145 - 150
  • [38] Modeling vague spatial data warehouses using the VSCube conceptual model
    Thiago Luís Lopes Siqueira
    Cristina Dutra de Aguiar Ciferri
    Valéria Cesário Times
    Ricardo Rodrigues Ciferri
    GeoInformatica, 2014, 18 : 313 - 356
  • [39] Towards Understanding of Classes versus Data Types in Conceptual Modeling and UML
    Milicev, Dragan
    COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2012, 9 (02) : 505 - 538
  • [40] Modeling vague spatial data warehouses using the VSCube conceptual model
    Lopes Siqueira, Thiago Luis
    de Aguiar Ciferri, Cristina Dutra
    Times, Valeria Cesario
    Ciferri, Ricardo Rodrigues
    GEOINFORMATICA, 2014, 18 (02) : 313 - 356