PoliViews: A comprehensive and modular approach to the conceptual modeling of genomic data

Authors
Bernasconi, Anna [1]
Garcia, S. Alberto [2]
Ceri, Stefano [1]
Pastor, Oscar [2]
Affiliations
[1] Politecn Milan, Dept Elect Informat & Bioengn, Milan, Italy
[2] Univ Politecn Valencia, VRAIN Res Inst, PROS Res Ctr, Valencia, Spain
Keywords
Conceptual modeling; Data repositories; Data integration; Biological datasets; Genomics; Scientific databases; GENE-EXPRESSION; INTEGRATIVE ANALYSIS; DNA ELEMENTS; ENCYCLOPEDIA; ENVIRONMENT; ATLAS;
DOI
10.1016/j.datak.2023.102201
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The complexity of the human genome is captured by many signals, representing, for instance, DNA variations, the expression of gene activity, or DNA's structural rearrangements; a rich set of data types and formats is used to record these signals. Conceptual models can support the description and explanation of the genome's elaborate structure and behavior. Among others, the Conceptual Schema of the Human Genome (CSG) provides a concept-oriented, top-down representation of genome behavior that is independent of data formats. The Genomic Conceptual Model (GCM) instead provides a data-oriented, bottom-up representation, targeting a well-organized, unified description of these formats. In this research, we join the two approaches to obtain PoliViews, a comprehensive model that links (1) a concepts layer, describing genome elements and their conceptual connections, with (2) a data layer, describing datasets derived from genome sequencing with specific technologies. Their dynamic connection is established when specific genomic data types are chosen in the data layer, thereby triggering the selection of a view in the concepts layer. The benefit is mutual: data records can be semantically described by high-level concepts through their links and, in turn, the continuously evolving abstract model can be extended thanks to the input provided by real datasets. PoliViews makes it possible to express queries that take a holistic conceptual perspective on the genome and are directly translated into data-oriented terms and organization. Here, we demonstrate the approach by linking two major genomic data types, namely DNA variation and gene expression. For each type, we consider several prominent data sources and describe their mapping to the corresponding view in the concepts layer, enabling intra-data-type integration. Then, leveraging the connections available in the concepts layer, we show how the distinct data types can interoperate, enabling inter-data-type integration. The PoliViews approach is illustrated through several examples of biological interest and can be further extended to any kind of genomic information.
Pages: 17