Towards a Shared, Conceptual Model-Based Understanding of Proteins and Their Interactions

被引:5
作者
Leon, Ana [1 ]
Pastor, Oscar [1 ]
机构
[1] Univ Politecn Valencia, Res Ctr Software Prod Methods PROS, Valencia 46022, Spain
关键词
Proteins; Databases; Bioinformatics; Genomics; Organisms; Diseases; Task analysis; Conceptual modeling; genomics; proteins; MOLECULAR INTERACTION DATABASE; DISEASES;
D O I
10.1109/ACCESS.2021.3080040
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Understanding the human genome is a big research challenge. The huge complexity and amount of genome data require extremely effective and efficient data management policies. A first crucial point is to obtain a shared understanding of the domain, which becomes a very hard task considering the number of different genome data sources. To make things more complicated, those data sources deal with different parts of genome-based information: we not only need to understand them well, but also to integrate and intercommunicate all the relevant information. The protein perspective is a good example: rich, well-known repositories such as UniProt provide a lot of valuable information that it is not easy to interpret and manage when we want to generate useful results. Proteomes and basic information, protein-protein interaction, protein structure, protein processing events, protein function, etc. provide a lot of information is that needs to be conceptually characterized and delimited. To facilitate the essential common understanding of the domain, this paper uses the case of proteins to analyze the data provided by Uniprot in order to make a sound conceptualization work for identifying the relevant domain concepts. A conceptual model of proteins is the result of this conceptualization process, explained in detail in this work. This holistic conceptual model of proteins presented in this paper is the result of achieving a precise ontological commitment. It establishes concepts and their relationships that are significant in order to have a solid basis to efficiently manage relevant genome data related to proteins.
引用
收藏
页码:73608 / 73623
页数:16
相关论文
共 36 条
  • [21] Novak W. R. P, 2014, MOL LIFE SCI, P1
  • [22] Pastor O, 2008, LECT NOTES COMPUT SC, V5231, P1, DOI 10.1007/978-3-540-87877-3_1
  • [23] Protein-protein Interactions and their Role in Various Diseases and their Prediction Techniques
    Rabbani, Gulam
    Baig, Mohammad Hassan
    Ahmad, Khurshid
    Choi, Inho
    [J]. CURRENT PROTEIN & PEPTIDE SCIENCE, 2018, 19 (10) : 948 - 957
  • [24] Applying Conceptual Modeling to Better Understand the Human Genome
    Reyes Roman, Jose F.
    Pastor, Oscar
    Carlos Casamayor, Juan
    Valverde, Francisco
    [J]. CONCEPTUAL MODELING, ER 2016, 2016, 9974 : 404 - 412
  • [25] Disulfide Bond Formation in the Cytoplasm
    Saaranen, Mirva J.
    Ruddock, Lloyd W.
    [J]. ANTIOXIDANTS & REDOX SIGNALING, 2013, 19 (01) : 36 - 43
  • [26] The Database of Interacting Proteins: 2004 update
    Salwinski, L
    Miller, CS
    Smith, AJ
    Pettit, FK
    Bowie, JU
    Eisenberg, D
    [J]. NUCLEIC ACIDS RESEARCH, 2004, 32 : D449 - D451
  • [27] dbSNP: the NCBI database of genetic variation
    Sherry, ST
    Ward, MH
    Kholodov, M
    Baker, J
    Phan, L
    Smigielski, EM
    Sirotkin, K
    [J]. NUCLEIC ACIDS RESEARCH, 2001, 29 (01) : 308 - 311
  • [28] An Overview of the Prediction of Protein DNA-Binding Sites
    Si, Jingna
    Zhao, Rui
    Wu, Rongling
    [J]. INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2015, 16 (03): : 5194 - 5215
  • [29] Spreeuwenberg S., 2019, AIX: Artificial Intelligence Needs Explanation
  • [30] Analysis of protein isoforms: Can we do it better?
    Stastna, Miroslava
    Van Eyk, Jennifer E.
    [J]. PROTEOMICS, 2012, 12 (19-20) : 2937 - 2948