Modeling Hereditary Disease Behavior Using an Innovative Similarity Criterion and Ensemble Clustering

被引:21
作者
Mojarad, Musa [1 ,2 ]
Sarhangnia, Fariba [3 ]
Rezaeipanah, Amin [4 ]
Parvin, Hamin [5 ]
Nejatian, Samad [2 ,6 ]
机构
[1] Islamic Azad Univ, Dept Comp Engn, Firoozabad Branch, Firoozabad, Iran
[2] Islamic Azad Univ, Young Researchers & Elite Clubs, Firoozabad Branch, Firoozabad, Iran
[3] Islamic Azad Univ, Dept Comp Engn & Informat Technol, Bushehr Branch, Bushehr, Iran
[4] Univ Rahjuyan Danesh Borazjan, Dept Comp Engn, Bushehr, Iran
[5] Islamic Azad Univ, Dept Comp Engn, Nourabad Mamasani Branch, Nourabad Mamasani, Iran
[6] Islamic Azad Univ, Dept Elect Engn, Yasooj Branch, Yasuj, Iran
关键词
Patients behavior modeling; communications; clustering ensemble; topological graph structure; FANTOM5; cell; CELL;
D O I
10.2174/1574893616999210128175715
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Today, there are various theories about the causes of hereditary diseases, but doctors believe that both genetic and environmental factors play an essential role in the incidence and spread of these diseases. Objective: In order to identify genes that are cause the disease, inter-cell or inter-tissue communications must be determined. The inter-cells or inter-tissues interaction could be illustrated by applying the gene expression. The disorders that have led to widespread changes could be identified by investigating gene expression information. Methods: In this paper, identifying inter-cell and inter-tissue communications for various diseases has been accomplished utilizing an innovative similarity criterion of the graph topological structure characteristics and an extended clustering ensemble. The proposed method is performed in two stages: first, several clustering models have been combined to detect initial inter-cell or inter-tissue communications and produce better results than singular algorithms. Second, the cell-to-cell or tissue-to-tissue similarity in each cluster is identified through a similarity criterion based on the graph topological structure. Results: The evaluation of the proposed method has been carried out, benefiting the UCI and FAN-TOM5 datasets. The results of experiments over FANTOM5 dataset report that the Silhouette coefficient equals 0.901 in 18 clusters for cells and equal to 0.762 in 13 clusters for tissues. Conclusion: The maximum inter-cells or inter-tissues similarity in each cluster can be exploited to detect the relationships between diseases.
引用
收藏
页码:749 / 764
页数:16
相关论文
共 50 条
  • [1] Alqurashi T, 2014, LECT NOTES COMPUT SC, V8669, P142, DOI 10.1007/978-3-319-10840-7_18
  • [2] Arab M, 2017, 2017 3RD INTERNATIONAL CONFERENCE ON WEB RESEARCH (ICWR), P7, DOI 10.1109/ICWR.2017.7959298
  • [3] Aranganayagi S., 2007, ICCIMA 2007 INT C CO, VIII, P13, DOI DOI 10.1109/ICCIMA.2007.328
  • [4] A history of the study of solid tumour growth: The contribution of mathematical modelling
    Araujo, RP
    McElwain, DLS
    [J]. BULLETIN OF MATHEMATICAL BIOLOGY, 2004, 66 (05) : 1039 - 1091
  • [5] Accurate whole human genome sequencing using reversible terminator chemistry
    Bentley, David R.
    Balasubramanian, Shankar
    Swerdlow, Harold P.
    Smith, Geoffrey P.
    Milton, John
    Brown, Clive G.
    Hall, Kevin P.
    Evers, Dirk J.
    Barnes, Colin L.
    Bignell, Helen R.
    Boutell, Jonathan M.
    Bryant, Jason
    Carter, Richard J.
    Cheetham, R. Keira
    Cox, Anthony J.
    Ellis, Darren J.
    Flatbush, Michael R.
    Gormley, Niall A.
    Humphray, Sean J.
    Irving, Leslie J.
    Karbelashvili, Mirian S.
    Kirk, Scott M.
    Li, Heng
    Liu, Xiaohai
    Maisinger, Klaus S.
    Murray, Lisa J.
    Obradovic, Bojan
    Ost, Tobias
    Parkinson, Michael L.
    Pratt, Mark R.
    Rasolonjatovo, Isabelle M. J.
    Reed, Mark T.
    Rigatti, Roberto
    Rodighiero, Chiara
    Ross, Mark T.
    Sabot, Andrea
    Sankar, Subramanian V.
    Scally, Aylwyn
    Schroth, Gary P.
    Smith, Mark E.
    Smith, Vincent P.
    Spiridou, Anastassia
    Torrance, Peta E.
    Tzonev, Svilen S.
    Vermaas, Eric H.
    Walter, Klaudia
    Wu, Xiaolin
    Zhang, Lu
    Alam, Mohammed D.
    Anastasi, Carole
    [J]. NATURE, 2008, 456 (7218) : 53 - 59
  • [6] Bharadwaj KK, 2005, PROC WRLD ACAD SCI E, V8, P333
  • [7] Hypergraph imaging: an overview
    Bretto, A
    Cherifi, H
    Aboutajdine, D
    [J]. PATTERN RECOGNITION, 2002, 35 (03) : 651 - 658
  • [8] Dissecting cancer through mathematics: from the cell to the animal model
    Byrne, Helen M.
    [J]. NATURE REVIEWS CANCER, 2010, 10 (03) : 221 - 230
  • [9] Detection of gene communities in multi-networks reveals cancer drivers
    Cantini, Laura
    Medico, Enzo
    Fortunato, Santo
    Caselle, Michele
    [J]. SCIENTIFIC REPORTS, 2015, 5
  • [10] A comparison of methods accounting for batch effects in differential expression analysis of UMI count based single cell RNA sequencing
    Chen, Wenan
    Zhang, Silu
    Williams, Justin
    Ju, Bensheng
    Shaner, Bridget
    Easton, John
    Wu, Gang
    Chen, Xiang
    [J]. COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2020, 18 (18): : 861 - 873