Modeling Hereditary Disease Behavior Using an Innovative Similarity Criterion and Ensemble Clustering

被引:21
作者
Mojarad, Musa [1 ,2 ]
Sarhangnia, Fariba [3 ]
Rezaeipanah, Amin [4 ]
Parvin, Hamin [5 ]
Nejatian, Samad [2 ,6 ]
机构
[1] Islamic Azad Univ, Dept Comp Engn, Firoozabad Branch, Firoozabad, Iran
[2] Islamic Azad Univ, Young Researchers & Elite Clubs, Firoozabad Branch, Firoozabad, Iran
[3] Islamic Azad Univ, Dept Comp Engn & Informat Technol, Bushehr Branch, Bushehr, Iran
[4] Univ Rahjuyan Danesh Borazjan, Dept Comp Engn, Bushehr, Iran
[5] Islamic Azad Univ, Dept Comp Engn, Nourabad Mamasani Branch, Nourabad Mamasani, Iran
[6] Islamic Azad Univ, Dept Elect Engn, Yasooj Branch, Yasuj, Iran
关键词
Patients behavior modeling; communications; clustering ensemble; topological graph structure; FANTOM5; cell; GRAPH-THEORY; CELL;
D O I
10.2174/1574893616999210128175715
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Today, there are various theories about the causes of hereditary diseases, but doctors believe that both genetic and environmental factors play an essential role in the incidence and spread of these diseases. Objective: In order to identify genes that are cause the disease, inter-cell or inter-tissue communications must be determined. The inter-cells or inter-tissues interaction could be illustrated by applying the gene expression. The disorders that have led to widespread changes could be identified by investigating gene expression information. Methods: In this paper, identifying inter-cell and inter-tissue communications for various diseases has been accomplished utilizing an innovative similarity criterion of the graph topological structure characteristics and an extended clustering ensemble. The proposed method is performed in two stages: first, several clustering models have been combined to detect initial inter-cell or inter-tissue communications and produce better results than singular algorithms. Second, the cell-to-cell or tissue-to-tissue similarity in each cluster is identified through a similarity criterion based on the graph topological structure. Results: The evaluation of the proposed method has been carried out, benefiting the UCI and FAN-TOM5 datasets. The results of experiments over FANTOM5 dataset report that the Silhouette coefficient equals 0.901 in 18 clusters for cells and equal to 0.762 in 13 clusters for tissues. Conclusion: The maximum inter-cells or inter-tissues similarity in each cluster can be exploited to detect the relationships between diseases.
引用
收藏
页码:749 / 764
页数:16
相关论文
共 50 条
[1]  
Alqurashi T, 2014, LECT NOTES COMPUT SC, V8669, P142, DOI 10.1007/978-3-319-10840-7_18
[2]  
Arab M, 2017, 2017 3RD INTERNATIONAL CONFERENCE ON WEB RESEARCH (ICWR), P7, DOI 10.1109/ICWR.2017.7959298
[3]  
Aranganayagi S, 2007, ICCIMA 2007 INT C CO, VIII, P13, DOI DOI 10.1109/ICCIMA.2007.328
[4]   A history of the study of solid tumour growth: The contribution of mathematical modelling [J].
Araujo, RP ;
McElwain, DLS .
BULLETIN OF MATHEMATICAL BIOLOGY, 2004, 66 (05) :1039-1091
[5]   Accurate whole human genome sequencing using reversible terminator chemistry [J].
Bentley, David R. ;
Balasubramanian, Shankar ;
Swerdlow, Harold P. ;
Smith, Geoffrey P. ;
Milton, John ;
Brown, Clive G. ;
Hall, Kevin P. ;
Evers, Dirk J. ;
Barnes, Colin L. ;
Bignell, Helen R. ;
Boutell, Jonathan M. ;
Bryant, Jason ;
Carter, Richard J. ;
Cheetham, R. Keira ;
Cox, Anthony J. ;
Ellis, Darren J. ;
Flatbush, Michael R. ;
Gormley, Niall A. ;
Humphray, Sean J. ;
Irving, Leslie J. ;
Karbelashvili, Mirian S. ;
Kirk, Scott M. ;
Li, Heng ;
Liu, Xiaohai ;
Maisinger, Klaus S. ;
Murray, Lisa J. ;
Obradovic, Bojan ;
Ost, Tobias ;
Parkinson, Michael L. ;
Pratt, Mark R. ;
Rasolonjatovo, Isabelle M. J. ;
Reed, Mark T. ;
Rigatti, Roberto ;
Rodighiero, Chiara ;
Ross, Mark T. ;
Sabot, Andrea ;
Sankar, Subramanian V. ;
Scally, Aylwyn ;
Schroth, Gary P. ;
Smith, Mark E. ;
Smith, Vincent P. ;
Spiridou, Anastassia ;
Torrance, Peta E. ;
Tzonev, Svilen S. ;
Vermaas, Eric H. ;
Walter, Klaudia ;
Wu, Xiaolin ;
Zhang, Lu ;
Alam, Mohammed D. ;
Anastasi, Carole .
NATURE, 2008, 456 (7218) :53-59
[6]  
Bharadwaj KK, 2005, PROC WRLD ACAD SCI E, V8, P333
[7]   Hypergraph imaging: an overview [J].
Bretto, A ;
Cherifi, H ;
Aboutajdine, D .
PATTERN RECOGNITION, 2002, 35 (03) :651-658
[8]   Dissecting cancer through mathematics: from the cell to the animal model [J].
Byrne, Helen M. .
NATURE REVIEWS CANCER, 2010, 10 (03) :221-230
[9]   Detection of gene communities in multi-networks reveals cancer drivers [J].
Cantini, Laura ;
Medico, Enzo ;
Fortunato, Santo ;
Caselle, Michele .
SCIENTIFIC REPORTS, 2015, 5
[10]   A comparison of methods accounting for batch effects in differential expression analysis of UMI count based single cell RNA sequencing [J].
Chen, Wenan ;
Zhang, Silu ;
Williams, Justin ;
Ju, Bensheng ;
Shaner, Bridget ;
Easton, John ;
Wu, Gang ;
Chen, Xiang .
COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2020, 18 (18) :861-873