Effective cancer subtyping by employing density peaks clustering by using gene expression microarray

被引:12
作者
Mehmood, Rashid [1 ]
El-Ashram, Saeed [2 ]
Bie, Rongfang [1 ]
Sun, Yunchuan [3 ]
机构
[1] Beijing Normal Univ, Coll Informat Sci & Technol, Beijing 100875, Peoples R China
[2] Kafr El Sheikh Univ, Fac Sci, Kafr Al Sheikh, Egypt
[3] Beijing Normal Univ, Sch Business, Beijing 100875, Peoples R China
基金
中国国家自然科学基金;
关键词
Gene expression microarray; Data mining; Clustering; Density peaks; FAST SEARCH; FIND;
D O I
10.1007/s00779-018-1112-y
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Discovering the similar groups is a popular primary step in analysis of biomedical data, which cannot be identified manually. Many supervised and unsupervised machine learning and statistical approaches have been developed to solve this problem. Clustering is an unsupervised learning approach, which organizes the data into similar groups, and is used to discover the intrinsic hidden structure of data. In this paper, we used clustering by fast search and find of density peaks (CDP) approach for cancer subtyping and identification of normal tissues from tumor tissues. In additional, we also address the preprocessing and underlying distance matrix's impact on finalized groups. We have performed extensive experiments on real-world and synthetic cancer gene expression microarray data sets and compared obtained results with state-of-the-art clustering approaches.
引用
收藏
页码:615 / 619
页数:5
相关论文
共 18 条
[1]   Adaptive fuzzy clustering by fast search and find of density peaks [J].
Bie, Rongfang ;
Mehmood, Rashid ;
Ruan, Shanshan ;
Sun, Yunchuan ;
Dawood, Hussain .
PERSONAL AND UBIQUITOUS COMPUTING, 2016, 20 (05) :785-793
[2]   Selecting dissimilar genes for multi-class classification, an application in cancer subtyping [J].
Cai, Zhipeng ;
Goebel, Randy ;
Salavatipour, Mohammad R. ;
Lin, Guohui .
BMC BIOINFORMATICS, 2007, 8 (1)
[3]  
Cai Zhipeng, 2006, Journal of Bioinformatics and Computational Biology, V4, P935, DOI 10.1142/S0219720006002302
[4]   Patterns of resistance and incomplete response to docetaxel by gene expression profiling in breast cancer patients [J].
Chang, JC ;
Wooten, EC ;
Tsimelzon, A ;
Hilsenbeck, SG ;
Gutierrez, MC ;
Tham, YL ;
Kalidas, M ;
Elledge, R ;
Mohsin, S ;
Osborne, CK ;
Chamness, GC ;
Allred, DC ;
Lewis, MT ;
Wong, H ;
O'Connell, P .
JOURNAL OF CLINICAL ONCOLOGY, 2005, 23 (06) :1169-1177
[5]   Score normalization in multimodal biometric systems [J].
Jain, A ;
Nandakumar, K ;
Ross, A .
PATTERN RECOGNITION, 2005, 38 (12) :2270-2285
[6]   HIERARCHICAL CLUSTERING SCHEMES [J].
JOHNSON, SC .
PSYCHOMETRIKA, 1967, 32 (03) :241-254
[7]   The self-organizing map [J].
Kohonen, T .
NEUROCOMPUTING, 1998, 21 (1-3) :1-6
[8]   Transformation from committed progenitor to leukaemia stem cell initiated by MLL-AF9 [J].
Krivtsov, Andrei V. ;
Twomey, David ;
Feng, Zhaohui ;
Stubbs, Matthew C. ;
Wang, Yingzi ;
Faber, Joerg ;
Levine, Jason E. ;
Wang, Jing ;
Hahn, William C. ;
Gilliland, D. Gary ;
Golub, Todd R. ;
Armstrong, Scott A. .
NATURE, 2006, 442 (7104) :818-822
[9]  
MacQueen J., 1967, BERK S MATH STAT PRO, V5, P281, DOI DOI 10.1007/S11665-016-2173-6
[10]   Clustering by fast search and merge of local density peaks for gene expression microarray data [J].
Mehmood, Rashid ;
El-Ashram, Saeed ;
Bie, Rongfang ;
Dawood, Hussain ;
Kos, Anton .
SCIENTIFIC REPORTS, 2017, 7