Multiobjective Simulated Annealing-Based Clustering of Tissue Samples for Cancer Diagnosis

被引:20
作者
Acharya, Sudipta [1 ]
Saha, Sriparna [1 ]
Thadisina, Yamini [1 ]
机构
[1] Indian Inst Technol, Dept Comp Sci & Engn, Patna 800013, Bihar, India
关键词
Archived multiobjective simulated annealing (AMOSA); adjusted rand index (ARI); clustering; %CoA index; gene marker; multiobjective optimization (MOO); GENE-EXPRESSION DATA; ALGORITHM; CLASSIFICATION; OPTIMIZATION; PREDICTION;
D O I
10.1109/JBHI.2015.2404971
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the field of pattern recognition, the study of the gene expression profiles of different tissue samples over different experimental conditions has become feasible with the arrival of microarray-based technology. In cancer research, classification of tissue samples is necessary for cancer diagnosis, which can be done with the help of microarray technology. In this paper, we have presented a multiobjective optimization (MOO)-based clustering technique utilizing archived multiobjective simulated annealing(AMOSA) as the underlying optimization strategy for classification of tissue samples from cancer datasets. The presented clustering technique is evaluated for three open source benchmark cancer datasets [Brain tumor dataset, Adult Malignancy, and Small Round Blood Cell Tumors (SRBCT)]. In order to evaluate the quality or goodness of produced clusters, two cluster quality measures viz, adjusted rand index and classification accuracy (%CoA) are calculated. Comparative results of the presented clustering algorithm with ten state-of-the-art existing clustering techniques are shown for three benchmark datasets. Also, we have conducted a statistical significance test called t-test to prove the superiority of our presented MOO-based clustering technique over other clustering techniques. Moreover, significant gene markers have been identified and demonstrated visually from the clustering solutions obtained. In the field of cancer subtype prediction, this study can have important impact.
引用
收藏
页码:691 / 698
页数:8
相关论文
共 27 条
[1]   Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling [J].
Alizadeh, AA ;
Eisen, MB ;
Davis, RE ;
Ma, C ;
Lossos, IS ;
Rosenwald, A ;
Boldrick, JG ;
Sabet, H ;
Tran, T ;
Yu, X ;
Powell, JI ;
Yang, LM ;
Marti, GE ;
Moore, T ;
Hudson, J ;
Lu, LS ;
Lewis, DB ;
Tibshirani, R ;
Sherlock, G ;
Chan, WC ;
Greiner, TC ;
Weisenburger, DD ;
Armitage, JO ;
Warnke, R ;
Levy, R ;
Wilson, W ;
Grever, MR ;
Byrd, JC ;
Botstein, D ;
Brown, PO ;
Staudt, LM .
NATURE, 2000, 403 (6769) :503-511
[2]  
An Lingling, 2012, ISRN Bioinform, V2012, P537217, DOI 10.5402/2012/537217
[3]  
Bandyopadhyay S., 2012, CLASSICAL METAHEURIS
[4]   A simulated annealing-based multiobjective optimization algorithm: AMOSA [J].
Bandyopadhyay, Sanghamitra ;
Saha, Sriparna ;
Maulik, Ujjwal ;
Deb, Kalyanmoy .
IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2008, 12 (03) :269-283
[5]   An improved algorithm for clustering gene expression data [J].
Bandyopadhyay, Sanghamitra ;
Mukhopadhyay, Anirban ;
Maulik, Ujjwal .
BIOINFORMATICS, 2007, 23 (21) :2859-2865
[6]   Clustering cancer gene expression data: a comparative study [J].
de Souto, Marcilio C. P. ;
Costa, Ivan G. ;
de Araujo, Daniel S. A. ;
Ludermir, Teresa B. ;
Schliep, Alexander .
BMC BIOINFORMATICS, 2008, 9 (1)
[7]   A fast and elitist multiobjective genetic algorithm: NSGA-II [J].
Deb, K ;
Pratap, A ;
Agarwal, S ;
Meyarivan, T .
IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2002, 6 (02) :182-197
[8]   Clustering gene expression data using a diffraction-inspired framework [J].
Dinger, Steven C. ;
Van Wyk, Michael A. ;
Carmona, Sergio ;
Rubin, David M. .
BIOMEDICAL ENGINEERING ONLINE, 2012, 11
[9]  
Golub T., 2002, Google Patents, Patent No. [WO 2002061144 A2, 2002061144]
[10]   Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring [J].
Golub, TR ;
Slonim, DK ;
Tamayo, P ;
Huard, C ;
Gaasenbeek, M ;
Mesirov, JP ;
Coller, H ;
Loh, ML ;
Downing, JR ;
Caligiuri, MA ;
Bloomfield, CD ;
Lander, ES .
SCIENCE, 1999, 286 (5439) :531-537