A data structure and function classification based method to evaluate clustering models for gene expression data

被引:0
作者
易东
杨梦苏
黄明辉
李辉智
王文昌
机构
[1] Department of Medical Statistics
[2] Third Military Medical University
[3] Chongqing
[4] China
[5] Applied Research Centre for Genomics Technology
[6] Department of Biology & Chemistry
[7] City University of Hong Kong
[8] Tat Chee Avenue
[9] Kowloon
[10] Hong Kong
[11] Department of Electronic Technology
[12] Southwest University of Politics and Law Science
[13] China
关键词
gene expression; evaluation of clustering; adjust-; FOM; entropy;
D O I
暂无
中图分类号
R311 [医用数学];
学科分类号
1001 ;
摘要
<正> Objective: To establish a systematic framework for selecting the best clustering algorithm and provide an evaluation method for clustering analyses of gene expression data. Methods: Based on data structure (internal information) and function classification (external information), the evaluation of gene expression data analyses were carried out by using 2 approaches. Firstly, to assess the predictive power of clustering algorithms, Entropy was introduced to measure the consistency between the clustering results from different algorithms and the known and validated functional classifications. Secondly, a modified method of figure of merit (adjust-FOM) was used as internal assessment method. In this method, one clustering algorithm was used to analyze all data but one experimental condition, the remaining condition was used to assess the predictive power of the resulting clusters. This method was applied on 3 gene expression data sets (2 from the Lyer's Serum Data Sets, and 1 from the Ferea's Saccharomyces
引用
收藏
页码:312 / 317
页数:6
相关论文
共 50 条
[41]   Context-specific Bayesian clustering for gene expression data [J].
Barash, Y ;
Friedman, N .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2002, 9 (02) :169-191
[42]   Gradual representation of shadowed set for clustering gene expression data [J].
Bose, Ankita ;
Mali, Kalyani .
APPLIED SOFT COMPUTING, 2019, 83
[43]   Dynamic clustering of gene expression data using a fuzzy approach [J].
Sirbu, Adela-Maria ;
Czibula, Gabriela ;
Bocicor, Maria-Iuliana .
16TH INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING (SYNASC 2014), 2014, :220-227
[44]   A Hierarchical Approach for Clustering and Pattern Matching of Gene Expression Data [J].
Hoque, Soriful ;
Istyaq, Salim ;
Riaz, Md Mushir .
2012 SIXTH INTERNATIONAL CONFERENCE ON GENETIC AND EVOLUTIONARY COMPUTING (ICGEC), 2012, :413-416
[45]   Entropy-based method to evaluate the data integrity [J].
Xu Peng ;
Ma Tianyu ;
Jin Yongjie .
NUCLEAR INSTRUMENTS & METHODS IN PHYSICS RESEARCH SECTION A-ACCELERATORS SPECTROMETERS DETECTORS AND ASSOCIATED EQUIPMENT, 2006, 569 (02) :412-415
[46]   A knowledge and data based hybrid approach to gene clustering [J].
Abhishek, K. ;
Karnick, H. ;
Mitra, P. .
PROCEEDINGS OF THE FIFTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE, VOL 1, 2006, :19-+
[47]   Evaluation of clustering algorithms for gene expression data using gene ontology annotations [J].
Ma Ning ;
Zhang Zheng-guo .
CHINESE MEDICAL JOURNAL, 2012, 125 (17) :3048-3052
[48]   Gene Expression Data Analysis Using Feature Weighted Robust Fuzzy -Means Clustering [J].
Singh, Vikas ;
Verma, Nishchal K. K. .
IEEE TRANSACTIONS ON NANOBIOSCIENCE, 2023, 22 (01) :99-105
[49]   Gene selection in a gene decision space with application to gene expression data classification [J].
Wang, Yuxian ;
Li, Zhaowen ;
Zhang, Jie ;
Yu, Guangji .
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 45 (03) :5021-5044
[50]   Bayesian models for gene expression with DNA microarray data [J].
Ibrahim, JG ;
Chen, MH ;
Gray, RJ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2002, 97 (457) :88-99