Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Cited by: 6
Authors
Feng, Xin [1 ,2 ]
Wang, Shaofei [1 ,2 ]
Liu, Quewang [1 ,2 ]
Li, Han [3 ]
Liu, Jiamei [3 ]
Xu, Cheng [3 ]
Yang, Weifeng [3 ]
Shu, Yayun [3 ]
Zheng, Weiwei [1 ,2 ]
Yu, Bingxin [4 ]
Qi, Mingran [5 ]
Zhou, Wenyang [1 ,2 ]
Zhou, Fengfeng [1 ,2 ]
Affiliations
[1] Jilin Univ, Coll Comp Sci & Technol, Changchun, Jilin, Peoples R China
[2] Jilin Univ, Key Lab Symbol Computat & Knowledge Engn, Minist Educ, Changchun, Jilin, Peoples R China
[3] Jilin Univ, Coll Software, Changchun, Jilin, Peoples R China
[4] Jilin Univ, Ultrasonog Dept, China Japan Union Hosp, Changchun, Jilin, Peoples R China
[5] Jilin Univ, Dept Pathogenobiol, Coll Basic Med Sci, Changchun, Jilin, Peoples R China
Source
JOVE-JOURNAL OF VISUALIZED EXPERIMENTS | 2018 / Issue 140
Keywords
Cancer Research; Issue; 140; Biomarker detection; feature selection; OMIC; binary classification; filter; wrapper; extreme learning machine; ELM; ACUTE LYMPHOBLASTIC-LEUKEMIA; GENE-EXPRESSION PROFILE; WIDE ASSOCIATION; CHROMOSOMAL LOCATION; SUSCEPTIBILITY; GENOME; SITES; GPS; INFORMATION; THERAPY;
DOI
10.3791/57738
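Chinese Library Classification
O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences];
Discipline classification codes
07 ; 0710 ; 09 ;
Abstract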
Biomarker detection is one of the more important biomedical questions for high-throughput 'omics' researchers, and almost all existing biomarker detection algorithms generate a single biomarker subset that optimizes the performance measurement for a given dataset. However, a recent study demonstrated the existence of multiple biomarker subsets with similarly effective or even identical classification performances. This protocol presents a simple and straightforward methodology for detecting biomarker subsets whose binary classification performance is better than a user-defined cutoff. The protocol consists of data preparation and loading, baseline information summarization, parameter tuning, biomarker screening, result visualization and interpretation, biomarker gene annotation, and export of results and visualizations at publication quality. The proposed biomarker screening strategy is intuitive and demonstrates a general rule for developing biomarker detection algorithms. A user-friendly graphical user interface (GUI) was developed in the Python programming language, giving biomedical researchers direct access to their results. The source code and manual of kSolutionVis can be downloaded from http://www.healthinformaticslab.org/supp/resources.php.
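The idea of keeping every biomarker subset that clears a user-defined cutoff, rather than only the single best one, can be illustrated with a minimal Python sketch. This is not the kSolutionVis implementation: the function names `loo_accuracy` and `screen_subsets`, the nearest-centroid classifier, the exhaustive enumeration of subsets, and the toy data are all assumptions chosen for brevity.

```python
from itertools import combinations


def loo_accuracy(X, y, features):
    """Leave-one-out accuracy of a nearest-centroid classifier (an assumed,
    illustrative classifier) restricted to the given feature indices."""
    correct = 0
    for i in range(len(X)):
        # Build per-class centroids from every sample except the held-out one.
        sums, counts = {}, {}
        for j, (row, label) in enumerate(zip(X, y)):
            if j == i:
                continue
            acc = sums.setdefault(label, [0.0] * len(features))
            for k, f in enumerate(features):
                acc[k] += row[f]
            counts[label] = counts.get(label, 0) + 1

        # Squared Euclidean distance from the held-out sample to a centroid.
        def dist(label):
            return sum((X[i][f] - sums[label][k] / counts[label]) ** 2
                       for k, f in enumerate(features))

        predicted = min(sums, key=dist)
        correct += predicted == y[i]
    return correct / len(X)


def screen_subsets(X, y, subset_size, cutoff):
    """Return every feature subset of the given size whose accuracy >= cutoff,
    sorted from best to worst."""
    hits = []
    for features in combinations(range(len(X[0])), subset_size):
        acc = loo_accuracy(X, y, features)
        if acc >= cutoff:
            hits.append((features, acc))
    return sorted(hits, key=lambda h: -h[1])


# Toy data (hypothetical): features 0-2 separate the classes, feature 3 is noise.
X = [
    [0.1, 0.2, 0.0, 0.9],
    [0.2, 0.1, 0.1, 0.1],
    [0.0, 0.3, 0.2, 0.8],
    [0.3, 0.0, 0.1, 0.2],
    [0.9, 0.8, 1.0, 0.9],
    [1.0, 0.9, 0.8, 0.1],
    [0.8, 1.0, 0.9, 0.8],
    [0.7, 0.9, 1.1, 0.2],
]
y = [0, 0, 0, 0, 1, 1, 1, 1]

cutoff = 1.0
pairs = screen_subsets(X, y, subset_size=2, cutoff=cutoff)
singles = screen_subsets(X, y, subset_size=1, cutoff=cutoff)
for features, acc in pairs:
    print(features, acc)
```

On this toy data several different feature pairs reach the cutoff, which mirrors the observation in the abstract that multiple subsets can perform equally well. Exhaustive enumeration is only feasible for a handful of features; on real 'omics' data a practical tool would need to narrow the candidates first, for example with a filter ranking.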
Pages: 16