Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

被引:6
作者
Feng, Xin [1 ,2 ]
Wang, Shaofei [1 ,2 ]
Liu, Quewang [1 ,2 ]
Li, Han [3 ]
Liu, Jiamei [3 ]
Xu, Cheng [3 ]
Yang, Weifeng [3 ]
Shu, Yayun [3 ]
Zheng, Weiwei [1 ,2 ]
Yu, Bingxin [4 ]
Qi, Mingran [5 ]
Zhou, Wenyang [1 ,2 ]
Zhou, Fengfeng [1 ,2 ]
机构
[1] Jilin Univ, Coll Comp Sci & Technol, Changchun, Jilin, Peoples R China
[2] Jilin Univ, Key Lab Symbol Computat & Knowledge Engn, Minist Educ, Changchun, Jilin, Peoples R China
[3] Jilin Univ, Coll Software, Changchun, Jilin, Peoples R China
[4] Jilin Univ, Ultrasonog Dept, China Japan Union Hosp, Changchun, Jilin, Peoples R China
[5] Jilin Univ, Dept Pathogenobiol, Coll Basic Med Sci, Changchun, Jilin, Peoples R China
来源
JOVE-JOURNAL OF VISUALIZED EXPERIMENTS | 2018年 / 140期
关键词
Cancer Research; Issue; 140; Biomarker detection; feature selection; OMIC; binary classification; filter; wrapper; extreme learning machine; ELM; ACUTE LYMPHOBLASTIC-LEUKEMIA; GENE-EXPRESSION PROFILE; WIDE ASSOCIATION; CHROMOSOMAL LOCATION; SUSCEPTIBILITY; GENOME; SITES; GPS; INFORMATION; THERAPY;
D O I
10.3791/57738
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Biomarker detection is one of the more important biomedical questions for high-throughput 'omics' researchers, and almost all existing biomarker detection algorithms generate one biomarker subset with the optimized performance measurement for a given dataset. However, a recent study demonstrated the existence of multiple biomarker subsets with similarly effective or even identical classification performances. This protocol presents a simple and straightforward methodology for detecting biomarker subsets with binary classification performances, better than a user-defined cutoff. The protocol consists of data preparation and loading, baseline information summarization, parameter tuning, biomarker screening, result visualization and interpretation, biomarker gene annotations, and result and visualization exportation at publication quality. The proposed biomarker screening strategy is intuitive and demonstrates a general rule for developing biomarker detection algorithms. A user-friendly graphical user interface (GUI) was developed using the programming language Python, allowing biomedical researchers to have direct access to their results. The source code and manual of kSolutionVis can be downloaded from http://www.healthinformaticslab.org/supp/resources.php.
引用
收藏
页数:16
相关论文
共 66 条
  • [1] OMIM.org: Online Mendelian Inheritance in Man (OMIM®), an online catalog of human genes and genetic disorders
    Amberger, Joanna S.
    Bocchini, Carol A.
    Schiettecatte, Francois
    Scott, Alan F.
    Hamosh, Ada
    [J]. NUCLEIC ACIDS RESEARCH, 2015, 43 (D1) : D789 - D798
  • [2] Novel target genes and a valid biomarker panel identified for cholangiocarcinoma
    Andresen, Kim
    Boberg, Kirsten Muri
    Vedeld, Hege Marie
    Honne, Hilde
    Hektoen, Merete
    Wadsworth, Christopher A.
    Clausen, Ole Petter
    Karlsen, Tom Hemming
    Foss, Aksel
    Mathisen, Oystein
    Schrumpf, Erik
    Lothe, Ragnhild A.
    Lind, Guro E.
    [J]. EPIGENETICS, 2012, 7 (11) : 1249 - 1257
  • [3] Photoanthropometric face iridial proportions for age estimation: An investigation using features selected via a joint mutual information criterion
    Borges, Dibio L.
    Vidal, Flavio B.
    Flores, Marta R. P.
    Melani, Rodolfo F. H.
    Guimaraes, Marco A.
    Machado, Carlos E. P.
    [J]. FORENSIC SCIENCE INTERNATIONAL, 2018, 284 : 9 - 14
  • [4] Boutet E, 2016, METHODS MOL BIOL, V1374, P23, DOI 10.1007/978-1-4939-3167-5_2
  • [5] Epstein-Barr virus-negative boys with non-Hodgkin lymphoma are mutated in the SH2D1A gene, as are patients with X-linked lymphoproliferative disease (XLP)
    Brandau, O
    Schuster, V
    Weiss, M
    Hellebrand, H
    Fink, FM
    Kreczy, A
    Friedrich, W
    Strahm, B
    Niemeyer, C
    Belohradsky, BH
    Meindl, A
    [J]. HUMAN MOLECULAR GENETICS, 1999, 8 (13) : 2407 - 2413
  • [6] BURNETT RC, 1994, BLOOD, V84, P1232
  • [7] Gene expression profile of adult T-cell acute lymphocytic leukemia identifies distinct subsets of patients with different response to therapy and survival
    Chiaretti, S
    Li, XC
    Gentleman, R
    Vitale, A
    Vignetti, M
    Mandelli, F
    Ritz, J
    Foa, R
    [J]. BLOOD, 2004, 103 (07) : 2771 - 2778
  • [8] Gene Expression Profiling of Colorectal Tumors and Normal Mucosa by Microarrays Meta-Analysis Using Prediction Analysis of Microarray, Artificial Neural Network, Classification, and Regression Trees
    Chu, Chi-Ming
    Yao, Chung-Tay
    Chang, Yu-Tien
    Chou, Hsiu-Ling
    Chou, Yu-Ching
    Chen, Kang-Hua
    Terng, Harn-Jing
    Huang, Chi-Shuan
    Lee, Chia-Cheng
    Su, Sui-Lun
    Liu, Yao-Chi
    Lin, Fu-Gong
    Wetter, Thomas
    Chang, Chi-Wen
    [J]. DISEASE MARKERS, 2014, 2014
  • [9] A methylome-wide mQTL analysis reveals associations of methylation sites with GAD1 and HDAC3 SNPs and a general psychiatric risk score
    Ciuculete, D. M.
    Bostrom, A. E.
    Voisin, S.
    Philipps, H.
    Titova, O. E.
    Bandstein, M.
    Nikontovic, L.
    Williams, M. J.
    Mwinyi, J.
    Schioth, H. B.
    [J]. TRANSLATIONAL PSYCHIATRY, 2017, 7 : e1002 - e1002
  • [10] Coghe G, 2018, J NEUROLOGY