Classification of mislabelled microarrays using robust sparse logistic regression

被引:31
|
作者
Bootkrajang, Jakramate [1 ]
Kaban, Ata [1 ]
机构
[1] Univ Birmingham, Sch Comp Sci, Birmingham B15 2TT, W Midlands, England
关键词
DISCRIMINANT-ANALYSIS; INITIAL SAMPLES; GENE SELECTION; CANCER;
D O I
10.1093/bioinformatics/btt078
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Previous studies reported that labelling errors are not uncommon in microarray datasets. In such cases, the training set may become misleading, and the ability of classifiers to make reliable inferences from the data is compromised. Yet, few methods are currently available in the bioinformatics literature to deal with this problem. The few existing methods focus on data cleansing alone, without reference to classification, and their performance crucially depends on some tuning parameters. Results: In this article, we develop a new method to detect mislabelled arrays simultaneously with learning a sparse logistic regression classifier. Our method may be seen as a label-noise robust extension of the well-known and successful Bayesian logistic regression classifier. To account for possible mislabelling, we formulate a label-flipping process as part of the classifier. The regularization parameter is automatically set using Bayesian regularization, which not only saves the computation time that cross-validation would take, but also eliminates any unwanted effects of label noise when setting the regularization parameter. Extensive experiments with both synthetic data and real microarray datasets demonstrate that our approach is able to counter the bad effects of labelling errors in terms of predictive performance, it is effective at identifying marker genes and simultaneously it detects mislabelled arrays to high accuracy.
引用
收藏
页码:870 / 877
页数:8
相关论文
共 50 条
  • [41] Random feature selection using random subspace logistic regression
    Wichitaksorn, Nuttanan
    Kang, Yingyue
    Zhang, Faqiang
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 217
  • [42] Comparison of standard maximum likelihood classification and polytomous logistic regression used in remote sensing
    Hogland, John
    Billor, Nedret
    Anderson, Nathaniel
    EUROPEAN JOURNAL OF REMOTE SENSING, 2013, 46 : 623 - 640
  • [43] Sparsity regularization enhances gene selection and leukemia subtype classification via logistic regression
    Mahmood, Nozad Hussein
    Kadir, Dler Hussein
    LEUKEMIA RESEARCH, 2025, 150
  • [44] Regularized logistic regression without a penalty term: An application to cancer classification with microarray data
    Bielza, Concha
    Robles, Victor
    Larranaga, Pedro
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (05) : 5110 - 5118
  • [45] Semisupervised Hyperspectral Image Classification via Discriminant Analysis and Robust Regression
    Cheng, Guangliang
    Zhu, Feiyun
    Xiang, Shiming
    Wang, Ying
    Pan, Chunhong
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2016, 9 (02) : 595 - 608
  • [46] Classification of DNA Microarrays Using Artificial Bee Colony (ABC) Algorithm
    Aurora Garro, Beatriz
    Antonio Vazquez, Roberto
    Rodriguez, Katya
    ADVANCES IN SWARM INTELLIGENCE, PT1, 2014, 8794 : 207 - 214
  • [47] USING SPARSE REGRESSION TO LEARN EFFECTIVE PROJECTIONS FOR FACE RECOGNITION
    Xi, Yongxin Taylor
    Ramadge, Peter J.
    2009 16TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-6, 2009, : 3333 - 3336
  • [48] LogSum+L2 penalized logistic regression model for biomarker selection and cancer classification
    Liu, Xiao-Ying
    Wu, Sheng-Bing
    Zeng, Wen-Quan
    Yuan, Zhan-Jiang
    Xu, Hong-Bo
    SCIENTIFIC REPORTS, 2020, 10 (01)
  • [49] Rank-k 2-D Multinomial Logistic Regression for Matrix Data Classification
    Song, Kun
    Nie, Feiping
    Han, Junwei
    Li, Xuelong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (08) : 3524 - 3537
  • [50] Estimating the causes of traffic accidents using logistic regression and discriminant analysis
    Karacasu, Murat
    Ergul, Baris
    Yavuz, Arzu Altin
    INTERNATIONAL JOURNAL OF INJURY CONTROL AND SAFETY PROMOTION, 2014, 21 (04) : 305 - 312