Semi-supervised classification using sparse representation for cancer recurrence prediction

被引:0
作者
Cui, Yan [1 ]
Cai, Xiaodong [2 ]
Jin, Zhong [1 ]
机构
[1] Nanjing Univ Sci & Technol, Dept Comp Sci & Engr, Nanjing, Jiangsu, Peoples R China
[2] Univ Miami, Dept Elect & Comp Engr, Coral Gables, FL 33146 USA
来源
2013 IEEE INTERNATIONAL WORKSHOP ON GENOMIC SIGNAL PROCESSING AND STATISTICS (GENSIPS 2013) | 2013年
关键词
BREAST-CANCER; MODELS;
D O I
暂无
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Gene expression profiles have been used to predict cancer recurrence or other clinical outcomes of cancer patients. However, clinical information of cancer patients is often incomplete, which yields many unlabeled samples that cannot be used in supervised learning. In this is paper, we develop a novel semi-supervised leaning (SSL) method that uses both labeled and unlabeled patient samples to predict cancer recurrence. Our new SSL algorithm employs a sparse representation approach where a labeled sample is represented as a combination of a small number of properly chosen unlabeled samples. Experiments with a set of gene expression data from patients with colorectal cancer(CRC) demonstrate that our SSL algorithm can improve prediction accuracy compared to other two SSL methods including TSVM and T3VM, and the traditional support vector machine.
引用
收藏
页码:102 / 105
页数:4
相关论文
共 20 条
  • [1] [Anonymous], 2006, BOOK REV IEEE T NEUR
  • [2] [Anonymous], 2017, ELEMENTS STAT LEARNI
  • [3] [Anonymous], 2009, Introduction to semi-supervised learning
  • [4] Semi-supervised methods to predict patient survival from gene expression data
    Bair, E
    Tibshirani, R
    [J]. PLOS BIOLOGY, 2004, 2 (04) : 511 - 522
  • [5] Chapelle O, 2008, J MACH LEARN RES, V9, P203
  • [6] MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM
    DEMPSTER, AP
    LAIRD, NM
    RUBIN, DB
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01): : 1 - 38
  • [7] Regularization Paths for Generalized Linear Models via Coordinate Descent
    Friedman, Jerome
    Hastie, Trevor
    Tibshirani, Rob
    [J]. JOURNAL OF STATISTICAL SOFTWARE, 2010, 33 (01): : 1 - 22
  • [8] Semi-supervised support vector machines for unlabeled data classification
    Fung, G
    Mangasarian, OL
    [J]. OPTIMIZATION METHODS & SOFTWARE, 2001, 15 (01) : 29 - 44
  • [9] Joachims T, 1999, MACHINE LEARNING, PROCEEDINGS, P200
  • [10] Identification of a low-risk subgroup of HER-2-positive breast cancer by the 70-gene prognosis signature
    Knauer, M.
    Cardoso, F.
    Wesseling, J.
    Bedard, P. L.
    Linn, S. C.
    Rutgers, E. J. T.
    van't Veer, L. J.
    [J]. BRITISH JOURNAL OF CANCER, 2010, 103 (12) : 1788 - 1793