TV-DCT: METHOD TO IMPUTE GENE EXPRESSION DATA USING DCT BASED SPARSITY AND TOTAL VARIATION DENOISING

被引:0
作者
Farswan, Akanksha [1 ]
Gupta, Anubha [1 ]
机构
[1] IIIT Delhi, Dept ECE, Signal Proc & Biomed Imaging Lab SBILab, New Delhi, India
来源
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2019年
关键词
Gene expression data; matrix imputation; sparse recovery; machine learning; cancer treatment; MISSING VALUE IMPUTATION; CLASSIFICATION; PREDICTION; ALGORITHM; MODEL;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Most of the bioinformatics tools used in the analysis of gene expression data require complete data matrices. Missing values in data can adversely influence the downstream analysis for diagnostics and treatment. Several methods to impute missing values in gene data have been developed. However, most of these work at high levels of observability. In this paper, we have proposed a novel 2-stage method, namely, TV-DCT for imputing incomplete gene expression matrices using Total Variation denoising and Discrete Cosine Transform Domain Sparsity (TV-DCT) that achieves smaller imputation errors, consistently, at all levels of observability. The proposed method has been compared with three state-of-the-art matrix completion methods on three different cancer datasets and is observed to perform better. The validation of imputed data has been demonstrated on the application of classification.
引用
收藏
页码:1244 / 1248
页数:5
相关论文
共 26 条
[1]   Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling [J].
Alizadeh, AA ;
Eisen, MB ;
Davis, RE ;
Ma, C ;
Lossos, IS ;
Rosenwald, A ;
Boldrick, JG ;
Sabet, H ;
Tran, T ;
Yu, X ;
Powell, JI ;
Yang, LM ;
Marti, GE ;
Moore, T ;
Hudson, J ;
Lu, LS ;
Lewis, DB ;
Tibshirani, R ;
Sherlock, G ;
Chan, WC ;
Greiner, TC ;
Weisenburger, DD ;
Armitage, JO ;
Warnke, R ;
Levy, R ;
Wilson, W ;
Grever, MR ;
Byrd, JC ;
Botstein, D ;
Brown, PO ;
Staudt, LM .
NATURE, 2000, 403 (6769) :503-511
[2]  
[Anonymous], 2010, White paper
[3]  
[Anonymous], 2007, SPGL1 SOLVER LARGE S
[4]  
[Anonymous], 2016, ADV NEURAL INFORM PR
[5]  
[Anonymous], 2015, ROBUST LOW RANK SPAR
[6]   A hybrid method for imputation of missing values using optimized fuzzy c-means with support vector regression and a genetic algorithm [J].
Aydilek, Ibrahim Berkan ;
Arslan, Ahmet .
INFORMATION SCIENCES, 2013, 233 :25-35
[7]   A Fast Iterative Shrinkage-Thresholding Algorithm for Linear Inverse Problems [J].
Beck, Amir ;
Teboulle, Marc .
SIAM JOURNAL ON IMAGING SCIENCES, 2009, 2 (01) :183-202
[8]   Classification of human lung carcinomas by mRNA expression profiling reveals distinct adenocarcinoma subclasses [J].
Bhattacharjee, A ;
Richards, WG ;
Staunton, J ;
Li, C ;
Monti, S ;
Vasa, P ;
Ladd, C ;
Beheshti, J ;
Bueno, R ;
Gillette, M ;
Loda, M ;
Weber, G ;
Mark, EJ ;
Lander, ES ;
Wong, W ;
Johnson, BE ;
Golub, TR ;
Sugarbaker, DJ ;
Meyerson, M .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2001, 98 (24) :13790-13795
[9]   Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring [J].
Golub, TR ;
Slonim, DK ;
Tamayo, P ;
Huard, C ;
Gaasenbeek, M ;
Mesirov, JP ;
Coller, H ;
Loh, ML ;
Downing, JR ;
Caligiuri, MA ;
Bloomfield, CD ;
Lander, ES .
SCIENCE, 1999, 286 (5439) :531-537
[10]  
Gupta Anubha, 2018, J FRANKLIN I