Discriminant Projection Shared Dictionary Learning for Classification of Tumors Using Gene Expression Data

被引:2
作者
Peng, Shaoliang [1 ]
Yang, Yaning [1 ]
Liu, Wei [1 ]
Li, Fei [2 ]
Liao, Xiangke [3 ]
机构
[1] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha 410082, Peoples R China
[2] Chinese Acad Sci, Comp Network Informat Ctr, Beijing 100190, Peoples R China
[3] Natl Univ Def Technol, Sch Comp Sci, Changsha 410073, Peoples R China
基金
国家重点研发计划;
关键词
Dictionaries; Tumors; Gene expression; Machine learning; Training; Encoding; Face; Tumor classification; gene expression profile; dictionary learning; LINCS; discriminant projection; CONNECTIVITY MAP; CANCER; SELECTION; REPRESENTATION; PREDICTION; SIGNATURES;
D O I
10.1109/TCBB.2019.2950209
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
With a variety of tumor subtypes, personalized treatments need to identify the subtype of a tumor as accurately as possible. The development of DNA microarrays provides an opportunity to predict tumor classification. One strategy is to use gene expression profiling to extend current biological insights into the disease. However, overfitting problems exist in most machine learning methods when classifying tumor gene expression profile data characterized by high dimensional, small samples and nonlinearities. As a new machine learning methods, dictionary learning has become a more effective algorithm for gene expression profile classification. Here, a new method called discriminant projection shared dictionary learning (DPSDL) is proposed for classifying tumor subtypes using LINCS gene expression profile data. The method trains a shared dictionary, embeds Fisher discriminant criteria to obtain a class-specific sub-dictionary and coding coefficients. At the same time, a projection matrix is trained to widen the distance between different classes of samples. Experimental results show that our method performs better classification based on gene expression profile than the other dictionary learning methods and machine learning methods.
引用
收藏
页码:1464 / 1473
页数:10
相关论文
共 46 条
[1]   Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays [J].
Alon, U ;
Barkai, N ;
Notterman, DA ;
Gish, K ;
Ybarra, S ;
Mack, D ;
Levine, AJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (12) :6745-6750
[2]   Exploring the new world of the genome with DNA microarrays [J].
Brown, PO ;
Botstein, D .
NATURE GENETICS, 1999, 21 (Suppl 1) :33-37
[3]   Robust uncertainty principles:: Exact signal reconstruction from highly incomplete frequency information [J].
Candès, EJ ;
Romberg, J ;
Tao, T .
IEEE TRANSACTIONS ON INFORMATION THEORY, 2006, 52 (02) :489-509
[4]   Gene selection in cancer classification using sparse logistic regression with Bayesian regularization [J].
Cawley, Gavin C. ;
Talbot, Nicola L. C. .
BIOINFORMATICS, 2006, 22 (19) :2348-2355
[5]   Genome-Wide Signatures of Transcription Factor Activity: Connecting Transcription Factors, Disease, and Small Molecules [J].
Chen, Jing ;
Hu, Zhen ;
Phatak, Mukta ;
Reichard, John ;
Freudenberg, Johannes M. ;
Sivaganesan, Siva ;
Medvedovic, Mario .
PLOS COMPUTATIONAL BIOLOGY, 2013, 9 (09)
[6]  
Cho S.B., 2003, P 1 AS PAC BIOINF C, V19, P189
[7]  
Dalton L, 2012, IEEE INT WORK GENOM, P164, DOI 10.1109/GENSIPS.2012.6507754
[8]   Improving gene set analysis of microarray data by SAM-GS [J].
Dinu, Irina ;
Potter, John D. ;
Mueller, Thomas ;
Liu, Qi ;
Adewale, Adeniyi J. ;
Jhangri, Gian S. ;
Einecke, Gunilla ;
Famulski, Konrad S. ;
Halloran, Philip ;
Yasui, Yutaka .
BMC BIOINFORMATICS, 2007, 8 (1)
[9]   LINCS Canvas Browser: interactive web app to query, browse and interrogate LINCS L1000 gene expression signatures [J].
Duan, Qiaonan ;
Flynn, Corey ;
Niepel, Mario ;
Hafner, Marc ;
Muhlich, Jeremy L. ;
Fernandez, Nicolas F. ;
Rouillard, Andrew D. ;
Tan, Christopher M. ;
Chen, Edward Y. ;
Golub, Todd R. ;
Sorger, Peter K. ;
Subramanian, Aravind ;
Ma'ayan, Avi .
NUCLEIC ACIDS RESEARCH, 2014, 42 (W1) :W449-W460
[10]   Support vector machine classification and validation of cancer tissue samples using microarray expression data [J].
Furey, TS ;
Cristianini, N ;
Duffy, N ;
Bednarski, DW ;
Schummer, M ;
Haussler, D .
BIOINFORMATICS, 2000, 16 (10) :906-914