A Convex Multi-view Low-Rank Sparse Regression for Feature Selection and Clustering

Cited by: 0

Authors:

Lu, Yao ^{[1]}

Liu, Jin-Xing ^{[1]}

Kong, Xiang-Zhen ^{[1]}

Shang, Jun-Liang ^{[1]}

Affiliations:

[1] Qufu Normal Univ, Sch Informat Sci & Engn, Rizhao, Peoples R China

Source:

2017 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM) | 2017

Keywords:

Cluster; Feature selection; L2,1-norm constraint; Multi-view data; Regression analysis; Trace norm constraint; OPEN-SOURCE SOFTWARE

DOI:

Not available

Chinese Library Classification (CLC) number:

TP39 [Computer applications];

Discipline classification codes:

081203 ; 0835 ;

Abstract:

Many real-world problems, such as multi-omics analysis, involve multi-view data with high dimensionality and small sample sizes. Combining multiple views is expected to yield more biologically meaningful results. However, multi-view data often contain noise and outlying entries that lead to inaccurate and unreliable results, so effective methods for analyzing such data are urgently needed. We propose a novel convex multi-view low-rank sparse regression (CMLSR) algorithm for clustering and feature selection. The model is constructed by imposing L2,1-norm and trace norm constraints on the regularization functions, which diminishes the impact of noise and outliers and produces more precise results. Clustering quality is determined by both the sparse constraint and the low-rank constraint. Finally, characteristic genes are selected based on the projection matrix. The method is applied to TCGA multi-view gene expression data sets, with annotation according to Gene Ontology (GO). Comparison with existing methods demonstrates the effectiveness of the proposed algorithm.
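The two regularizers named in the abstract have standard definitions, which the short NumPy sketch below illustrates. It is not the CMLSR algorithm itself. The function names, the toy matrix `W`, and the assumption that each row of the projection matrix corresponds to one feature (gene) are all hypothetical. Ranking features by row norm is a common convention for L2,1-regularized models, and the record does not state that the paper uses exactly this rule.

```python
import numpy as np

def l21_norm(W):
    """L2,1-norm: sum of the Euclidean norms of the rows of W."""
    return float(np.linalg.norm(W, axis=1).sum())

def trace_norm(W):
    """Trace (nuclear) norm: sum of the singular values of W."""
    return float(np.linalg.svd(W, compute_uv=False).sum())

def rank_features(W):
    """Feature indices sorted by decreasing row norm of W.

    Assumption: row i of W holds the coefficients of feature i, so a
    row driven to zero by the L2,1 penalty marks an unselected feature.
    """
    row_norms = np.linalg.norm(W, axis=1)
    return [int(i) for i in np.argsort(-row_norms, kind="stable")]

# Toy projection matrix (hypothetical): 3 features, 2 output dimensions.
W = np.array([[3.0, 4.0],
              [0.0, 0.0],
              [1.0, 0.0]])

print(l21_norm(W))       # row norms are 5, 0 and 1
print(trace_norm(W))     # sum of singular values
print(rank_features(W))  # most to least important feature index
```

The L2,1-norm encourages entire rows to vanish, which gives sparsity at the feature level, while the trace norm is the usual convex surrogate for matrix rank. This is why a model combining the two is described as both sparse and low-rank.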


Pages: 2183-2186

Page count: 4

References: 16 in total

[1] [Anonymous], J COMPUTATIONAL GRAP

[2] [Anonymous], MULTIVIEW CLUSTERING

[3] GO::TermFinder - open source software for accessing Gene Ontology information and finding significantly enriched Gene Ontology terms associated with a list of genes [J].

Boyle, EI; Weng, SA; Gollub, J; Jin, H; Botstein, D; Cherry, JM; Sherlock, G.

BIOINFORMATICS, 2004, 20 (18) :3710-3715

[4] Fazel M, 2002, P AM CONTR C, V6, P4734

[5] Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring [J].

Golub, TR; Slonim, DK; Tamayo, P; Huard, C; Gaasenbeek, M; Mesirov, JP; Coller, H; Loh, ML; Downing, JR; Caligiuri, MA; Bloomfield, CD; Lander, ES.

SCIENCE, 1999, 286 (5439) :531-537

[6] Feature selection: Evaluation, application, and small sample performance [J].

Jain, A; Zongker, D.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1997, 19 (02) :153-158

[7] A joint-L2,1-norm-constraint-based semi-supervised feature extraction for RNA-Seq data analysis [J].

Liu, Jin-Xing; Wang, Dong; Gao, Ying-Lian; Zheng, Chun-Hou; Shang, Jun-Liang; Liu, Feng; Xu, Yong.

NEUROCOMPUTING, 2017, 228 :263-269

[8] Robust PCA based method for discovering differentially expressed genes [J].

Liu, Jin-Xing; Wang, Yu-Tian; Zheng, Chun-Hou; Sha, Wen; Mi, Jian-Xun; Xu, Yong.

BMC BIOINFORMATICS, 2013, 14

[9] Characteristic Gene Selection via Weighting Principal Components by Singular Values [J].

Liu, Jin-Xing; Xu, Yong; Zheng, Chun-Hou; Wang, Yi; Yang, Jing-Yu.

PLOS ONE, 2012, 7 (07)

[10] Nie F., 2010, ADV NEURAL INFORM PR, P1813
