Data integration in genomics and systems biology

被引:0
作者
Serra, Angela [1 ]
Fratello, Michele [2 ]
Greco, Dario [3 ]
Tagliaferri, Roberto [1 ]
机构
[1] Univ Salerno, DISA MIS, NeuRoNe Lab, Salerno, Italy
[2] Second Univ Napoli, Dept Med Surg Neurol Metab & Ageing Sci, Naples, Italy
[3] Finnish Inst Occupat Hlth, Unit Syst Toxicol & Nanosafety Res Ctr, Helsinki, Finland
来源
2016 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC) | 2016年
关键词
GENE-EXPRESSION; MICROARRAY DATA; FEATURE-SELECTION; METAANALYSIS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-view learning is the branch of machine learning that deals with multi modal data, i.e. with patterns represented by different sets of features. The fast spread of this learning technique is motivated by the continuing increase of real applications based on multi-view data. For example, in bioinformatics multiple experiments can be available (mRNA, miRNA and protein expression, genome wide association studies (GWAS) and others) for a set of samples. In bioinformatics multi-view approaches are useful since heterogeneous genome-wide data sources capture information on different aspects of complex biological systems. Each view provides a distinct facet of the same domain, encoding different biologically-relevant patterns. The integration of such views can provide a richer model of the underlying system than those produced by a single view alone. This paper provides a review of the literature with respect to bioinformatics, with the purpose to understand the principles and operation modes of the existing methods and their possible applications. In order to organize the proposed methods in literature and to find similarities between them, these approaches are organized according to three categories: the type of data used in the papers, the statistical problem and the stage of integration.
引用
收藏
页码:1272 / 1279
页数:8
相关论文
共 45 条
[1]  
Aggarwal C.C., 2001, SURPRISING BEHAV DIS
[2]  
[Anonymous], ICML WORKSH LEARN MU
[3]  
[Anonymous], 2008, P 2008 SIAM INT C DA
[4]  
[Anonymous], 2002, ADV NEURAL INFORM PR
[5]  
Bickel S., P IEEE, p[19, 204]
[6]   A Meta-analysis of Lung Cancer Gene Expression Identifies PTK7 as a Survival Gene in Lung Adenocarcinoma [J].
Chen, Ron ;
Khatri, Purvesh ;
Mazur, Pawel K. ;
Polin, Melanie ;
Zheng, Yanyan ;
Vaka, Dedeepya ;
Hoang, Chuong D. ;
Shrager, Joseph ;
Xu, Yue ;
Vicent, Silvestre ;
Butte, Atul J. ;
Sweet-Cordero, E. Alejandro .
CANCER RESEARCH, 2014, 74 (10) :2892-2902
[7]   TW-k-Means: Automated Two-Level Variable Weighting Clustering Algorithm for Multiview Data [J].
Chen, Xiaojun ;
Xu, Xiaofei ;
Huang, Joshua Zhexue ;
Ye, Yunming .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2013, 25 (04) :932-944
[8]   Combining multiple microarray studies and modeling interstudy variation [J].
Choi, Jung Kyoon ;
Yu, Ungsik ;
Kim, Sangsoo ;
Yoo, Ook Joon .
BIOINFORMATICS, 2003, 19 :i84-i90
[9]   First InP/InGaAs PNPHBT grown by metal organic chemical vapor deposition [J].
Cui, DL ;
Hsu, S ;
Pavlidis, D .
2001 INTERNATIONAL CONFERENCE ON INDIUM PHOSPHIDE AND RELATED MATERIALS, CONFERENCE PROCEEDINGS, 2001, :224-227
[10]  
DeConde R, 2006, STAT APPL GENET MOL, V5