Data integration in the era of omics: current and future challenges

Cited by: 229
Authors
Gomez-Cabrero, David [1, 2]
Abugessaisa, Imad [1, 2]
Maier, Dieter [3]
Teschendorff, Andrew [4]
Merkenschlager, Matthias [5]
Gisel, Andreas [6]
Ballestar, Esteban [7]
Bongcam-Rudloff, Erik [8]
Conesa, Ana [9]
Tegner, Jesper [1, 2]
Affiliations
[1] Karolinska Inst, Dept Med, Ctr Mol Med, Unit Computat Med, Stockholm, Sweden
[2] Karolinska Univ Hosp, Stockholm, Sweden
[3] Biomax Informat AG, Munich, Germany
[4] UCL, UCL Canc Inst, Ctr Math & Phys Life Sci & Expt Biol, London WC1E 6BT, England
[5] Univ London Imperial Coll Sci Technol & Med, Lymphocyte Dev Grp, MRC Clin Sci Ctr, London W12 0NN, England
[6] Ist Tecnol Biomed CNR, Unita Org, I-70126 Bari, Italy
[7] Bellvitge Biomed Res Inst IDIBELL, Canc Epigenet & Biol Program PEBC, Chromatin & Dis Grp, Barcelona, Spain
[8] Swedish Univ Agr Sci, SLU Global Bioinformat Ctr, Dept Anim Breeding & Genet, Uppsala, Sweden
[9] Ctr Invest Principe Felipe, Computat Genom Program, Valencia, Spain
Keywords
GENE-EXPRESSION; DATA SETS; METAANALYSIS; KNOWLEDGE; GENOMICS; NETWORK; DISCOVERY; NATION; GROWTH;
DOI
10.1186/1752-0509-8-S2-I1
Chinese Library Classification (CLC) number
Q [Biological Sciences];
Discipline classification code
07; 0710; 09;
Abstract
Integrating large, heterogeneous omics data is not only a conceptual challenge but also a practical hurdle in the daily analysis of omics data. With the rise of novel omics technologies and large-scale consortia projects, biological systems are being investigated at an unprecedented scale, generating heterogeneous and often large data sets. These data sets encourage researchers to develop novel data integration methodologies. In this introduction we review the definition of data integration and characterize current data integration efforts in the life sciences. We used a web survey to assess current research projects on data integration and to tap into the views, needs and challenges currently perceived by parts of the research community.
Pages: 10