The MARK-AGE extended database: data integration and pre-processing

被引:5
作者
Baur, J. [1 ]
Koetter, T. [2 ]
Moreno-Villanueva, M. [1 ]
Sindlinger, T. [1 ]
Berthold, M. R. [2 ]
Buerkle, A. [1 ]
Junk, M. [3 ]
机构
[1] Univ Konstanz, Chair Mol Toxicol, D-78457 Constance, Germany
[2] Univ Konstanz, Chair Bioinformat & Informat Min, D-78457 Constance, Germany
[3] Univ Konstanz, Dept Math & Stat, D-78457 Constance, Germany
关键词
Database; Data entry; Data integration; Data processing; Data extraction; KNIME;
D O I
10.1016/j.mad.2015.05.006
中图分类号
Q2 [细胞生物学];
学科分类号
071009 ; 090102 ;
摘要
MARK-AGE is a recently completed European population study, where bioanalytical and anthropometric data were collected from human subjects at a large scale. To facilitate data analysis and mathematical modelling, an extended database had to be constructed, integrating the data sources that were part of the project. This step involved checking, transformation and documentation of data. The success of downstream analysis mainly depends on the preparation and quality of the integrated data. Here, we present the pre-processing steps applied to the MARK-AGE data to ensure high quality and reliability in the MARK-AGE Extended Database. Various kinds of obstacles that arose during the project are highlighted and solutions are presented. (C) 2015 Elsevier Ireland Ltd. All rights reserved.
引用
收藏
页码:31 / 37
页数:7
相关论文
共 16 条
[1]  
[Anonymous], 2007, STUDIES CLASSIFICATI
[2]  
[Anonymous], SIGFIDET 74 P 1974 A
[3]  
Chapman AD., 2005, Principles and methods of data cleaning, P1
[4]  
Codd E.F., 1970, COMMUN ACM, V13, P6
[5]   The Digital Ageing Atlas: integrating the diversity of age-related changes into a unified resource [J].
Craig, Thomas ;
Smelick, Chris ;
Tacutu, Robi ;
Wuttke, Daniel ;
Wood, Shona H. ;
Stanley, Henry ;
Janssens, Georges ;
Savitskaya, Ekaterina ;
Moskalev, Alexey ;
Arking, Robert ;
de Magalhaes, Joao Pedro .
NUCLEIC ACIDS RESEARCH, 2015, 43 (D1) :D873-D878
[6]   HAGR: the human ageing genomic resources [J].
de Magalhaes, JP ;
Costa, J ;
Toussaint, O .
NUCLEIC ACIDS RESEARCH, 2005, 33 :D537-D543
[7]  
Hellerstein Joseph M., 2008, United Nations Economic Commission for Europe (UNECE), V25, P1
[8]  
Mackey J., 2004, CURRENT PROTOCOLS BI
[9]   HOMEOSTASIS MODEL ASSESSMENT - INSULIN RESISTANCE AND BETA-CELL FUNCTION FROM FASTING PLASMA-GLUCOSE AND INSULIN CONCENTRATIONS IN MAN [J].
MATTHEWS, DR ;
HOSKER, JP ;
RUDENSKI, AS ;
NAYLOR, BA ;
TREACHER, DF ;
TURNER, RC .
DIABETOLOGIA, 1985, 28 (07) :412-419
[10]   Body mass index in children and adolescents: considerations for population-based applications [J].
Must, A ;
Anderson, SE .
INTERNATIONAL JOURNAL OF OBESITY, 2006, 30 (04) :590-594