Missing value imputation strategies for metabolomics data
Cited by: 122
Authors:
Grace Armitage, Emily
Godzien, Joanna
Alonso-Herranz, Vanesa
Lopez-Gonzalvez, Angeles
Barbas, Coral
Affiliation (all authors):
[1] Univ CEU San Pablo, Fac Farm, Ctr Metabol & Bioanal CEMBIO, Madrid 28668, Spain
Keywords:
CE-MS;
Data;
False-discovery rate;
Imputation;
k-nearest neighbour;
Metabolomics;
Missing values;
DOI:
10.1002/elps.201500352
Chinese Library Classification number:
Q5 [Biochemistry];
Discipline classification codes:
071010;
081704;
Abstract:
The origin of missing values can be caused by different reasons and depending on these origins missing values should be considered differently and dealt with in different ways. In this research, four methods of imputation have been compared with respect to revealing their effects on the normality and variance of data, on statistical significance and on the approximation of a suitable threshold to accept missing data as truly missing. Additionally, the effects of different strategies for controlling familywise error rate or false discovery and how they work with the different strategies for missing value imputation have been evaluated. Missing values were found to affect normality and variance of data and k-means nearest neighbour imputation was the best method tested for restoring this. Bonferroni correction was the best method for maximizing true positives and minimizing false positives and it was observed that as low as 40% missing data could be truly missing. The range between 40 and 70% missing values was defined as a "gray area" and therefore a strategy has been proposed that provides a balance between the optimal imputation strategy that was k-means nearest neighbor and the best approximation of positioning real zeros.
Pages: 3050-3060
Number of pages: 11
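Illustrative sketch (not taken from the paper): the abstract names k-nearest-neighbour imputation and a 40-70% "gray area" of missingness but gives no implementation details. The Python below is one minimal reading of those ideas. The function names, the distance measure, the choice of k and the exact handling of the 40% and 70% boundaries are all assumptions made for illustration, and the labels returned by classify_feature are hypothetical.

```python
import math


def missing_fraction(column):
    """Fraction of entries in one metabolite column that are missing (None)."""
    return sum(value is None for value in column) / len(column)


def classify_feature(fraction, low=0.40, high=0.70):
    """Assumed decision rule built on the thresholds quoted in the abstract.

    Below `low` the feature is imputed; between `low` and `high` it falls in
    the "gray area"; above `high` it is treated as truly missing. The boundary
    handling is an assumption, not the authors' published rule.
    """
    if fraction < low:
        return "impute"
    if fraction <= high:
        return "gray area"
    return "treat as truly missing"


def knn_impute(rows, k=2):
    """Fill None entries with the mean of the k nearest samples.

    rows: list of samples, each a list of feature values (float or None).
    Distance between two samples is the root mean squared difference over
    the features observed in both (an assumed, simple choice).
    The input is not modified; a filled copy is returned.
    """
    filled = [list(row) for row in rows]
    for i, row in enumerate(rows):
        for j, value in enumerate(row):
            if value is not None:
                continue
            candidates = []
            for m, other in enumerate(rows):
                # A neighbour must be a different sample with feature j observed.
                if m == i or other[j] is None:
                    continue
                shared = [
                    (a - b) ** 2
                    for a, b in zip(row, other)
                    if a is not None and b is not None
                ]
                if shared:
                    distance = math.sqrt(sum(shared) / len(shared))
                    candidates.append((distance, other[j]))
            candidates.sort(key=lambda pair: pair[0])
            nearest = [neighbour_value for _, neighbour_value in candidates[:k]]
            if nearest:
                filled[i][j] = sum(nearest) / len(nearest)
    return filled


if __name__ == "__main__":
    # Toy data: 4 samples x 3 metabolites, one missing value.
    data = [
        [1.0, 2.0, 3.0],
        [1.1, 2.1, None],
        [5.0, 6.0, 7.0],
        [1.0, 2.0, 3.2],
    ]
    for index, column in enumerate(zip(*data)):
        fraction = missing_fraction(column)
        print(index, fraction, classify_feature(fraction))
    print(knn_impute(data, k=2))
```

In this toy data set the missing value is filled from the two samples that are closest on the two metabolites observed in both; a real study would tune k and the distance measure to the data.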
Related papers
23 in total
[1] Bijlsma, S; Bobeldijk, L; Verheij, ER; Ramaker, R; Kochhar, S; Macdonald, IA; van Ommen, B; Smilde, AK. Large-scale human metabolomics studies: A strategy for data (pre-) processing and validation [J]. ANALYTICAL CHEMISTRY, 2006, 78 (02): 567-574
Affiliation (all authors, as listed): TNO, Business Unit Analyt Sci, NL-3700 AJ Zeist, Netherlands
[2] Dallmann, Robert; Viola, Antoine U.; Tarokh, Leila; Cajochen, Christian; Brown, Steven A. The human circadian metabolome [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2012, 109 (07): 2625-2629
Affiliations (as listed):
Dallmann, Robert; Tarokh, Leila; Brown, Steven A.: Univ Zurich, Chronobiol & Sleep Res Grp, Inst Pharmacol & Toxicol, CH-8057 Zurich, Switzerland
Viola, Antoine U.: Univ Basel, Hosp Psychiat, Ctr Chronobiol, CH-4012 Basel, Switzerland
[3] Denkert, Carsten; Budczies, Jan; Kind, Tobias; Weichert, Wilko; Tablack, Peter; Sehouli, Jalid; Niesporek, Silvia; Koensgen, Dominique; Dietel, Manfred; Fiehn, Oliver. Mass spectrometry-based metabolic profiling reveals different metabolite patterns in invasive ovarian carcinomas and ovarian borderline tumors [J]. CANCER RESEARCH, 2006, 66 (22): 10795-10804
Affiliation (all authors, as listed): Charite, Inst Pathol, Berlin, Germany
[4] Godzien, Joanna; Alonso-Herranz, Vanesa; Barbas, Coral; Grace Armitage, Emily. Controlling the quality of metabolomics data: new strategies to get the best out of the QC sample [J]. METABOLOMICS, 2015, 11 (03): 518-528
Affiliation (all authors, as listed): Univ CEU San Pablo, Fac Farm, Ctr Metabol & Bioanal CEMBIO, Madrid 28668, Spain
[5] Gromski, Piotr S.; Xu, Yun; Kotze, Helen L.; Correa, Elon; Ellis, David I.; Armitage, Emily Grace; Turner, Michael L.; Goodacre, Royston. Influence of Missing Values Substitutes on Multivariate Analysis of Metabolomics Data [J]. METABOLITES, 2014, 4 (02): 433-452
Affiliations (as listed):
Gromski, Piotr S.; Xu, Yun; Kotze, Helen L.; Correa, Elon; Ellis, David I.; Armitage, Emily Grace; Goodacre, Royston: Univ Manchester, Manchester Inst Biotechnol, Sch Chem, 131 Princess St, Manchester M1 7DN, Lancs, England
Turner, Michael L.: Univ Manchester, Sch Chem, Manchester M13 9PL, Lancs, England
[6] Huan, Tao; Li, Liang. Counting Missing Values in a Metabolite-Intensity Data Set for Measuring the Analytical Performance of a Metabolomics Platform [J]. ANALYTICAL CHEMISTRY, 2015, 87 (02): 1306-1313
Affiliation (all authors, as listed): Univ Alberta, Dept Chem, Edmonton, AB T6G 2G2, Canada
[7] Jankevics, Andris; Merlo, Maria Elena; de Vries, Marcel; Vonk, Roel J.; Takano, Eriko; Breitling, Rainer. Separating the wheat from the chaff: a prioritisation pipeline for the analysis of metabolomics datasets [J]. METABOLOMICS, 2012, 8 (01): S29-S36
Affiliations (as listed):
Jankevics, Andris; Breitling, Rainer: Univ Glasgow, Inst Mol Cell & Syst Biol, Coll Med Vet & Life Sci, Glasgow G11 8QQ, Lanark, Scotland; Univ Groningen, Groningen Bioinformat Ctr, Groningen Biomol Sci & Biotechnol Inst, NL-9747 AG Groningen, Netherlands
de Vries, Marcel; Vonk, Roel J.: Univ Med Ctr Groningen, Ctr Med Biom, NL-9713 AV Groningen, Netherlands
Takano, Eriko: Univ Glasgow, Inst Mol Cell & Syst Biol, Coll Med Vet & Life Sci, Glasgow G11 8QQ, Lanark, Scotland
[8] Jansen, JJ; Hoefsloot, HCJ; Boelens, HFM; van der Greef, J; Smilde, AK. Analysis of longitudinal metabolomics data [J]. BIOINFORMATICS, 2004, 20 (15): 2438-2446
Affiliation (all authors, as listed): Univ Amsterdam, Fac Sci, NL-1018 WV Amsterdam, Netherlands
[9] Kirwan, Jennifer A.; Weber, Ralf J. M.; Broadhurst, David I.; Viant, Mark R. Direct infusion mass spectrometry metabolomics dataset: a benchmark for data processing and quality control [J]. SCIENTIFIC DATA, 2014, 1
Affiliations (as listed):
Kirwan, Jennifer A.; Weber, Ralf J. M.: Univ Birmingham, Sch Biosci, Birmingham B15 2TT, W Midlands, England
Broadhurst, David I.: Univ Alberta, Dept Med, Edmonton, AB T6G 2E1, Canada
Viant, Mark R.: Univ Birmingham, Sch Biosci, Birmingham B15 2TT, W Midlands, England; Univ Birmingham, NERC Biomol Anal Facil, Metabol Node NBAF B, Birmingham B15 2TT, W Midlands, England
[10] Koek, Maud M.; van der Kloet, Frans M.; Kleemann, Robert; Kooistra, Teake; Verheij, Elwin R.; Hankemeier, Thomas. Semi-automated non-target processing in GC x GC-MS metabolomics analysis: applicability for biomedical studies [J]. METABOLOMICS, 2011, 7 (01): 1-14
Affiliations (as listed):
Koek, Maud M.; Verheij, Elwin R.: TNO Qual Life, Analyt Res Dept, NL-3704 HE Zeist, Netherlands
van der Kloet, Frans M.: Leiden Univ, LACDR Analyt Biosci, NL-2333 CC Leiden, Netherlands
Kleemann, Robert; Kooistra, Teake: TNO Qual Life, Dept Vasc & Metab Dis, NL-2333 CK Leiden, Netherlands
Hankemeier, Thomas: Leiden Univ, LACDR Analyt Biosci, NL-2333 CC Leiden, Netherlands; Netherlands Metabol Ctr, NL-2333 CC Leiden, Netherlands