Implementation of quality controls is essential to prevent batch effects in breathomics data and allow for cross-study comparisons

被引:11
作者
Stavropoulos, Georgios [1 ]
Jonkers, Daisy M. A. E. [2 ]
Mujagic, Zlatan [2 ]
Koek, Ger H. [2 ]
Masclee, Ad A. M. [2 ]
Pierik, Marieke J. [2 ]
Dallinga, Jan W. [1 ]
Van Schooten, Frederik-Jan [1 ]
Smolinska, Agnieszka [1 ]
机构
[1] Maastricht Univ, NUTRIM Sch Nutr & Translat Res, Dept Pharmacol & Toxicol, Maastricht, Netherlands
[2] Maastricht Univ, NUTRIM Sch Nutr & Translat Res, Div Gastroenterol & Hepatol, Maastricht, Netherlands
关键词
exhaled breath; volatile organic compounds; VOCs; data analysis; batch effects; IBD; IBS; liver cirrhosis; VOLATILE ORGANIC-COMPOUNDS; IRRITABLE-BOWEL-SYNDROME; PARTIAL LEAST-SQUARES; GENE-EXPRESSION; MICROARRAY DATA; CLASSIFICATION; PERFORMANCE; PREDICTION; DIAGNOSIS; DISEASE;
D O I
10.1088/1752-7163/ab7b8d
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Exhaled breath analysis has become a promising monitoring tool for various ailments by identifying volatile organic compounds (VOCs) as indicative biomarkers excreted in the human body. Throughout the process of sampling, measuring, and data processing, non-biological variations are introduced in the data leading to batch effects. Algorithmic approaches have been developed to cope with within-study batch effects. Batch differences, however, may occur among different studies too, and up-to-date, ways to correct for cross-study batch effects are lacking; ultimately, cross-study comparisons to verify the uniqueness of found VOC profiles for a specific disease may be challenging. This study applies within-study batch-effect-correction approaches to correct for cross-study batch effects; suggestions are made that may help prevent the introduction of cross-study variations. Three batch-effect-correction algorithms were investigated: zero-centering, combat, and the analysis of covariance framework. The breath samples were collected from inflammatory bowel disease (n = 213), chronic liver disease (n = 189), and irritable bowel syndrome (n = 261) patients at different periods, and they were analysed via gas chromatography-mass spectrometry. Multivariate statistics were used to visualise and verify the results. The visualisation of the data before any batch-effect-correction technique was applied showed a clear distinction due to probable batch effects among the datasets of the three cohorts. The visualisation of the three datasets after implementing all three correction techniques showed that the batch effects were still present in the data. Predictions made using partial least squares discriminant analysis and random forest confirmed this observation. The within-study batch-effect-correction approaches fail to correct for cross-study batch effects present in the data. The present study proposes a framework for systematically standardising future breathomics data by using internal standards or quality control samples at regular analysis intervals. Further knowledge regarding the nature of the unsolicited variations among cross-study batches must be obtained to move the field further.
引用
收藏
页数:12
相关论文
共 60 条
  • [1] Unsupervised random forest: a tutorial with case studies
    Afanador, Nelson Lee
    Smolinska, Agnieszka
    Tran, Thanh N.
    Blanchet, Lionel
    [J]. JOURNAL OF CHEMOMETRICS, 2016, 30 (05) : 232 - 241
  • [2] The human volatilome: volatile organic compounds (VOCs) in exhaled breath, skin emanations, urine, feces and saliva
    Amann, Anton
    Costello, Ben de Lacy
    Miekisch, Wolfram
    Schubert, Jochen
    Buszewski, Boguslaw
    Pleil, Joachim
    Ratcliffe, Norman
    Risby, Terence
    [J]. JOURNAL OF BREATH RESEARCH, 2014, 8 (03)
  • [3] [Anonymous], 1936, Proceedings of the National Academy of Sciences
  • [4] Diagnosing Inflammatory bowel disease using noninvasive applications of volatile organic compounds: a systematic review
    Bannaga, Ayman S.
    Farrugia, Alexia
    Arasaradnam, Ramesh P.
    [J]. EXPERT REVIEW OF GASTROENTEROLOGY & HEPATOLOGY, 2019, 13 (11) : 1113 - 1122
  • [5] Volatile organic compounds in breath as markers for irritable bowel syndrome: a metabolomic approach
    Baranska, A.
    Mujagic, Z.
    Smolinska, A.
    Dallinga, J. W.
    Jonkers, D. M. A. E.
    Tigchelaar, E. F.
    Dekens, J.
    Zhernakova, A.
    Ludwig, T.
    Masclee, A. A. M.
    Wijmenga, C.
    van Schooten, F. J.
    [J]. ALIMENTARY PHARMACOLOGY & THERAPEUTICS, 2016, 44 (01) : 45 - 56
  • [6] Profile of volatile organic compounds in exhaled breath changes as a result of gluten-free diet
    Baranska, Agnieszka
    Tigchelaar, Ettje
    Smolinska, Agnieszka
    Dallinga, Jan W.
    Moonen, Edwin J. C.
    Dekens, Jackie A. M.
    Wijmenga, Cisca
    Zhernakova, Alexandra
    van Schooten, Frederik J.
    [J]. JOURNAL OF BREATH RESEARCH, 2013, 7 (03)
  • [7] Partial least squares for discrimination
    Barker, M
    Rayens, W
    [J]. JOURNAL OF CHEMOMETRICS, 2003, 17 (03) : 166 - 173
  • [8] Adjustment of systematic microarray data biases
    Benito, M
    Parker, J
    Du, Q
    Wu, JY
    Xang, D
    Perou, CM
    Marron, JS
    [J]. BIOINFORMATICS, 2004, 20 (01) : 105 - 114
  • [9] Bhattacharyya A., 1943, Bull. Calcutta Math. Soc., V35, P99
  • [10] Volatile Organic Compounds in Exhaled Air as Novel Marker for Disease Activity in Crohn's Disease: A Metabolomic Approach
    Bodelier, Alexander G. L.
    Smolinska, Agnieszka
    Baranska, Agnieszka
    Dallinga, Jan W.
    Mujagic, Zlatan
    Vanhees, Kimberly
    van den Heuvel, Tim
    Masclee, Ad A. M.
    Jonkers, Daisy
    Pierik, Marie J.
    van Schooten, Frederik J.
    [J]. INFLAMMATORY BOWEL DISEASES, 2015, 21 (08) : 1776 - 1785