Feature extraction from resolution perspective for gas chromatography-mass spectrometry datasets

Cited by: 12
Authors
Ma, Pan [1]
Zhang, Zhimin [1]
Zhou, Xinyi [1]
Yun, Yonghuan [1]
Liang, Yizeng [1]
Lu, Hongmei [1]
Affiliations
[1] Cent S Univ, Coll Chem & Chem Engn, Changsha 410083, Peoples R China
Source
RSC ADVANCES | 2016 / Vol. 6 / Issue 115
Funding
National Natural Science Foundation of China; China Postdoctoral Science Foundation;
Keywords
PERFORMANCE LIQUID-CHROMATOGRAPHY; MULTIVARIATE CURVE RESOLUTION; EVOLVING LATENT PROJECTIONS; 2-WAY MULTICOMPONENT DATA; WINDOW FACTOR-ANALYSIS; PEAK ALIGNMENT; COMPOUND IDENTIFICATION; BACKGROUND CORRECTION; LEAST-SQUARES; GC-MS;
DOI
10.1039/c6ra17864b
Chinese Library Classification number
O6 [Chemistry];
Discipline classification code
0703;
Abstract
Automatic feature extraction from large-scale datasets is one of the major challenges in analyzing complex samples with gas chromatography-mass spectrometry (GC-MS). The classic processing pipeline consists of noise filtering, baseline correction, peak detection, alignment, normalization and identification. Because the pipeline is long, the extracted features vary with the methods chosen and their parameter values. In this study, MS-Assisted Resolution of Signals (MARS) is proposed to extract features automatically, from a resolution perspective, for large-scale GC-MS datasets. First, it divides complex data into small segments and searches for the target zone by moving sub-window factor analysis (MSWFA). Then, an improved iterative target transformation factor analysis (ITTFA) extracts the features of the compound from complex datasets. MARS was systematically tested on a simulated dataset (5 samples), a peppermint dataset (2 samples), a red wine dataset (24 samples) and a human plasma dataset (131 samples). The results show that MARS can extract features accurately, automatically, objectively and swiftly from these complex datasets, at a speed of 2-3 minutes per chromatogram. The extracted features of overlapped peaks are comparable to the features resolved by MCR-ALS or PARAFAC2, and significantly better than those from XCMS. Furthermore, PLS-DA models of the human plasma dataset indicated that features extracted automatically by MARS are comparable to or better than features extracted manually by experts with a GC-MS workstation. MARS has been implemented and open-sourced at https://github.com/zmzhang/MARS.
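To make the resolution step in the abstract more concrete, the following is a minimal sketch of classic ITTFA on a simulated segment with two overlapping peaks. It is not the improved ITTFA used in MARS and is not taken from the MARS repository; the function name `ittfa`, the Gaussian peak shapes, the random spectra, the noise level and the needle position are all assumptions chosen for illustration. The idea shown is only the core loop: project a needle-shaped target onto the subspace of the leading abstract factors, enforce non-negativity, and repeat.

```python
import numpy as np


def ittfa(X, n_components, needle_index, n_iter=50):
    """Classic ITTFA sketch: refine a needle target into an elution profile.

    X has shape (n_scans, n_mz). The names and defaults here are
    illustrative assumptions, not the MARS API.
    """
    # Abstract factors spanning the time (row) dimension of the segment.
    U, _, _ = np.linalg.svd(X, full_matrices=False)
    U = U[:, :n_components]

    # Needle target: a single 1 at the assumed retention time.
    target = np.zeros(X.shape[0])
    target[needle_index] = 1.0

    for _ in range(n_iter):
        projected = U @ (U.T @ target)            # target transformation
        target = np.clip(projected, 0.0, None)    # non-negativity constraint
        target /= np.linalg.norm(target)          # keep the scale fixed
    return target


# Simulated segment: two overlapping Gaussian peaks with random spectra.
rng = np.random.default_rng(0)
scans = np.arange(60)
c1 = np.exp(-((scans - 25) ** 2) / (2 * 4.0 ** 2))
c2 = np.exp(-((scans - 35) ** 2) / (2 * 4.0 ** 2))
C = np.column_stack([c1, c2])                 # elution profiles (60 x 2)
S = rng.random((20, 2))                       # mass spectra (20 m/z x 2)
X = C @ S.T + 1e-3 * rng.standard_normal((60, 20))

profile = ittfa(X, n_components=2, needle_index=25)

# Cosine similarity between the resolved and the true first profile.
similarity = profile @ c1 / np.linalg.norm(c1)
print(f"cosine similarity with true profile: {similarity:.4f}")

# With the profile in hand, the spectrum follows by least squares.
spectrum, *_ = np.linalg.lstsq(profile[:, None], X, rcond=None)
```

In this sketch the needle is placed by hand at the apex of the first peak; in a real workflow the target zone and needle position would have to come from a search step such as the MSWFA stage the abstract describes. The final least-squares spectrum still contains contributions from the co-eluting compound unless both profiles are resolved and fitted together.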
Pages: 113997 - 114004
Number of pages: 8