Review of Data Pre-processing Techniques and Machine Learning in PTR-MS

被引:0
|
作者
Sun Y. [1 ]
Chen Y.-B. [2 ]
Chu M.-J. [1 ]
Jiang X.-H. [1 ]
Wang Y. [1 ]
Guo B.-Q. [1 ]
机构
[1] School of Precision Instrument and Optoelectronics Engineering, Tianjin University, Tianjin
[2] Respiratory Medicine, Chinese PLA General Hospital, Beijing
关键词
Data pre-processing; Machine learning; Proton transfer reaction mass spectrometry (PTR-MS); Volatile organic compounds (VOCs);
D O I
10.7538/zpxb.2017.0181
中图分类号
学科分类号
摘要
Proton transfer reaction mass spectrometry (PTR-MS) is an analytical technique developed for the detection of volatile organic compounds (VOCs). It offers many advantages for VOCs analysis,such us ultra-low detection limits, very short response, no sample preparation, real-time analysis, etc. It has been applied in atmospheric chemistry environmental chemistry, food and biomedical. With the expansion of applications of PTR-MS and the increase of sample types, how to analyze the features from complex data and find out the inherent rules have put forward higher requirements on the processing ability of the algorithm. Therefore, this paper discussed the data preprocessing techniques and machine learning methods. Firstly, we summarized the data preprocessing methods with PTR-MS features. The data generated by the instrument cannot be directly used for statistical analysis, otherwise it will bring great error. Therefore, data pre-processing is an essential step. It includes several steps,such as denoising, normalization, and concentration calculation. The purpose of preprocessing is to get data matrix for subsequent analysis. Next, we focused on the use of machine learning methods for data analysis in PTR-MS, and the advantages of this techniques would be demonstrated as well as the drawbacks. The machine learning method can be divided into two parts. Usually unsupervised methods are common choices for initial data analysis. For further analysis and a priori knowledge, a supervised analysis would be a better way. These methods use this knowledge to learn rules and patterns related to classes in the data, and then use these rules and patterns to predict classes in newly acquired data sets. The main goal of all surveillance techniques is to find the relationship between the predictor (VOC) matrix and the response vector. In general, the combination of the unsupervised and supervised methods is a good idea. PTR-MS is a soft ionization technique, however, the presence of a few fragments will still cause great difficulties in spectral analysis, especially for unknown mixtures, which is the main reason why spectral analysis of PTR-MS differs from other mass spectrometry methods. Perhaps, the data fusion of different platform instruments and different samples will be a good way to solve this problem. © 2018, Editorial Board of Journal of Chinese Mass Spectrometry Society. All right reserved.
引用
收藏
页码:513 / 523
页数:10
相关论文
共 64 条
  • [1] Lindinger W., Hansel A., Jordan A., On-line monitoring of volatile organic compounds at pptv levels by means of proton-transfer-reaction mass spectrometry (PTR-MS) medical applications, food control and environmental research, International Journal of Mass Spectrometry & Ion Processes, 173, 3, pp. 191-241, (1998)
  • [2] Blake R.S., Monks P.S., Ellis A.M., Proton-transfer reaction mass spectrometry, Chemical Reviews, 109, 3, (2009)
  • [3] De L.C.B., Amann A., Alkateb H., Et al., A review of the volatiles from the healthy human body, Journal of Breath Research, 8, 1, (2014)
  • [4] Cappellin L., Biasioli F., Granitto P.M., Et al., On data analysis in PTR-TOF-MS: from raw spectra to data mining, Sensors & Actuators B Chemical, 155, 1, pp. 183-190, (2011)
  • [5] Muller M., Graus M., Ruuskanen T.M., Et al., First eddy covariance flux measurements by PTR-TOF, Atmospheric Measurement Techniques, 3, 2, pp. 387-395, (2010)
  • [6] Ke C., Zhang T., Wu X., Et al., Statistical methods for metabolomics data analysis, Chinese Journal of Health Stastistics, 31, 2, pp. 357-359, (2014)
  • [7] Fazzini M., Multidimensional statistical analysis of PTR-MS breath samples: a test study on irradiation detection, International Journal of Mass Spectrometry, 295, 1-2, pp. 13-20, (2010)
  • [8] Hansel A., Jordan A., Holzinger R., Et al., Proton transfer reaction mass spectrometry: on-line trace gas analysis at the ppb level, International Journal of Mass Spectrometry & Ion Processes, 149-150 s, 150, pp. 609-619, (1995)
  • [9] Hansel A., Mark T.D., Foreword, International Journal of Mass Spectrometry, 239, 2-3, (2004)
  • [10] Thekedar B., Investigations on the use of breath gas analysis with proton transfer reaction mass spectrometry (PTR-MS) for a non-invasive method of early lung cancer detection, Acta Geodaetica Et Cartographic Sinica, (2009)