Optimization of machine learning techniques for the determination of clinical parameters in dried human serum samples from FTIR spectroscopic data

被引:3
|
作者
Palumbo, Domenico [1 ]
Giorni, Antonio [2 ]
Minocchi, Rossella [2 ]
Amendola, Roberto [1 ,3 ]
Guidi, Mariangela Cestelli [3 ]
机构
[1] ENEA, C R Casaccia, Via Anguillarese 301, I-00123 Rome, Italy
[2] ENEA, Occupat Med Serv, C R Casaccia, Via Anguillarese 301, I-00123 Rome, Italy
[3] INFN, Lab Nazl Frascati, Via Enr Fermi 54, I-00044 Frascati, Italy
关键词
FTIR; Human serum; Clinical parameters prediction; Machine learning; Regression; NEAR-INFRARED SPECTROSCOPY; LEAST-SQUARES REGRESSION; MULTILAYER FILM ELEMENTS; MULTIVARIATE CALIBRATION; VIBRATIONAL SPECTROSCOPY; PROTEIN CONTENTS; IR SPECTROSCOPY; HUMAN PLASMA; ATR; TRIGLYCERIDES;
D O I
10.1016/j.vibspec.2022.103408
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Machine learning techniques are powerful tools that can be applied to a large variety of fields due to their great versatility. Here, numerous machine learning regression methods are compared for the analysis of FTIR spectra of biological human serum samples in order to support and validate the use of vibrational spectroscopies for the quantification of clinical parameters and the identification of pathologies or states of alteration. To this end, we systematically analysed the prediction of 6 clinical parameters through machine learning techniques: Triglycerides, Cholesterol, HDL Cholesterol, Urea, Glucose and Total Proteins. The prediction ability is excellent in the case of Partial Least Squares regression (PLSR), Neural Networks (NN) and Support Vector regression (SVR) and in particular for Triglycerides, Cholesterol, HDL Cholesterol and Urea while for Glucose and Total Proteins the prediction ability is less accurate. The ensemble regression algorithms, specifically Boosting (BOOST), Boostrap Aggregation (BAG) applied to these base learners and to Decision Trees (DT) and Random Forest (RF), doesn't significantly improve the base learner results. The comparison also shows superior performances in the case of linear regression and considering the entire infrared spectrum without the need to select spectral features. The results obtained here go in the direction of standardizing the FTIR data analysis methodology to optimize the prediction of clinical parameters. Coupled with the development of portable spectrometers, faster detectors and powerful light sources, FTIR spectroscopy can replace standard clinical testing procedures by making them faster, simpler and lower cost.
引用
收藏
页数:10
相关论文
共 12 条
  • [1] Early Diagnosis of Dementia from Clinical Data by Machine Learning Techniques
    So, Aram
    Hooshyar, Danial
    Park, Kun Woo
    Lim, Heui Seok
    APPLIED SCIENCES-BASEL, 2017, 7 (07):
  • [2] Determination of human arterial wall parameters from clinical data
    Stalhand, Jonas
    BIOMECHANICS AND MODELING IN MECHANOBIOLOGY, 2009, 8 (02) : 141 - 148
  • [3] Determination of human arterial wall parameters from clinical data
    Jonas Stålhand
    Biomechanics and Modeling in Mechanobiology, 2009, 8 : 141 - 148
  • [4] RESERVOIR POROSITY DETERMINATION FROM 3D SEISMIC DATA - APPLICATION OF TWO MACHINE LEARNING TECHNIQUES
    Alimoradi, Andisheh
    Moradzadeh, Ali
    Bakhtiari, Mohammad Reza
    JOURNAL OF SEISMIC EXPLORATION, 2012, 21 (04): : 323 - 345
  • [5] Refinement of blood sampling techniques from the non-human primate to provide dried blood spot samples for generation of toxicokinetic data
    Burnett, J.
    Brook, J.
    Price, C.
    Tasker, L.
    Hanson-Williams, K.
    TOXICOLOGY LETTERS, 2010, 196 : S99 - S99
  • [6] Machine Learning Techniques for Extracting Relevant Features from Clinical Data for COVID-19 Mortality Prediction
    Fraccaroli, Michele
    Mazzuchelli, Giulia
    Bizzarri, Alice
    26TH IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS (IEEE ISCC 2021), 2021,
  • [7] Viral genome prediction from raw human DNA sequence samples by combining natural language processing and machine learning techniques
    Alshayeji, Mohammad H.
    Sindhu, Silpa ChandraBhasi
    Abed, Saed
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 218
  • [8] Machine Learning Approach for Predicting Past Environmental Exposures From Molecular Profiling of Post-Exposure Human Serum Samples
    Khan, Atif
    Thatcher, Thomas H.
    Woeller, Collynn F.
    Sime, Patricia J.
    Phipps, Richard P.
    Hopke, Philip K.
    Utell, Mark J.
    Krahl, Pamela L.
    Mallon, Timothy M.
    Thakar, Juilee
    JOURNAL OF OCCUPATIONAL AND ENVIRONMENTAL MEDICINE, 2019, 61 (12) : S55 - S64
  • [9] Comparison of machine learning techniques for the identification of human activities from inertial sensors available in a mobile device after the application of data imputation techniques
    Pires, Ivan Miguel
    Hussain, Faisal
    Marques, Goncalo
    Garcia, Nuno M.
    COMPUTERS IN BIOLOGY AND MEDICINE, 2021, 135 (135)
  • [10] Simultaneous determination of ascorbic and uric acids and dopamine in human serum samples using three-way calibration with data from square wave voltammetry
    Marcelo Granero, Adrian
    Dario Pierini, Gaston
    Noel Robledo, Sebastian
    Susana Di Nezio, Maria
    Fernandez, Hector
    Alicia Zon, Maria
    MICROCHEMICAL JOURNAL, 2016, 129 : 205 - 212