Improved Stress Classification Using Automatic Feature Selection from Heart Rate and Respiratory Rate Time Signals

被引:16
作者
Iqbal, Talha [1 ]
Elahi, Adnan [2 ]
Wijns, William [1 ,3 ]
Amin, Bilal [1 ,2 ]
Shahzad, Atif [1 ,4 ]
机构
[1] Univ Galway, Lambe Inst Translat Res, Coll Med, Smart Sensor Lab,Nursing Hlth Sci, Galway H91 TK33, Ireland
[2] Univ Galway, Elect & Elect Engn, Galway H91 TK33, Ireland
[3] CURAM Ctr Res Med Devices, Galway H91 W2TY, Ireland
[4] Univ Birmingham, Ctr Syst Modelling & Quantitat Biomed SMQB, Birmingham B15 2TT, England
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 05期
基金
爱尔兰科学基金会;
关键词
time-series; distinctive features; respiratory rate; heart rate; feature engineering; stress; classification; CORRELATION-COEFFICIENTS; SPEARMANS;
D O I
10.3390/app13052950
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Time-series features are the characteristics of data periodically collected over time. The calculation of time-series features helps in understanding the underlying patterns and structure of the data, as well as in visualizing the data. The manual calculation and selection of time-series feature from a large temporal dataset are time-consuming. It requires researchers to consider several signal-processing algorithms and time-series analysis methods to identify and extract meaningful features from the given time-series data. These features are the core of a machine learning-based predictive model and are designed to describe the informative characteristics of the time-series signal. For accurate stress monitoring, it is essential that these features are not only informative but also well-distinguishable and interpretable by the classification models. Recently, a lot of work has been carried out on automating the extraction and selection of times-series features. In this paper, a correlation-based time-series feature selection algorithm is proposed and evaluated on the stress-predict dataset. The algorithm calculates a list of 1578 features of heart rate and respiratory rate signals (combined) using the tsfresh library. These features are then shortlisted to the more specific time-series features using Principal Component Analysis (PCA) and Pearson, Kendall, and Spearman correlation ranking techniques. A comparative study of conventional statistical features (like, mean, standard deviation, median, and mean absolute deviation) versus correlation-based selected features is performed using linear (logistic regression), ensemble (random forest), and clustering (k-nearest neighbours) predictive models. The correlation-based selected features achieved higher classification performance with an accuracy of 98.6% as compared to the conventional statistical feature's 67.4%. The outcome of the proposed study suggests that it is vital to have better analytical features rather than conventional statistical features for accurate stress classification.
引用
收藏
页数:15
相关论文
共 58 条
[1]  
Abuaitah G. R., 2012, P 2012 IEEE 9 INT C, P1
[2]  
Benesty Jacob, 2009, The SAGE Encyclopedia of Educational Research, Measurement, and Evaluation, P1, DOI DOI 10.4135/9781506326139
[3]  
Benjamini Y, 2001, ANN STAT, V29, P1165
[4]   Wearable Flexible Electronics Based Cardiac Electrode for Researcher Mental Stress Detection System Using Machine Learning Models on Single Lead Electrocardiogram Signal [J].
Bin Heyat, Md Belal ;
Akhtar, Faijan ;
Abbas, Syed Jafar ;
Al-Sarem, Mohammed ;
Alqarafi, Abdulrahman ;
Stalin, Antony ;
Abbasi, Rashid ;
Muaad, Abdullah Y. ;
Lai, Dakun ;
Wu, Kaishun .
BIOSENSORS-BASEL, 2022, 12 (06)
[5]   A Review on Outlier/Anomaly Detection in Time Series Data [J].
Blazquez-Garcia, Ane ;
Conde, Angel ;
Mori, Usue ;
Lozano, Jose A. .
ACM COMPUTING SURVEYS, 2022, 54 (03)
[6]  
Box G.E., 2015, Time Series Analysis: Forecasting and Control
[7]  
Braun N, 2019, RELEASE V0 11 0 BLU
[8]  
Chourpiliadis C., 2019, Physiology, respiratory rate
[9]  
Christ M., 2016, arXiv
[10]  
Christ M., 2016, WORKSHOP EXTREME VAL