Diagnosis of Obstructive Sleep Apnea Using Feature Selection, Classification Methods, and Data Grouping Based Age, Sex, and Race

被引:4
作者
Sheta, Alaa [1 ]
Thaher, Thaer [2 ]
Surani, Salim R. [3 ]
Turabieh, Hamza [4 ]
Braik, Malik [5 ]
Too, Jingwei [6 ]
Abu-El-Rub, Noor [7 ]
Mafarjah, Majdi [8 ]
Chantar, Hamouda [9 ]
Subramanian, Shyam [10 ]
机构
[1] Southern Connecticut State Univ, Comp Sci Dept, New Haven, CT 06514 USA
[2] Arab Amer Univ, Dept Comp Syst Engn, POB 240, Jenin, Palestine
[3] Texas A&M Univ, Dept Pulm Crit Care & Sleep Med, College Stn, TX 77843 USA
[4] Univ Missouri, Sch Med, Hlth Management & Informat Dept, Columbia, MO 65212 USA
[5] Al Balqa Appl Univ, Dept Comp Sci, Salt 19117, Jordan
[6] Univ Teknikal Malaysia Melaka, Fac Elect Engn, Hang Tuah Jaya, Durian Tunggal 76100, Melaka, Malaysia
[7] Univ Kansas, Ctr Med Informat & Enterprise Analyt, Med Ctr, Kansas City, KS 66160 USA
[8] Birzeit Univ, Dept Comp Sci, POB 14, Birzeit, Palestine
[9] Sebha Univ, Fac Informat Technol, Sebha 18758, Libya
[10] Sutter Hlth, Pulm Crit Care & Sleep Med, Tracy, CA 95376 USA
关键词
obstructive sleep apnea; grouping; feature selection; machine learning; OPTIMIZATION ALGORITHM; GENDER; HYPERTENSION; MODEL;
D O I
10.3390/diagnostics13142417
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Obstructive sleep apnea (OSA) is a prevalent sleep disorder that affects approximately 3-7% of males and 2-5% of females. In the United States alone, 50-70 million adults suffer from various sleep disorders. OSA is characterized by recurrent episodes of breathing cessation during sleep, thereby leading to adverse effects such as daytime sleepiness, cognitive impairment, and reduced concentration. It also contributes to an increased risk of cardiovascular conditions and adversely impacts patient overall quality of life. As a result, numerous researchers have focused on developing automated detection models to identify OSA and address these limitations effectively and accurately. This study explored the potential benefits of utilizing machine learning methods based on demographic information for diagnosing the OSA syndrome. We gathered a comprehensive dataset from the Torr Sleep Center in Corpus Christi, Texas, USA. The dataset comprises 31 features, including demographic characteristics such as race, age, sex, BMI, Epworth score, M. Friedman tongue position, snoring, and more. We devised a novel process encompassing pre-processing, data grouping, feature selection, and machine learning classification methods to achieve the research objectives. The classification methods employed in this study encompass decision tree (DT), naive Bayes (NB), k-nearest neighbor (kNN), support vector machine (SVM), linear discriminant analysis (LDA), logistic regression (LR), and subspace discriminant (Ensemble) classifiers. Through rigorous experimentation, the results indicated the superior performance of the optimized kNN and SVM classifiers for accurately classifying sleep apnea. Moreover, significant enhancements in model accuracy were observed when utilizing the selected demographic variables and employing data grouping techniques. For instance, the accuracy percentage demonstrated an approximate improvement of 4.5%, 5%, and 10% with the feature selection approach when applied to the grouped data of Caucasians, females, and individuals aged 50 or below, respectively. Furthermore, a comparison with prior studies confirmed that effective data grouping and proper feature selection yielded superior performance in OSA detection when combined with an appropriate classification method. Overall, the findings of this research highlight the importance of leveraging demographic information, employing proper feature selection techniques, and utilizing optimized classification models for accurate and efficient OSA diagnosis.
引用
收藏
页数:28
相关论文
共 50 条
  • [31] Effects of age and sex on the performance of the NoSAS score as a screening tool for obstructive sleep apnea: a hospital-based retrospective study in China
    Zhigang Zhang
    Dan Yang
    Haiying Wang
    Xinmin Liu
    Sleep and Breathing, 2021, 25 : 1407 - 1417
  • [32] Simplified Process of Obstructive Sleep Apnea Detection Using ECG Signal Based Analysis with Data Flow Programming
    Bali, Jyoti
    Nandi, Anilkumar V.
    INFORMATION AND COMMUNICATION TECHNOLOGY FOR INTELLIGENT SYSTEMS (ICTIS 2017) - VOL 2, 2018, 84 : 165 - 173
  • [33] Support Vector Machines-based Heart Disease Diagnosis using Feature Subset, Wrapping Selection and Extraction Methods
    Shah, Syed Muhammad Saqlain
    Shah, Faiz Ali
    Hussain, Syed Adnan
    Batool, Safeera
    COMPUTERS & ELECTRICAL ENGINEERING, 2020, 84
  • [34] Logistic regression and artificial neural network-based simple predicting models for obstructive sleep apnea by age, sex, and body mass index
    Kuan, Yi-Chun
    Hong, Chien-Tai
    Chen, Po-Chih
    Liu, Wen-Te
    Chung, Chen-Chih
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2022, 19 (11) : 11409 - 11421
  • [35] Grad-CAM-Based Feature Selection and Dementia Classification Algorithm Using Voice Data
    Ko, Hansol
    Wang, Bohyun
    Lim, Joon S.
    TEHNICKI VJESNIK-TECHNICAL GAZETTE, 2024, 31 (06): : 2036 - 2044
  • [36] Improved Kurdish Dialect Classification Using Data Augmentation and ANOVA-Based Feature Selection
    Ghafoor, Karzan J.
    Taher, Sarkhel H.
    Rawf, Karwan M. Hama
    Abdulrahman, Ayub O.
    ARO-THE SCIENTIFIC JOURNAL OF KOYA UNIVERSITY, 2025, 13 (01): : 94 - 103
  • [37] Feature selection using neighborhood entropy-based uncertainty measures for gene expression data classification
    Sun, Lin
    Zhang, Xiaoyu
    Qian, Yuhua
    Xu, Jiucheng
    Zhang, Shiguang
    INFORMATION SCIENCES, 2019, 502 : 18 - 41
  • [38] Feature Subset Selection for High-Dimensional, Low Sampling Size Data Classification Using Ensemble Feature Selection With a Wrapper-Based Search
    Mandal, Ashis Kumar
    Nadim, MD.
    Saha, Hasi
    Sultana, Tangina
    Hossain, Md. Delowar
    Huh, Eui-Nam
    IEEE ACCESS, 2024, 12 : 62341 - 62357
  • [39] Benchmarking feature selection and feature extraction methods to improve the performances of machine-learning algorithms for patient classification using metabolomics biomedical data
    Labory, Justine
    Njomgue-Fotso, Evariste
    Bottini, Silvia
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2024, 23 : 1274 - 1287
  • [40] Age- and Gender-Based Tongue Volume Variations on Asymptomatic Patients: A Simplified Approach to Form Baseline Data for Obstructive Sleep Apnea
    Bastug, Betul Tiryaki
    DIAGNOSTICS, 2025, 15 (03)