Diagnosis of Obstructive Sleep Apnea Using Feature Selection, Classification Methods, and Data Grouping Based Age, Sex, and Race

Cited by: 4
Authors
Sheta, Alaa [1]
Thaher, Thaer [2]
Surani, Salim R. [3]
Turabieh, Hamza [4]
Braik, Malik [5]
Too, Jingwei [6]
Abu-El-Rub, Noor [7]
Mafarjah, Majdi [8]
Chantar, Hamouda [9]
Subramanian, Shyam [10]
Affiliations
[1] Southern Connecticut State Univ, Comp Sci Dept, New Haven, CT 06514 USA
[2] Arab Amer Univ, Dept Comp Syst Engn, POB 240, Jenin, Palestine
[3] Texas A&M Univ, Dept Pulm Crit Care & Sleep Med, College Stn, TX 77843 USA
[4] Univ Missouri, Sch Med, Hlth Management & Informat Dept, Columbia, MO 65212 USA
[5] Al Balqa Appl Univ, Dept Comp Sci, Salt 19117, Jordan
[6] Univ Teknikal Malaysia Melaka, Fac Elect Engn, Hang Tuah Jaya, Durian Tunggal 76100, Melaka, Malaysia
[7] Univ Kansas, Ctr Med Informat & Enterprise Analyt, Med Ctr, Kansas City, KS 66160 USA
[8] Birzeit Univ, Dept Comp Sci, POB 14, Birzeit, Palestine
[9] Sebha Univ, Fac Informat Technol, Sebha 18758, Libya
[10] Sutter Hlth, Pulm Crit Care & Sleep Med, Tracy, CA 95376 USA
Keywords
obstructive sleep apnea; grouping; feature selection; machine learning; OPTIMIZATION ALGORITHM; GENDER; HYPERTENSION; MODEL;
DOI
10.3390/diagnostics13142417
Chinese Library Classification number
R5 [Internal Medicine];
Discipline classification code
1002 ; 100201 ;
Abstract
Obstructive sleep apnea (OSA) is a prevalent sleep disorder that affects approximately 3-7% of males and 2-5% of females. In the United States alone, 50-70 million adults suffer from various sleep disorders. OSA is characterized by recurrent episodes of breathing cessation during sleep, leading to adverse effects such as daytime sleepiness, cognitive impairment, and reduced concentration. It also contributes to an increased risk of cardiovascular conditions and adversely affects patients' overall quality of life. As a result, numerous researchers have focused on developing automated detection models that identify OSA effectively and accurately. This study explored the potential benefits of using machine learning methods based on demographic information for diagnosing OSA. We gathered a comprehensive dataset from the Torr Sleep Center in Corpus Christi, Texas, USA. The dataset comprises 31 features, including demographic characteristics such as race, age, sex, BMI, Epworth score, M. Friedman tongue position, snoring, and more. We devised a novel process encompassing pre-processing, data grouping, feature selection, and machine learning classification to achieve the research objectives. The classification methods employed in this study are decision tree (DT), naive Bayes (NB), k-nearest neighbor (kNN), support vector machine (SVM), linear discriminant analysis (LDA), logistic regression (LR), and subspace discriminant (Ensemble) classifiers. The experimental results indicated that the optimized kNN and SVM classifiers performed best at classifying sleep apnea. Moreover, model accuracy improved significantly when the selected demographic variables and data grouping were used. For instance, with feature selection applied to the grouped data of Caucasians, females, and individuals aged 50 or below, accuracy improved by approximately 4.5%, 5%, and 10%, respectively. Furthermore, a comparison with prior studies confirmed that effective data grouping and proper feature selection yielded superior performance in OSA detection when combined with an appropriate classification method. Overall, the findings of this research highlight the importance of leveraging demographic information, employing proper feature selection techniques, and using optimized classification models for accurate and efficient OSA diagnosis.
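The process described in the abstract (data grouping, then feature selection, then classification) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the data are synthetic rather than the Torr Sleep Center records, the feature score is a simple class-mean-difference filter standing in for whatever selection method the paper uses (the abstract does not name it), the classifier is a plain unoptimized kNN, and all function names (`make_synthetic`, `select_features`, `evaluate_by_group`, and so on) are hypothetical.

```python
import random
import statistics
from collections import Counter


def make_synthetic(n=400, seed=0):
    """Build synthetic (group, features, label) rows; NOT real patient data."""
    rng = random.Random(seed)
    rows = []
    for _ in range(n):
        group = rng.choice(["female", "male"])
        label = rng.randint(0, 1)
        informative = label * 2.0 + rng.gauss(0, 0.5)  # feature 0 tracks the label
        noise = [rng.gauss(0, 3.0) for _ in range(4)]  # features 1-4 are pure noise
        rows.append((group, [informative] + noise, label))
    return rows


def select_features(rows, top_k=1):
    """Filter-style selection: rank features by |mean(pos) - mean(neg)| / spread."""
    n_feat = len(rows[0][1])
    scores = []
    for j in range(n_feat):
        pos = [x[j] for _, x, y in rows if y == 1]
        neg = [x[j] for _, x, y in rows if y == 0]
        spread = statistics.pstdev(pos + neg) or 1.0  # avoid division by zero
        scores.append(abs(statistics.mean(pos) - statistics.mean(neg)) / spread)
    return sorted(range(n_feat), key=lambda j: -scores[j])[:top_k]


def knn_predict(train, x, feats, k=5):
    """Majority vote among the k nearest training rows (squared Euclidean distance)."""
    def dist(other):
        return sum((other[j] - x[j]) ** 2 for j in feats)
    nearest = sorted(train, key=lambda row: dist(row[1]))[:k]
    return Counter(y for _, _, y in nearest).most_common(1)[0][0]


def accuracy(train, test, feats, k=5):
    """Fraction of test rows whose kNN prediction matches the true label."""
    hits = sum(knn_predict(train, x, feats, k) == y for _, x, y in test)
    return hits / len(test)


def evaluate_by_group(rows, top_k=1):
    """Group rows, select features per group on the training split, then classify."""
    results = {}
    for group in sorted({g for g, _, _ in rows}):
        subset = [r for r in rows if r[0] == group]
        cut = int(0.7 * len(subset))  # simple 70/30 hold-out split
        train, test = subset[:cut], subset[cut:]
        all_feats = list(range(len(subset[0][1])))
        chosen = select_features(train, top_k)
        results[group] = {
            "selected": chosen,
            "acc_all": accuracy(train, test, all_feats),
            "acc_selected": accuracy(train, test, chosen),
        }
    return results


if __name__ == "__main__":
    for group, stats in evaluate_by_group(make_synthetic()).items():
        print(group, stats)
```

Running the script prints, for each group, the selected feature indices and the hold-out accuracy with all features versus only the selected ones. The numbers come from synthetic data and say nothing about the accuracy gains reported in the study.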
Pages: 28