Analyzing Medical Data by Using Statistical Learning Models

被引:0
|
作者
Mariani, Maria C. [1 ,2 ]
Biney, Francis [2 ]
Tweneboah, Osei K. [3 ]
机构
[1] Univ Texas El Paso, Dept Math Sci, El Paso, TX 79968 USA
[2] Univ Texas El Paso, Computat Sci Program, El Paso, TX 79968 USA
[3] Ramapo Coll, Dept Data Sci, Mahwah, NJ 07430 USA
关键词
statistical learning; deep-feedforward neural network; heart disease; prostate cancer; breast cancer;
D O I
10.3390/math9090968
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
In this work, we investigated the prognosis of three medical data specifically, breast cancer, heart disease, and prostate cancer by using 10 machine learning models. We applied all 10 models to each dataset to identify patterns in them. Furthermore, we use the models to diagnose risk factors that increases the chance of these diseases. All the statistical learning techniques discussed were grouped into linear and nonlinear models based on their similarities and learning styles. The models performances were significantly improved by selecting models while taking into account the bias-variance tradeoffs and using cross-validation for selecting the tuning parameter. Our results suggests that no particular class of models or learning style dominated the prognosis and diagnosis for all three medical datasets. However nonlinear models gave the best predictive performance for breast cancer data. Linear models on the other hand gave the best predictive performance for heart disease data and a combination of linear and nonlinear models for the prostate cancer dataset.
引用
收藏
页数:30
相关论文
共 50 条
  • [41] Spam filtering using statistical data compression models
    Bratko, Andrej
    Cormack, Gordon V.
    Filipic, Bogdan
    Lynam, Thomas R.
    Zupan, Blaz
    JOURNAL OF MACHINE LEARNING RESEARCH, 2006, 7 : 2673 - 2698
  • [42] Peptide vaccine models using statistical data mining
    Joshi, Rajani R.
    PROTEIN AND PEPTIDE LETTERS, 2007, 14 (06): : 536 - 542
  • [43] Statistical testing for load models using measured data
    Lv, Jiaqing
    Pawlak, Miroslaw
    Annakkage, U. D.
    Bagen, Bagen
    ELECTRIC POWER SYSTEMS RESEARCH, 2018, 163 : 66 - 72
  • [44] Addressing the data bottleneck in medical deep learning models using a human-in-the-loop machine learning approach
    Mosqueira-Rey, Eduardo
    Hernandez-Pereira, Elena
    Bobes-Bascaran, Jose
    Alonso-Rios, David
    Perez-Sanchez, Alberto
    Fernandez-Leal, Angel
    Moret-Bonillo, Vicente
    Vidal-Insua, Yolanda
    Vazquez-Rivera, Francisca
    NEURAL COMPUTING & APPLICATIONS, 2024, 36 (05): : 2597 - 2616
  • [45] Addressing the data bottleneck in medical deep learning models using a human-in-the-loop machine learning approach
    Eduardo Mosqueira-Rey
    Elena Hernández-Pereira
    José Bobes-Bascarán
    David Alonso-Ríos
    Alberto Pérez-Sánchez
    Ángel Fernández-Leal
    Vicente Moret-Bonillo
    Yolanda Vidal-Ínsua
    Francisca Vázquez-Rivera
    Neural Computing and Applications, 2024, 36 : 2597 - 2616
  • [46] A Novel Method for Medical Predictive Models in Small Data Using Out-of-Distribution Data and Transfer Learning
    Jeong, Inyong
    Kim, Yeongmin
    Cho, Nam-Jun
    Gil, Hyo-Wook
    Lee, Hwamin
    MATHEMATICS, 2024, 12 (02)
  • [47] Analyzing Road Accident Data using Machine Learning Paradigms
    Nandurge, Priyanka A.
    Dharwadkar, Nagaraj V.
    2017 INTERNATIONAL CONFERENCE ON I-SMAC (IOT IN SOCIAL, MOBILE, ANALYTICS AND CLOUD) (I-SMAC), 2017, : 604 - 610
  • [48] Identifying Acute Ischemic Stroke by Analyzing Icd-10 Claims Data Using Machine Learning Models
    Esenwa, Charles
    Luna, Jorge
    Kummer, Benjamin
    Salmasian, Hojjat
    Vawdrey, David
    Kamel, Hooman
    Elkind, Mitchell
    STROKE, 2017, 48
  • [49] Analyzing Harmonic Monitoring Data Using Supervised and Unsupervised Learning
    Asheibi, Ali
    Stirling, David
    Sutanto, Danny
    IEEE TRANSACTIONS ON POWER DELIVERY, 2009, 24 (01) : 293 - 301
  • [50] Analyzing the Twitter Data Stream Using the Snap! Learning Environment
    Grillenberger, Andreas
    Romeike, Ralf
    INFORMATICS IN SCHOOLS: CURRICULA, COMPETENCES, AND COMPETITIONS, 2015, 9378 : 155 - 164