Analyzing Medical Data by Using Statistical Learning Models

被引:0
|
作者
Mariani, Maria C. [1 ,2 ]
Biney, Francis [2 ]
Tweneboah, Osei K. [3 ]
机构
[1] Univ Texas El Paso, Dept Math Sci, El Paso, TX 79968 USA
[2] Univ Texas El Paso, Computat Sci Program, El Paso, TX 79968 USA
[3] Ramapo Coll, Dept Data Sci, Mahwah, NJ 07430 USA
关键词
statistical learning; deep-feedforward neural network; heart disease; prostate cancer; breast cancer;
D O I
10.3390/math9090968
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
In this work, we investigated the prognosis of three medical data specifically, breast cancer, heart disease, and prostate cancer by using 10 machine learning models. We applied all 10 models to each dataset to identify patterns in them. Furthermore, we use the models to diagnose risk factors that increases the chance of these diseases. All the statistical learning techniques discussed were grouped into linear and nonlinear models based on their similarities and learning styles. The models performances were significantly improved by selecting models while taking into account the bias-variance tradeoffs and using cross-validation for selecting the tuning parameter. Our results suggests that no particular class of models or learning style dominated the prognosis and diagnosis for all three medical datasets. However nonlinear models gave the best predictive performance for breast cancer data. Linear models on the other hand gave the best predictive performance for heart disease data and a combination of linear and nonlinear models for the prostate cancer dataset.
引用
收藏
页数:30
相关论文
共 50 条
  • [1] Machine learning and statistical models for analyzing multilevel patent data
    Sunyun Qi
    Yu Zhang
    Hua Gu
    Fei Zhu
    Meiying Gao
    Hongxiao Liang
    Qifeng Zhang
    Yanchao Gao
    Scientific Reports, 13
  • [2] Machine learning and statistical models for analyzing multilevel patent data
    Qi, Sunyun
    Zhang, Yu
    Gu, Hua
    Zhu, Fei
    Gao, Meiying
    Liang, Hongxiao
    Zhang, Qifeng
    Gao, Yanchao
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [3] ANALYZING MEDICAL DATA - SOME STATISTICAL CONSIDERATIONS
    WALSH, JE
    IRE TRANSACTIONS ON MEDICAL ELECTRONICS, 1960, 7 (04): : 362 - 366
  • [4] Analyzing sickness absence with statistical models for survival data
    Christensen, Karl Bang
    Andersen, Per Kragh
    Smith-Hansen, Lars
    Nielsen, Martin L.
    Kristensen, Tage S.
    SCANDINAVIAN JOURNAL OF WORK ENVIRONMENT & HEALTH, 2007, 33 (03) : 233 - 239
  • [5] Analyzing the Models of Medical Data Center on Cloud Computing
    Zhang Jiemin
    10TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION (ICCSE 2015), 2015, : 76 - 79
  • [6] Outlier Detection in Sensed Data using Statistical Learning Models for IoT
    Nesa, Nashreen
    Ghosh, Tania
    Banerjee, Indrajit
    2018 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2018,
  • [7] Learning statistical models of phenotypes using noisy labeled training data
    Agarwal, Vibhu
    Podchiyska, Tanya
    Banda, Juan M.
    Goel, Veena
    Leung, Tiffany I.
    Minty, Evan P.
    Sweeney, Timothy E.
    Gyang, Elsie
    Shah, Nigam H.
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2016, 23 (06) : 1166 - 1173
  • [8] Analyzing Longitudinal Data Using Machine Learning with Mixed-Effects Models
    Yigit, Pakize
    Ahmed, Syed Ejaz
    EIGHTEENTH INTERNATIONAL CONFERENCE ON MANAGEMENT SCIENCE AND ENGINEERING MANAGEMENT, ICMSEM 2024, 2024, 215 : 633 - 646
  • [9] Analyzing Data Modalities for Cattle Weight Estimation Using Deep Learning Models
    Afridi, Hina
    Ullah, Mohib
    Nordbo, Oyvind
    Hoff, Solvei Cottis
    Furre, Siri
    Larsgard, Anne Guro
    Cheikh, Faouzi Alaya
    JOURNAL OF IMAGING, 2024, 10 (03)
  • [10] Statistical models for e-learning data
    Figini, Silvia
    Giudici, Paolo
    STATISTICAL METHODS AND APPLICATIONS, 2009, 18 (02): : 293 - 304