Analyzing Medical Data by Using Statistical Learning Models

被引:0
|
作者
Mariani, Maria C. [1 ,2 ]
Biney, Francis [2 ]
Tweneboah, Osei K. [3 ]
机构
[1] Univ Texas El Paso, Dept Math Sci, El Paso, TX 79968 USA
[2] Univ Texas El Paso, Computat Sci Program, El Paso, TX 79968 USA
[3] Ramapo Coll, Dept Data Sci, Mahwah, NJ 07430 USA
关键词
statistical learning; deep-feedforward neural network; heart disease; prostate cancer; breast cancer;
D O I
10.3390/math9090968
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
In this work, we investigated the prognosis of three medical data specifically, breast cancer, heart disease, and prostate cancer by using 10 machine learning models. We applied all 10 models to each dataset to identify patterns in them. Furthermore, we use the models to diagnose risk factors that increases the chance of these diseases. All the statistical learning techniques discussed were grouped into linear and nonlinear models based on their similarities and learning styles. The models performances were significantly improved by selecting models while taking into account the bias-variance tradeoffs and using cross-validation for selecting the tuning parameter. Our results suggests that no particular class of models or learning style dominated the prognosis and diagnosis for all three medical datasets. However nonlinear models gave the best predictive performance for breast cancer data. Linear models on the other hand gave the best predictive performance for heart disease data and a combination of linear and nonlinear models for the prostate cancer dataset.
引用
收藏
页数:30
相关论文
共 50 条
  • [31] Statistical models for analyzing count data: predictors of length of stay among HIV patients in Portugal using a multilevel model
    Ahmed Nabil Shaaban
    Bárbara Peleteiro
    Maria Rosario O. Martins
    BMC Health Services Research, 21
  • [32] Comparison of statistical models for analyzing genotype, inferred haplotype and molecular haplotype data.
    Wallenstein, Sylvan
    Chen, Jia
    Wetmur, James
    CANCER RESEARCH, 2006, 66 (08)
  • [33] STATISTICAL-MODELS FOR ANALYZING TIME-TO-OCCURRENCE DATA IN RADIOBIOLOGY AND RADIATION ONCOLOGY
    TAYLOR, JMG
    KIM, DK
    INTERNATIONAL JOURNAL OF RADIATION BIOLOGY, 1993, 64 (05) : 627 - 640
  • [34] A Comparison of different learning models used in Data Mining for Medical Data
    Srimani, P. K.
    Koti, Manjula Sanjay
    2ND INTERNATIONAL CONFERENCE ON METHODS AND MODELS IN SCIENCE AND TECHNOLOGY (ICM2ST-11), 2011, 1414
  • [35] Template Learning using Wavelet Domain Statistical Models
    Ramamurthy, Karthikeyan Natesan
    Thiagarajan, Jayaraman J.
    Spanias, Andreas
    RESEARCH AND DEVELOPMENT IN INTELLIGENT SYSTEMS XXVI: INCORPORATING APPLICATIONS AND INNOVATIONS IN INTELLIGENT SYSTEMS XVII, 2010, : 179 - 192
  • [36] Analyzing Job Analysis Data Using Mixture Rasch Models
    Wyse, Adam E.
    INTERNATIONAL JOURNAL OF TESTING, 2019, 19 (01) : 52 - 73
  • [37] Statistical Approach to Endurance Models - Data Processing Using Regression Models
    Trnka, Pavel
    Soucek, Jakub
    Hornak, Jaroslav
    Svoboda, Michal
    Koltunowicz, Tomasz
    Gutten, Miroslav
    PROCEEDINGS OF THE 2015 16TH INTERNATIONAL SCIENTIFIC CONFERENCE ON ELECTRIC POWER ENGINEERING (EPE), 2015, : 238 - 241
  • [38] Statistical models for jointly analyzing multiple allometries
    Gao, Huijiang
    Liu, Yongxin
    Zhang, Tingting
    Yang, Runqing
    Yang, Huanmin
    JOURNAL OF THEORETICAL BIOLOGY, 2013, 318 : 205 - 209
  • [39] Analyzing linguistic variation: Statistical models and methods
    Hoffmann, T
    JOURNAL OF SOCIOLINGUISTICS, 2005, 9 (02) : 293 - 298
  • [40] Spam filtering using statistical data compression models
    Department of Intelligent Systems, Jožef Stefan Institute, Jamova 39, Ljubljana, SI-1000, Slovenia
    不详
    不详
    J. Mach. Learn. Res., 2006, (2673-2698):