Analyzing Medical Data by Using Statistical Learning Models

被引:0
|
作者
Mariani, Maria C. [1 ,2 ]
Biney, Francis [2 ]
Tweneboah, Osei K. [3 ]
机构
[1] Univ Texas El Paso, Dept Math Sci, El Paso, TX 79968 USA
[2] Univ Texas El Paso, Computat Sci Program, El Paso, TX 79968 USA
[3] Ramapo Coll, Dept Data Sci, Mahwah, NJ 07430 USA
关键词
statistical learning; deep-feedforward neural network; heart disease; prostate cancer; breast cancer;
D O I
10.3390/math9090968
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
In this work, we investigated the prognosis of three medical data specifically, breast cancer, heart disease, and prostate cancer by using 10 machine learning models. We applied all 10 models to each dataset to identify patterns in them. Furthermore, we use the models to diagnose risk factors that increases the chance of these diseases. All the statistical learning techniques discussed were grouped into linear and nonlinear models based on their similarities and learning styles. The models performances were significantly improved by selecting models while taking into account the bias-variance tradeoffs and using cross-validation for selecting the tuning parameter. Our results suggests that no particular class of models or learning style dominated the prognosis and diagnosis for all three medical datasets. However nonlinear models gave the best predictive performance for breast cancer data. Linear models on the other hand gave the best predictive performance for heart disease data and a combination of linear and nonlinear models for the prostate cancer dataset.
引用
收藏
页数:30
相关论文
共 50 条
  • [21] Analysis of self-regulated learning processing using statistical models for count data
    Greene, Jeffrey Alan
    Costa, Lara-Jeane
    Dellinger, Kristin
    METACOGNITION AND LEARNING, 2011, 6 (03) : 275 - 301
  • [22] Analysis of self-regulated learning processing using statistical models for count data
    Jeffrey Alan Greene
    Lara-Jeane Costa
    Kristin Dellinger
    Metacognition and Learning, 2011, 6 : 275 - 301
  • [23] Comparison of statistical models for analyzing genotype, inferred haplotype, and molecular haplotype data
    Wallenstein, Sylvan
    Chen, Jia
    Wetmur, James G.
    MOLECULAR GENETICS AND METABOLISM, 2006, 89 (03) : 270 - 273
  • [24] Analyzing Dyadic Sequence Data-Research Questions and Implied Statistical Models
    Fuchs, Peter
    Nussbeck, Fridtjof W.
    Meuwly, Nathalie
    Bodenmann, Guy
    FRONTIERS IN PSYCHOLOGY, 2017, 8
  • [25] Analyzing entities and topics in news articles using statistical topic models
    Newman, David
    Chemudugunta, Chaitanya
    Smyth, Padhraic
    Steyvers, Mark
    INTELLIGENCE AND SECURITY INFORMATICS, PROCEEDINGS, 2006, 3975 : 93 - 104
  • [26] An Analyzing Algorithm Based On Learning And Searching In Chinese Medical Big Data
    Luo Jie
    Li Zhimin
    Zhu Binger
    Wang Wei
    Xie Shuming
    2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 1573 - 1579
  • [27] Medical Diagnosis Using Machine Learning: A Statistical Review
    Bhavsar, Kaustubh Arun
    Singla, Jimmy
    Al-Otaibi, Yasser D.
    Song, Oh-Young
    Bin Zikriya, Yousaf
    Bashir, Ali Kashif
    CMC-COMPUTERS MATERIALS & CONTINUA, 2021, 67 (01): : 107 - 125
  • [28] ANALYZING PROTEIN DATA USING UNSUPERVISED LEARNING TECHNIQUES
    Albert, Silvana
    Teletin, Mihai
    Czibula, Gabriela
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2018, 14 (03): : 861 - 880
  • [29] Analyzing Meteorological Data Using Unsupervised Learning Techniques
    Mihai, Andrei
    Czibula, Gabriela
    Mihulet, Eugen
    2019 IEEE 15TH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTER COMMUNICATION AND PROCESSING (ICCP 2019), 2019, : 529 - 536
  • [30] Statistical models for analyzing count data: predictors of length of stay among HIV patients in Portugal using a multilevel model
    Shaaban, Ahmed Nabil
    Peleteiro, Barbara
    Martins, Maria Rosario O.
    BMC HEALTH SERVICES RESEARCH, 2021, 21 (01)