Integrated framework to integrate Spark-based big data analytics and for health monitoring and recommendation in sports using XGBoost algorithm

Cited by: 1
Authors
Zhao, Yin [1]
Ramos, Ma. Finipina [2]
Li, Bin [3]
Affiliations
[1] Southwest Med Univ, Sch Phys Educ, Studies Sect, Luzhou 646000, Sichuan, Peoples R China
[2] Jose Rizal Univ, Grad Sch, Mandaluyong 1552, Philippines
[3] Southwest Med Univ, Sch Phys Educ, Luzhou 646000, Sichuan, Peoples R China
Keywords
Big data; Spark; Data mining; XGBoost algorithm; Sports medical integration; Service system construction
DOI
10.1007/s00500-023-09450-9
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In recent years, technological advances have spread across many industries, including sports medicine. Big data analytics and data mining in particular have reshaped medical services in sports. This shift is motivated by the need to enhance athletic performance, prevent injuries, and offer individualized health advice. At the same time, modern lifestyles have increased people's attention to their health, creating demand for better medical services. However, China's capacity to provide high-quality medical care is constrained by a shortage of medical resources and an ever-growing patient population. To address these challenges, this paper presents an integrated framework that combines Spark-based big data analytics with the XGBoost algorithm. The framework aims to provide a robust sports medical service encompassing real-time health monitoring and data-driven insights. Built on the distributed computing platform Spark, it manages the extensive sports data generated during training and events and facilitates immediate health evaluations. The XGBoost algorithm is used for data mining to strengthen health prediction and recommendation; it is well suited to identifying complex patterns and trends in sports data and to handling intricate feature selection and modeling tasks. Empirical findings indicate substantial improvements in sports medical services. When applied to chronic disease datasets, the XGBoost algorithm achieved a 93% trust rate. The proposed framework consistently outperforms conventional methods such as K-Nearest Neighbors (KNN), Random Forest (RF), Decision Trees (DT), Support Vector Machines (SVM), Naive Bayes (NB), and Logistic Regression (LR). This performance underscores the potential of the integrated framework to improve sports medical services.
Pages: 1585-1608
Page count: 24
Related papers
43 in total
  • [1] Integrated framework to integrate Spark-based big data analytics and for health monitoring and recommendation in sports using XGBoost algorithm
    Yin Zhao
    Ma. Finipina Ramos
    Bin Li
    Soft Computing, 2024, 28 : 1585 - 1608
  • [2] A Dynamic Spark-based Classification Framework for Imbalanced Big Data
    Abdel-Hamid, Nahla B.
    ElGhamrawy, Sally
    El Desouky, Ali
    Arafat, Hesham
    JOURNAL OF GRID COMPUTING, 2018, 16 (04) : 607 - 626
  • [3] A Dynamic Spark-based Classification Framework for Imbalanced Big Data
    Nahla B. Abdel-Hamid
    Sally ElGhamrawy
    Ali El Desouky
    Hesham Arafat
    Journal of Grid Computing, 2018, 16 : 607 - 626
  • [4] Efficient Spark-Based Framework for Big Geospatial Data Query Processing and Analysis
    Aljawarneh, Isam Mashhour
    Bellavista, Paolo
    Corradi, Antonio
    Montanari, Rebecca
    Foschini, Luca
    Zanotti, Andrea
    2017 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS (ISCC), 2017 : 851 - 856
  • [5] An Efficient Spark-Based Hybrid Frequent Itemset Mining Algorithm for Big Data
    Al-Bana, Mohamed Reda
    Farhan, Marwa Salah
    Othman, Nermin Abdelhakim
    DATA, 2022, 7 (01)
  • [6] A distributed frequent itemset mining algorithm using Spark for Big Data analytics
    Feng Zhang
    Min Liu
    Feng Gui
    Weiming Shen
    Abdallah Shami
    Yunlong Ma
    Cluster Computing, 2015, 18 : 1493 - 1501
  • [7] A distributed frequent itemset mining algorithm using Spark for Big Data analytics
    Zhang, Feng
    Liu, Min
    Gui, Feng
    Shen, Weiming
    Shami, Abdallah
    Ma, Yunlong
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2015, 18 (04) : 1493 - 1501
  • [8] A new Apache Spark-based framework for big data streaming forecasting in IoT networks
    Fernandez-Gomez, Antonio M.
    Gutierrez-Aviles, David
    Troncoso, Alicia
    Martinez-Alvarez, Francisco
    JOURNAL OF SUPERCOMPUTING, 2023, 79 (10) : 11078 - 11100
  • [9] A new Apache Spark-based framework for big data streaming forecasting in IoT networks
    Antonio M. Fernández-Gómez
    David Gutiérrez-Avilés
    Alicia Troncoso
    Francisco Martínez-Álvarez
    The Journal of Supercomputing, 2023, 79 : 11078 - 11100
  • [10] An Efficient Parallel Algorithm for Clustering Big Data based on the Spark Framework
    Dafir, Zineb
    Slaoui, Said
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (07) : 890 - 896