Victimization (V) of Big Data: A Solution Using Federated Learning

被引:0
作者
Shivkumar, S. [1 ,2 ]
Supriya, M. [1 ,2 ]
机构
[1] Amrita Vishwa Vidyapeetham, Dept Comp Sci Engn, Bengaluru 560036, Karnataka, India
[2] Amrita Vishwa Vidyapeetham, Amrita Sch Comp, Bengaluru 560036, Karnataka, India
来源
SMART TRENDS IN COMPUTING AND COMMUNICATIONS, VOL 1, SMARTCOM 2024 | 2024年 / 945卷
关键词
Victimization; Big data; Federated learning; Apache Spark; HEALTH-CARE; DATA ANALYTICS; DATA-SECURITY; PRIVACY;
D O I
10.1007/978-981-97-1320-2_15
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The rapid emergence of big data has revolutionized the way organizations perceive and utilize information. With its unparalleled ability to process vast volumes of data at high speeds and handle diverse data types, big data is reshaping industries and enabling evidence-based decision-making. However, the proliferation of big data presents significant privacy challenges. The extensive collection, aggregation, and analysis of diverse datasets can inadvertently expose sensitive personal information, leading to potential breaches of individual privacy. In this work, a new "V" called victimization is introduced as a characteristic of big data. This issue can lead to hazardous consequences. To address the vulnerabilities due to this characteristic, a federated learning approach is proposed as a solution. The proposed approach was tested on two datasets in the domain of health care. The model was also trained using the conventional deep learning approach and Pyspark. The findings in our research suggest that the federated learning approach helps in overcoming those issues leading to victimization without compromising the performance of the model.
引用
收藏
页码:171 / 182
页数:12
相关论文
共 39 条
  • [21] kaggle, Pima Indians Diabetes Database
  • [22] Big Data for Healthcare Industry 4.0: Applications, challenges and future perspectives
    Karatas, Mumtaz
    Eriskin, Levent
    Deveci, Muhammet
    Pamucar, Dragan
    Garg, Harish
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2022, 200
  • [23] Keikhosrokiani P, 2022, Big data analytics for healthcare, P201, DOI [10.1016/B978-0-323-91907-4.00011-X, DOI 10.1016/B978-0-323-91907-4.00011-X]
  • [24] Big data analytics in healthcare: a systematic literature review
    Khanra, Sayantan
    Dhir, Amandeep
    Islam, Najmul
    Mantymaki, Matti
    [J]. ENTERPRISE INFORMATION SYSTEMS, 2020, 14 (07) : 878 - 912
  • [25] Diabetic Retinopathy Binary Image Classification Using Pyspark
    Kotiyal, Bina
    Pathak, Heman
    [J]. INTERNATIONAL JOURNAL OF MATHEMATICAL ENGINEERING AND MANAGEMENT SCIENCES, 2022, 7 (05) : 624 - 642
  • [26] A Comprehensive Survey on Machine Learning-Based Big Data Analytics for IoT-Enabled Smart Healthcare System
    Li, Wei
    Chai, Yuanbo
    Khan, Fazlullah
    Jan, Syed Rooh Ullah
    Verma, Sahil
    Menon, Varun G.
    Kavita
    Li, Xingwang
    [J]. MOBILE NETWORKS & APPLICATIONS, 2021, 26 (01) : 234 - 252
  • [27] The state of the art and taxonomy of big data analytics: view from new big data framework
    Mohamed, Azlinah
    Najafabadi, Maryam Khanian
    Wah, Yap Bee
    Zaman, Ezzatul Akmal Kamaru
    Maskat, Ruhaila
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2020, 53 (02) : 989 - 1037
  • [28] Nambiar Seema, 2020, Procedia Computer Science, V171, P2734, DOI 10.1016/j.procs.2020.04.297
  • [29] Nazari E, 2023, Front Health Inf, V8
  • [30] Raviteja Kantamneni, 2020, International Conference on Communication, Computing and Electronics Systems. Proceedings of ICCCES 2019. Lecture Notes in Electrical Engineering (LNEE 637), P581, DOI 10.1007/978-981-15-2612-1_56