Federated Learning for Data Analytics in Education

被引:7
作者
Fachola, Christian [1 ]
Tornaria, Agustin [2 ]
Bermolen, Paola [2 ]
Capdehourat, German [3 ,4 ]
Etcheverry, Lorena [1 ]
Fariello, Maria Ines [2 ]
机构
[1] Univ Republica, Fac Ingn, Inst Comp, Montevideo 11300, Uruguay
[2] Univ Republica, Fac Ingn, Inst Matemat, Montevideo 11300, Uruguay
[3] Univ Republica, Fac Ingn, Inst Ingn Electr, Montevideo 11300, Uruguay
[4] Ceibal, Montevideo 11500, Uruguay
关键词
federated learning; learning analytics; PRIVACY; CHALLENGES;
D O I
10.3390/data8020043
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Federated learning techniques aim to train and build machine learning models based on distributed datasets across multiple devices while avoiding data leakage. The main idea is to perform training on remote devices or isolated data centers without transferring data to centralized repositories, thus mitigating privacy risks. Data analytics in education, in particular learning analytics, is a promising scenario to apply this approach to address the legal and ethical issues related to processing sensitive data. Indeed, given the nature of the data to be studied (personal data, educational outcomes, and data concerning minors), it is essential to ensure that the conduct of these studies and the publication of the results provide the necessary guarantees to protect the privacy of the individuals involved and the protection of their data. In addition, the application of quantitative techniques based on the exploitation of data on the use of educational platforms, student performance, use of devices, etc., can account for educational problems such as the determination of user profiles, personalized learning trajectories, or early dropout indicators and alerts, among others. This paper presents the application of federated learning techniques to a well-known learning analytics problem: student dropout prediction. The experiments allow us to conclude that the proposed solutions achieve comparable results from the performance point of view with the centralized versions, avoiding the concentration of all the data in a single place for training the models.
引用
收藏
页数:16
相关论文
共 25 条
[1]  
Banihashem S. K., 2018, INTERDISCIPLINARY J, V9, DOI [10.5812/ijvlms.63024, DOI 10.5812/IJVLMS.63024]
[2]   Federated Learning for Smart Healthcare: A Survey [J].
Dinh C Nguyen ;
Quoc-Viet Pham ;
Pathirana, Pubudu N. ;
Ding, Ming ;
Seneviratne, Aruna ;
Lin, Zihuai ;
Dobre, Octavia ;
Hwang, Won-Joo .
ACM COMPUTING SURVEYS, 2023, 55 (03)
[3]  
Divi S., 2021, arXiv
[4]   Ethical and Privacy Issues in the Design of Learning Analytics Applications [J].
Drachsler, Hendrik ;
Hoel, Tore ;
Cooper, Adam ;
Kismihok, Gabor ;
Berg, Alan ;
Scheffel, Maren ;
Chen, Weiqin ;
Ferguson, Rebecca .
LAK '16 CONFERENCE PROCEEDINGS: THE SIXTH INTERNATIONAL LEARNING ANALYTICS & KNOWLEDGE CONFERENCE,, 2016, :492-493
[5]   Differential privacy: A survey of results [J].
Dwork, Cynthia .
THEORY AND APPLICATIONS OF MODELS OF COMPUTATION, PROCEEDINGS, 2008, 4978 :1-19
[6]  
Feng WZ, 2019, AAAI CONF ARTIF INTE, P517
[7]  
FLEA, 2022, FLEA PROJ PUBL REP
[8]   Privacy-Preserving Learning Analytics: Challenges and Techniques [J].
Gursoy, Mehmet Emre ;
Inan, Ali ;
Nergiz, Mehmet Ercan ;
Saygin, Yucel .
IEEE TRANSACTIONS ON LEARNING TECHNOLOGIES, 2017, 10 (01) :68-81
[9]   A Framework for Edge-Assisted Healthcare Data Analytics using Federated Learning [J].
Hakak, Saqib ;
Ray, Suprio ;
Khan, Wazir Zada ;
Scheme, Erik .
2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, :3423-3427
[10]  
Kairouz P, 2021, Arxiv, DOI arXiv:2103.00039