Novel Considerations in the ML/AI Modeling of Large-Scale Learning Loss

Cited by: 0
Authors
Elizondo, Mirna [1]
Yu, June [2]
Payan, Daniel [3]
Feng, Li [4]
Tesic, Jelena [1]
Affiliations
[1] Texas State University, Department of Computer Science, San Marcos, TX 78666, USA
[2] State of Texas Legislative Budget Board, Austin, TX 78701, USA
[3] Love's Travel Stops & Country Stores, Yukon, OK 73099, USA
[4] Texas State University, Department of Finance and Economics, San Marcos, TX 78666, USA
Source
IEEE Access | 2025 / Vol. 13
Keywords
Data models; Deep learning; Random forests; Noise measurement; Nearest neighbor methods; Logistic regression; Biological system modeling; Artificial neural networks; Support vector machines; Radio frequency; Noisy tabular data; data in the wild; gradient boosting; feature selection; dimensionality reduction
DOI
10.1109/ACCESS.2025.3526412
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This study charts a path forward for the large-scale, data-driven quantitative analysis of noisy open-source data resources. The goal is to support the qualitative findings of smaller studies with extensive open-source, data-driven analytics in a new way. The research focuses on learning interventions. It uses nine publicly accessible datasets to understand and mitigate factors contributing to learning loss and the practical learning recovery measures in Texas public school districts after the recent school closures. The data came from the Census Bureau 2010, USAFACTS, the Texas Department of State Health Services (DSHS), the National Center for Education Statistics (CCD), the US Bureau of Labor Statistics (LAUS), and three sources from the Texas Education Agency (STAAR, TEA, ADA, ESSER). We demonstrate a novel data-driven approach to discovering insights from an extensive collection of heterogeneous public data sources. For the pandemic school closure period, the mode of instruction and prior score emerged as the primary resilience factors in the learning recovery intervention method. Grade level and census community income level are the most influential factors in predicting learning loss for both Math and Reading. We demonstrate that unbiased, data-driven analysis at a larger scale can offer policymakers an actionable understanding of how to identify learning-loss tendencies and prevent them in public schools.
Pages: 7780-7792
Number of pages: 13