Unsupervised machine learning to classify language dimensions to constitute the linguistic complexity of mathematical word problems

被引:1
|
作者
Bednorz, David [1 ]
Kleine, Michael [2 ]
机构
[1] IPN Leibniz Inst Sciene & Math Educ, Dept Math Educ, Kiel, Germany
[2] Bielefeld Univ, Dept Math Educ, Bielefeld, Germany
关键词
language dimensions; mathematical word problems; linguistic complexity; machine learning; unsupervised machine learning; ACADEMIC-LANGUAGE; MINORITY-STUDENTS; TEXT; LEARNERS; COMPREHENSION; PERFORMANCE; KNOWLEDGE;
D O I
10.29333/iejme/12588
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
The study examines language dimensions of mathematical word problems and the classification of mathematical word problems according to these dimensions with unsupervised machine learning (ML) techniques. Previous research suggests that the language dimensions are important for mathematical word problems because it has an influence on the linguistic complexity of word problems. Depending on the linguistic complexity students can have language obstacles to solve mathematical word problems. A lot of research in mathematics education research focus on the analysis on the linguistic complexity based on theoretical build language dimensions. To date, however it has been unclear what empirical relationship between the linguistic features exist for mathematical word problems. To address this issue, we used unsupervised ML techniques to reveal latent linguistic structures of 17 linguistic features for 342 mathematical word problems and classify them. The models showed that three -and five-dimensional linguistic structures have the highest explanatory power. Additionally, the authors consider a four-dimensional solution. Mathematical word problem from the three-dimensional solution can be classify in two groups, three-and five-dimensional solutions in three groups. The findings revealed latent linguistic structures and groups that could have an implication of the linguistic complexity of mathematical word problems and differ from language dimensions, which are considered theoretically. Therefore, the results indicate for new design principles for interventions and materials for language education in mathematics learning and teaching.
引用
收藏
页数:16
相关论文
共 37 条
  • [31] Machine learning based framework for fine-grained word segmentation and enhanced text normalization for low resourced language
    Nazir, Shahzad
    Asif, Muhammad
    Rehman, Mariam
    Ahmad, Shahbaz
    PEERJ COMPUTER SCIENCE, 2024, 10
  • [32] Predicting Emerging Themes in Rapidly Expanding COVID-19 Literature With Unsupervised Word Embeddings and Machine Learning: Evidence-Based Study
    Pal, Ridam
    Chopra, Harshita
    Awasthi, Raghav
    Bandhey, Harsh
    Nagori, Aditya
    Sethi, Tavpritesh
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2022, 24 (11)
  • [33] CoLI-Machine Learning Approaches for Code-mixed Language Identification at the Word Level in Kannada-English Texts
    Lakshmaiah, Shashirekha Hosahalli
    Balouchzahi, Fazlourrahman
    Anusha, Mudoor Devadas
    Sidorov, Grigori
    ACTA POLYTECHNICA HUNGARICA, 2022, 19 (10) : 123 - 141
  • [34] Exploiting linguistic information from Nepali transcripts for early detection of Alzheimer's disease using natural language processing and machine learning techniques
    Adhikari, Surabhi
    Thapa, Surendrabikram
    Naseem, Usman
    Singh, Priyanka
    Huo, Huan
    Bharathy, Gnana
    Prasad, Mukesh
    INTERNATIONAL JOURNAL OF HUMAN-COMPUTER STUDIES, 2022, 160
  • [35] Challenges and solutions to employing natural language processing and machine learning to measure patients' health literacy and physician writing complexity: The ECLIPPSE study
    Brown, I. I. I. William
    Balyan, Renu
    Karter, Andrew J.
    Crossley, Scott
    Semere, Wagahta
    Duran, Nicholas D.
    Lyles, Courtney
    Liu, Jennifer
    Moffet, Howard H.
    Daniels, Ryane
    McNamara, Danielle S.
    Schillinger, Dean
    JOURNAL OF BIOMEDICAL INFORMATICS, 2021, 113
  • [36] Analyzing credit risk model problems through natural language processing-based clustering and machine learning: insights from validation reports
    Lis, Szymon
    Kubkowski, Mariusz
    Borkowska, Olimpia
    Serwa, Dobromil
    Kurpanik, Jaroslaw
    JOURNAL OF RISK MODEL VALIDATION, 2024, 18 (02): : 59 - 86
  • [37] Cautions, Concerns, and Future Directions for Using Machine Learning in Relation to Mental Health Problems and Clinical and Forensic Risks: A Brief Comment on "Model Complexity Improves the Prediction of Nonsuicidal Self-Injury" (Fox et al., 2019)
    Siddaway, Andy P.
    Quinlivan, Leah
    Kapur, Nav
    O'Connor, Rory C.
    de Beurs, Derek
    JOURNAL OF CONSULTING AND CLINICAL PSYCHOLOGY, 2020, 88 (04) : 384 - 387