Unsupervised machine learning to classify language dimensions to constitute the linguistic complexity of mathematical word problems

被引：1

作者：

Bednorz, David ^{[1
]}

Kleine, Michael ^{[2
]}

机构：

[1] IPN Leibniz Inst Sciene & Math Educ, Dept Math Educ, Kiel, Germany

[2] Bielefeld Univ, Dept Math Educ, Bielefeld, Germany

来源：

INTERNATIONAL ELECTRONIC JOURNAL OF MATHEMATICS EDUCATION | 2023年 / 18卷 / 01期

关键词：

language dimensions; mathematical word problems; linguistic complexity; machine learning; unsupervised machine learning; ACADEMIC-LANGUAGE; MINORITY-STUDENTS; TEXT; LEARNERS; COMPREHENSION; PERFORMANCE; KNOWLEDGE;

D O I：

10.29333/iejme/12588

中图分类号：

G40 [教育学];

学科分类号：

040101 ; 120403 ;

摘要：

The study examines language dimensions of mathematical word problems and the classification of mathematical word problems according to these dimensions with unsupervised machine learning (ML) techniques. Previous research suggests that the language dimensions are important for mathematical word problems because it has an influence on the linguistic complexity of word problems. Depending on the linguistic complexity students can have language obstacles to solve mathematical word problems. A lot of research in mathematics education research focus on the analysis on the linguistic complexity based on theoretical build language dimensions. To date, however it has been unclear what empirical relationship between the linguistic features exist for mathematical word problems. To address this issue, we used unsupervised ML techniques to reveal latent linguistic structures of 17 linguistic features for 342 mathematical word problems and classify them. The models showed that three -and five-dimensional linguistic structures have the highest explanatory power. Additionally, the authors consider a four-dimensional solution. Mathematical word problem from the three-dimensional solution can be classify in two groups, three-and five-dimensional solutions in three groups. The findings revealed latent linguistic structures and groups that could have an implication of the linguistic complexity of mathematical word problems and differ from language dimensions, which are considered theoretically. Therefore, the results indicate for new design principles for interventions and materials for language education in mathematics learning and teaching.

引用

页数：16

共 37 条

[21] Flexibility when Dealing with Situational Structures in Mathematical Contexts-A Preliminary Study Investigating a Learning Framework on Solving Additive Word Problems
Gabler, Laura
Ufer, Stefan
JOURNAL FUR MATHEMATIK-DIDAKTIK, 2021, 42 (01): : 61 - 96
[22] Text Complexity of Chinese Elementary School Textbooks: Analysis of Text Linguistic Features Using Machine Learning Algorithms
Liu, Miaomiao
Li, Yixun
Su, Yongqiang
Li, Hong
SCIENTIFIC STUDIES OF READING, 2024, 28 (03) : 235 - 255
[23] Unsupervised machine learning to classify crystal structures according to their structural distortion: A case study on Li-argyrodite solid-state electrolytes
Gallo-Bueno, A.
Reynaud, M.
Casas-Cabanas, M.
Carrasco, J.
ENERGY AND AI, 2022, 9
[24] Evaluating Coding Proficiency of Large Language Models: An Investigation Through Machine Learning Problems
Ko, Eunbi
Kang, Pilsung
IEEE ACCESS, 2025, 13 : 52925 - 52938
[25] Evaluating an Automated Analysis Using Machine Learning and Natural Language Processing Approaches to Classify Computer Science Students' Reflective Writing
Alrashidi, Huda
Almujally, Nouf
Kadhum, Methaq
Ullmann, Thomas Daniel
Joy, Mike
PERVASIVE COMPUTING AND SOCIAL NETWORKING, ICPCSN 2022, 2023, 475 : 463 - 477
[26] Leveraging ensemble machine learning and multimodal video complexity for better prediction of video difficulty in second language
Alghamdi, Emad A.
INTERACTIVE LEARNING ENVIRONMENTS, 2024,
[27] Using a computer-based learning task to promote work on mathematical relationships in the context of word problems in early grades
Freiman, Viktor
Polotskaia, Elena
Savard, Annie
ZDM-MATHEMATICS EDUCATION, 2017, 49 (06): : 835 - 849
[28] FindICI: Using machine learning to detect linguistic inconsistencies between code and natural language descriptions in infrastructure-as-code
Nemania Borovits
Indika Kumara
Dario Di Nucci
Parvathy Krishnan
Stefano Dalla Palma
Fabio Palomba
Damian A. Tamburri
Willem-Jan van den Heuvel
Empirical Software Engineering, 2022, 27
[29] FindICI: Using machine learning to detect linguistic inconsistencies between code and natural language descriptions in infrastructure-as-code
Borovits, Nemania
Kumara, Indika
Di Nucci, Dario
Krishnan, Parvathy
Dalla Palma, Stefano
Palomba, Fabio
Tamburri, Damian A.
van den Heuvel, Willem-Jan
EMPIRICAL SOFTWARE ENGINEERING, 2022, 27 (07)
[30] Unsupervised machine learning identified distinct population clusters based on symptoms of oral pain, psychological distress, and sleep problems
Chuinsiri, Nontawat
JOURNAL OF INTERNATIONAL SOCIETY OF PREVENTIVE AND COMMUNITY DENTISTRY, 2021, 11 (05) : 531 - 538

← 1 2 3 4 →