Enhanced Student Admission Procedures at Universities Using Data Mining and Machine Learning Techniques

被引:1
|
作者
Assiri, Basem [1 ]
Bashraheel, Mohammed [2 ]
Alsuri, Ala [2 ]
机构
[1] Jazan Univ, Coll Comp Sci & Informat Technol, Comp Sci Dept, Jazan 82817, Saudi Arabia
[2] Jazan Univ, Coll Comp Sci & Informat Technol, Dept Informat Technol & Secur, Jazan 82817, Saudi Arabia
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 03期
关键词
students; university admission; major selection; data mining analysis; machine learning models; SIMILARITY INDEXES; PERFORMANCE; JACCARD;
D O I
10.3390/app14031109
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The progress of technology has played a crucial role in enhancing various fields such as education. Universities in Saudi Arabia offer free education to students and follow specific admission policies. These policies usually focus on features and scores such as the high school grade point average, general aptitude test, and achievement test. The main issue with current admission policies is that they do not fit with all majors, which results in high rates of failure, dropouts, and transfer. Another issue is that all mentioned features and scores are cumulatively calculated, which obscures some details. Therefore, this study aims to explore admission criteria used in Saudi Arabian universities and the factors that influence students' choice of major. First, using data mining techniques, the research analyzes the relationships and similarities between the university's grade point average and the other student admission features. The study proposes a new Jaccard model that includes modified Jaccard and approximated modified Jaccard techniques to match the specifications of students' data records. It also uses data distribution analysis and correlation coefficient analysis to understand the relationships between admission features and student performance. The investigation shows that relationships vary from one major to another. Such variations emphasize the weakness of the generalization of the current procedures since they are not applicable to all majors. Additionally, the analysis highlights the importance of hidden details such as high school course grades. Second, this study employs machine learning models to incorporate additional features, such as high school course grades, to find suitable majors for students. The K-nearest neighbor, decision tree, and support vector machine algorithms were used to classify students into appropriate majors. This process significantly improves the enrolment of students in majors that align with their skills and interests. The results of the experimental simulation indicate that the K-nearest neighbor algorithm achieves the highest accuracy rate of 100%, while the decision tree algorithm's accuracy rate is 81% and the support vector machine algorithm's accuracy rate is 75%. This encourages the idea of using machine learning models to find a suitable major for applicants.
引用
收藏
页数:19
相关论文
共 50 条
  • [41] Mining of soil data for predicting the paddy productivity by machine learning techniques
    Antony, Ajitha
    Karuppasamy, Ramanathan
    PADDY AND WATER ENVIRONMENT, 2023, 21 (02) : 231 - 242
  • [42] A survey on data mining and machine learning techniques for diagnosing hepatitis disease
    Tasneem, Tabeen
    Kabir, Mir Md. Jahangir
    Xu, Shuxiang
    Tasneem, Tazeen
    INTERNATIONAL JOURNAL OF BIOMEDICAL ENGINEERING AND TECHNOLOGY, 2023, 41 (04) : 340 - 375
  • [43] Exploration of Machine Learning and Data Mining techniques on a horse racing dataset
    Kyriacou, E
    Toolan, F
    Dunnion, J
    MLMTA '05: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON MACHINE LEARNING MODELS TECHNOLOGIES AND APPLICATIONS, 2005, : 161 - 166
  • [44] Applying data mining and machine learning techniques for sentiment shifter identification
    Rahimi, Zeinab
    Noferesti, Samira
    Shamsfard, Mehrnoush
    LANGUAGE RESOURCES AND EVALUATION, 2019, 53 (02) : 279 - 302
  • [45] Mining of soil data for predicting the paddy productivity by machine learning techniques
    Ajitha Antony
    Ramanathan Karuppasamy
    Paddy and Water Environment, 2023, 21 : 231 - 242
  • [46] Continuous acoustic data mining using machine learning
    de la SELLE, Théotime
    Deschanel, Stéphanie
    Weiss, Jérôme
    e-Journal of Nondestructive Testing, 2024, 29 (10):
  • [47] Mining Process Control Data Using Machine Learning
    Nasr, Emad S. Abouel
    Al-Mubaid, Hisham
    CIE: 2009 INTERNATIONAL CONFERENCE ON COMPUTERS AND INDUSTRIAL ENGINEERING, VOLS 1-3, 2009, : 1434 - +
  • [48] Analysis of Various Machine Learning Algorithms for Enhanced Opinion Mining using Twitter Data Streams
    Kumar, Praveen
    Choudhury, Tanupriya
    Rawat, Seema
    Jayaraman, Shobhna
    2016 INTERNATIONAL CONFERENCE ON MICRO-ELECTRONICS AND TELECOMMUNICATION ENGINEERING (ICMETE), 2016, : 265 - 270
  • [49] Analysis of Student Performance Applying Data Mining Techniques in a Virtual Learning Environment
    Aguagallo L.
    Salazar-Fierro F.
    García-Santillán J.
    Posso-Yépez M.
    Landeta-López P.
    García-Santillán I.
    International Journal of Emerging Technologies in Learning, 2023, 18 (11) : 175 - 195
  • [50] Data Mining and Machine Learning in Education with Focus in Undergraduate CS Student Success
    Johnson, William Gregory
    ICER'18: PROCEEDINGS OF THE 2018 ACM CONFERENCE ON INTERNATIONAL COMPUTING EDUCATION RESEARCH, 2018, : 270 - 271