Constructing optimal educational tests using GMDH-based item ranking and selection

被引:17
作者
Abdel-Aal, Radwan E. [1 ]
El-Alfy, Ei-Sayed M. [2 ]
机构
[1] King Fahd Univ Petr & Minerals, Dept Comp Engn, Coll Comp Sci & Engn, Dhahran 31261, Saudi Arabia
[2] King Fahd Univ Petr & Minerals, Dept Informat & Comp Sci, Coll Comp Sci & Engn, Dhahran 31261, Saudi Arabia
关键词
GMDH algorithm; Abductive networks; Neural networks; Machine learning; Optimal test design; Feature selection; Feature ranking; Educational measurements; Item response theory; Mutual information; Filter methods; Wrapper methods; Genetic algorithms; MUTUAL INFORMATION; FEATURES;
D O I
10.1016/j.neucom.2008.02.004
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Item ranking and selection plays a key role in constructing concise and informative educational tests. Traditional techniques based on the item response theory (IRT) have been used to automate this task, but they require model parameters to be determined a priori for each item and their application becomes more tedious with larger item banks. Machine-learning techniques can be used to build data-based models that relate the test result as output to the examinees' responses to various test items as inputs. With this approach, test item selection can benefit from the vast amount of literature on feature selection in many areas of machine learning and artificial intelligence that are characterized by high data dimensionality. This paper describes a novel technique for item ranking and selection using abductive network pass/fail classifiers based on the group method of data handling (GMDH). Experiments were carried out on a dataset consisting of the response of 2000 examinees to 45 test items together with the examinee's true ability level. The approach utilizes the ability of GMDH-based learning algorithms to automatically select optimum input features from a set of available inputs. Rankings obtained by iteratively applying this procedure are similar to those based on the average item information function (IIF) at the pass-fail ability threshold, IIF (theta = 0), and the average information gain (IG). An optimum item subset derived from the GMDH-based ranking contains only one third of the test items and performs pass/fail classification with 91.2% accuracy on a 500-case evaluation subset, compared to 86.8% for a randomly selected item subset of the same size and 92% for a subset of the 15 items having the largest values for IIF (theta = 0). Item rankings obtained with the proposed approach compare favorably with those obtained using neural network modeling and popular filter type feature selection methods, and the proposed approach is much faster than wrapper methods employing genetic search. (C) 2008 Elsevier B.V. All rights reserved.
引用
收藏
页码:1184 / 1197
页数:14
相关论文
共 48 条
[1]   GMDH-based feature ranking and selection for improved classification of medical data [J].
Abdel-Aal, RE .
JOURNAL OF BIOMEDICAL INFORMATICS, 2005, 38 (06) :456-468
[2]   Short-term hourly load forecasting using abductive networks [J].
Abdel-Aal, RE .
IEEE TRANSACTIONS ON POWER SYSTEMS, 2004, 19 (01) :164-173
[3]   Automatic fitting of Gaussian peaks using abductive machine learning [J].
Abdel-Aal, RE .
IEEE TRANSACTIONS ON NUCLEAR SCIENCE, 1998, 45 (01) :1-16
[4]  
ABDELAAL RE, 1995, WEATHER FORECAST, V10, P310, DOI 10.1175/1520-0434(1995)010<0310:MAFTDM>2.0.CO
[5]  
2
[6]  
AbdelAal RE, 1996, METHOD INFORM MED, V35, P265
[7]   Reduced feature-set based parallel CHMM speech recognition systems [J].
Abdulla, WH ;
Kasabov, N .
INFORMATION SCIENCES, 2003, 156 (1-2) :21-38
[8]  
AbTech, 1990, AIM US MAN
[9]  
Agarwal A., 1999, Journal of Applied Business Research (JABR), V15, P1
[10]  
Aha David W., 1996, COMP EVALUATION SEQU