A feed-forward network for input that is both categorical and quantitative

Cited by: 26
Author
Brouwer, RK [1]
Affiliation
[1] Univ Coll Cariboo, Dept Comp Sci, Kamloops, BC V2C 5N3, Canada
Funding
Natural Sciences and Engineering Research Council of Canada
Keywords
feed forward neural networks; multi-layer perceptron; nominal input; indicator variables; categorical variable regression; ANOVA
DOI
10.1016/S0893-6080(02)00090-4
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The data on which a multi-layer perceptron (MLP) is to be trained to approximate a continuous function may have inputs that are categorical, such as color, gender, or race, rather than numeric or quantitative. A categorical variable causes a discontinuous relationship between an input variable and the output. An MLP, with connection matrices that multiply input values and sigmoid functions that further transform values, represents a continuous mapping in all input variables, and therefore requires that all inputs correspond to numeric, continuously valued variables. The usual way of dealing with this problem is to replace the categorical values with numeric ones and treat them as if they were continuously valued. However, there is no meaningful correspondence between the continuous quantities generated this way and the original categorical values. Another approach is to encode the categorical portion of the input using 1-out-of-n encoding and include this code as input to the MLP. The approach in this paper is to segregate the categorical variables completely from the continuous independent variables. The MLP is trained with multiple outputs: a separate output unit for each allowed combination of values of the categorical independent variables. During training, the categorical value or combination of categorical values determines which output unit should carry the target value, with the remaining outputs being 'do not care'. Three data sets were used to compare the methods. Results show that this approach is much more effective than the conventional approach of assigning continuous values to the categorical features. For the data set with several categorical variables, the proposed method is also more effective than the 1-out-of-n input method. © 2002 Elsevier Science Ltd. All rights reserved.
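The training scheme described in the abstract, one output unit per categorical combination with the inactive outputs treated as 'do not care', can be illustrated with a small NumPy sketch. This is not the paper's implementation. The network size, tanh hidden units, linear outputs, learning rate, synthetic data, and all function names (`init_mlp`, `masked_loss_and_grads`, `train`, `predict`) are assumptions chosen for illustration. Only the continuous input is fed to the network; the category index merely selects which output receives an error signal.

```python
import numpy as np

def init_mlp(n_in, n_hidden, n_out, rng):
    """One-hidden-layer MLP; sizes and init scale are illustrative assumptions."""
    return {
        "W1": rng.normal(0.0, 0.5, (n_in, n_hidden)),
        "b1": np.zeros(n_hidden),
        "W2": rng.normal(0.0, 0.5, (n_hidden, n_out)),
        "b2": np.zeros(n_out),
    }

def forward(params, x):
    h = np.tanh(x @ params["W1"] + params["b1"])  # hidden activations
    y = h @ params["W2"] + params["b2"]           # one linear output per category
    return h, y

def masked_loss_and_grads(params, x, cat, target):
    """Squared error on the output selected by `cat`; other outputs are ignored."""
    n = x.shape[0]
    h, y = forward(params, x)
    mask = np.zeros_like(y)
    mask[np.arange(n), cat] = 1.0                 # 1 only at the active output unit
    err = (y - target[:, None]) * mask            # zero error on 'do not care' units
    loss = 0.5 * np.sum(err ** 2) / n
    dy = err / n                                  # gradient of loss w.r.t. outputs
    dh = (dy @ params["W2"].T) * (1.0 - h ** 2)   # back through tanh
    grads = {
        "W2": h.T @ dy, "b2": dy.sum(axis=0),
        "W1": x.T @ dh, "b1": dh.sum(axis=0),
    }
    return loss, grads

def train(params, x, cat, target, lr=0.1, steps=2000):
    """Plain full-batch gradient descent; returns the loss history."""
    losses = []
    for _ in range(steps):
        loss, grads = masked_loss_and_grads(params, x, cat, target)
        for name in params:
            params[name] -= lr * grads[name]
        losses.append(loss)
    return losses

def predict(params, x, cat):
    """Read the prediction from the output unit that matches each category."""
    _, y = forward(params, x)
    return y[np.arange(x.shape[0]), cat]

# Synthetic data (assumed): one continuous input, one categorical variable
# with three levels that shifts the target discontinuously.
rng = np.random.default_rng(0)
x = rng.uniform(-1.0, 1.0, (200, 1))
cat = rng.integers(0, 3, 200)
offsets = np.array([-1.0, 0.5, 2.0])
target = np.sin(2.0 * x[:, 0]) + offsets[cat]

params = init_mlp(1, 8, 3, rng)
losses = train(params, x, cat, target)
```

With several categorical variables, `cat` would index the allowed combinations of their values rather than the levels of a single variable. At prediction time, `predict(params, x, cat)` reads only the output unit that matches each sample's category.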
Pages: 881-890
Page count: 10
Related papers
10 entries in total
[1] Bishop C. M., 1994, NCRG94004
[2] Bishop C. M., 1995, NEURAL NETWORKS PATT
[3] Training a feed-forward network by feeding gradients forward rather than by back-propagation of errors [J]. Brouwer, RK. NEUROCOMPUTING, 1997, 16 (02): 117-126
[4] HEDONIC HOUSING PRICES AND DEMAND FOR CLEAN-AIR [J]. HARRISON, D; RUBINFELD, DL. JOURNAL OF ENVIRONMENTAL ECONOMICS AND MANAGEMENT, 1978, 5 (01): 81-102
[5] JACOBS RA, 1992, NEURAL COMPUT, V31, P79
[6] Lee KW, 2001, IEEE IJCNN, P93, DOI 10.1109/IJCNN.2001.938998
[7] NETER J, 1990, APPL LINEAR STAT MOD, P349
[8] Rumelhart D.E., 1987, Parallel Distributed Processing: Explorations in the Microstructure of Cognition, P318
[9] Wasserman PD., 1993, ADV METHODS NEURAL C
[10] Werbos P. J, 1974, THESIS HARVARD U MA