A hybrid neural network for input that is both categorical and quantitative

被引:15
作者
Brouwer, RK [1 ]
机构
[1] Univ Coll Cariboo, Dept Comp Sci, Kamloops, BC V2C 5N3, Canada
关键词
D O I
10.1002/int.20032
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The data on which a MLP (multilayer perceptron) is normally trained to approximate a continuous function may include inputs that are categorical in addition to the numeric or quantitative inputs. Examples of categorical variables are gender, race, and so on. An approach examined in this article is to train a hybrid network consisting of a MLP and an encoder with multiple output units; that is, a separate output unit for each of the various combinations of values of the categorical variables. Input to the feed forward subnetwork of the hybrid network is then restricted to truly numerical quantities. A MLP with connection matrices that multiply input values and sigmoid functions that further transform values represents a continuous mapping in all input variables. A MLP therefore requires that all inputs correspond to numeric, continuously valued variables and represents a continuous function in all input variables. A categorical variable, on the other hand, produces a discontinuous relationship between an input variable and the output. The way that this problem is often dealt with is to replace the categorical values by numeric ones and treat them as if they were continuously valued. However there is no meaningful correspondence between the continuous quantities generated this way and the original categorical values. The basic difficulty with using these variables is that they define a metric for the categories that may not be reasonable. This suggests that the categorical inputs should be segregated from the continuous inputs as explained above. Results show that the method utilizing a hybrid network and separating numerical from quantitative input, as discussed here, is quite effective. (C) 2004 Wiley Periodicals, Inc.
引用
收藏
页码:979 / 1001
页数:23
相关论文
共 11 条
  • [1] Comparison of shoe insole materials by neural network analysis
    Barton, JG
    Lees, A
    [J]. MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 1996, 34 (06) : 453 - 459
  • [2] Bishop C. M., 1996, Neural networks for pattern recognition
  • [3] BISHOP CM, NCRG94004
  • [4] BURGESS AN, 1995, 4 INT C ART NEUR NET
  • [5] HEDONIC HOUSING PRICES AND DEMAND FOR CLEAN-AIR
    HARRISON, D
    RUBINFELD, DL
    [J]. JOURNAL OF ENVIRONMENTAL ECONOMICS AND MANAGEMENT, 1978, 5 (01) : 81 - 102
  • [6] IVERSON KE, 1966, ELEMENTARY FUNCTIONS
  • [7] IVERSON KE, 1996, INTRO DICT
  • [8] Adaptive Mixtures of Local Experts
    Jacobs, Robert A.
    Jordan, Michael I.
    Nowlan, Steven J.
    Hinton, Geoffrey E.
    [J]. NEURAL COMPUTATION, 1991, 3 (01) : 79 - 87
  • [9] Lee KW, 2001, IEEE IJCNN, P93, DOI 10.1109/IJCNN.2001.938998
  • [10] Rumelhart D. E., 1986, PARALLEL DISTRIBUTED, V1, P45, DOI [10.1016/B978-1-4832-1446-7.50010-8, DOI 10.1016/B978-1-4832-1446-7.50010-8]