Optimal Ratio of Continuous to Categorical Variables for the Two Group Location Model

被引:0
|
作者
Baah, Philemon [1 ]
Adebanji, Atinuke [1 ]
Kakaie, Romain Glele [2 ]
机构
[1] Kwame Nkrumah Univ Sci & Technol, Dept Math, Kumasi, Ghana
[2] Univ Abomey Calavi, Fac Agron Sci, Cotonou, Benin
关键词
Location model; classification; categorical to continuous variables; contingency table; leave-one-out method;
D O I
暂无
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
We investigated the effect of different combinations of (p) continuous to (q) categorical variables and increasing group centroid separation function (delta = 1, 2, 3) on the performance of the Location model for two groups (Pi(i), i = 1, 2). The number of predictor variables were 4 and 8 with 1:3, 1:1 and 3:1 being the predetermined ratios for p : q. We generated N(mu(1), I) of sizes 40, 80 and 120 with MatLab R2007b for p variables within 2(q) binary cells in Pi(1). The size of Pi(2) was determined using sample ratios 1:1, 1:2, 1:3 and 1:4 for n(1) : n(2) within 2(q) cells. Group1 has mean mu((1))(1) = 0 in the first cell (for p continuous variables) and mu((1))(2) 2 = delta, subsequent cells, mu((m+1))(i) = mu((m))(i) + 1. Error rates reduced more rapidly for increase in d than asymptotically. The optimal p : q was 3: 1 and the model deteriorated at 1: 3 with larger variability. The 8 variable model performed better than the 4 variable model for large sample sizes of p : q = 1 : 1 and outperformed it for all sample sizes of p : q = 3 : 1. Results showed that to use the Location model for classification problems with equal (or more) categorical to continuous variables, it should be compensated with increased distance function and sample sizes.
引用
收藏
页码:18 / 26
页数:9
相关论文
共 50 条
  • [1] THE LOCATION MODEL FOR MIXTURES OF CATEGORICAL AND CONTINUOUS-VARIABLES
    KRZANOWSKI, WJ
    JOURNAL OF CLASSIFICATION, 1993, 10 (01) : 25 - 49
  • [2] The modified location model for classifying genetic resources: I. Association between categorical and continuous variables
    Franco, J
    Crossa, J
    CROP SCIENCE, 2002, 42 (05) : 1719 - 1726
  • [3] New Discrimination Procedure of Location Model for Handling Large Categorical Variables
    Hamid, Hashibah
    Mei, Long Mei
    Yahaya, Sharipah Soaad Syed
    SAINS MALAYSIANA, 2017, 46 (06): : 1001 - 1010
  • [4] Regularized classification for mixed continuous and categorical variables under across-location heteroscedasticity
    Leung, CY
    JOURNAL OF MULTIVARIATE ANALYSIS, 2005, 93 (02) : 358 - 374
  • [5] Performance Analysis and Discrimination Procedure of Two-Group Location Model with Some Continuous and High-Dimensional of Binary Variables
    Hamid, Hashibah
    Okwonu, Friday Zinzendoff
    Ahad, Nor Aishah
    Rahim, Hasliza Abdul
    SAINS MALAYSIANA, 2022, 51 (12): : 4153 - 4160
  • [6] THE GROUPED CONTINUOUS MODEL FOR MULTIVARIATE ORDERED CATEGORICAL VARIABLES AND COVARIATE ADJUSTMENT
    ANDERSON, JA
    PEMBERTON, JD
    BIOMETRICS, 1985, 41 (04) : 875 - 885
  • [7] Optimal Designs for Mixture-Process Experiments Involving Continuous and Categorical Noise Variables
    Chung, Peter J.
    Goldfarb, Heidi B.
    Montgomery, Douglas C.
    Borror, Connie M.
    QUALITY TECHNOLOGY AND QUANTITATIVE MANAGEMENT, 2009, 6 (04): : 451 - 470
  • [8] Two practical issues in using LISCOMP for analysing continuous and ordered categorical variables
    Poon, WY
    Lee, SY
    BRITISH JOURNAL OF MATHEMATICAL & STATISTICAL PSYCHOLOGY, 1999, 52 : 195 - 211
  • [9] Multivariate time-series analysis with categorical and continuous variables in an lstr model
    Davis, Ginger M.
    Ensor, Katherine B.
    JOURNAL OF TIME SERIES ANALYSIS, 2007, 28 (06) : 867 - 885
  • [10] A Hybrid Model for Joint Simulation of High-Dimensional Continuous and Categorical Variables
    Talebi, Hassan
    Lo, Johnny
    Mueller, Ute
    GEOSTATISTICS VALENCIA 2016, 2017, 19 : 415 - 430