Order selection in finite mixtures of linear regressions

Cited by: 18

Authors:

Depraetere, Nicolas [1]

Vandebroek, Martina [1,2]

Affiliations:

[1] Katholieke Univ Leuven, Fac Econ & Business, B-3000 Louvain, Belgium

[2] Leuven Stat Res Ctr, B-3001 Louvain, Belgium

Source:

STATISTICAL PAPERS | 2014 / Volume 55 / Issue 03

Keywords:

Finite mixture; Regression; Penalized likelihood; Order selection; LATENT CLASS ANALYSIS; AKAIKE INFORMATION CRITERION; LIKELIHOOD RATIO TESTS; MODEL-SELECTION; MAXIMUM-LIKELIHOOD; EM ALGORITHM; NUMBER; COMPONENTS; RETENTION; CLUSTERS;

DOI:

10.1007/s00362-013-0534-x

Chinese Library Classification (CLC):

O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics];

Subject classification codes:

020208 ; 070103 ; 0714 ;

Abstract:

Finite mixture models can adequately model population heterogeneity when this heterogeneity arises from a finite number of relatively homogeneous clusters. An example of such a situation is market segmentation. Order selection in mixture models, i.e., selecting the correct number of components, however, is a problem which has not been satisfactorily resolved. Existing simulation results in the literature do not completely agree with each other. Moreover, it appears that the performance of different selection methods is affected by the type of model and the parameter values. Furthermore, most existing results are based on simulations where the true generating model is identical to one of the models in the candidate set. In order to partly fill this gap, we carried out a (relatively) large simulation study for finite mixture models of normal linear regressions. We included several types of model (mis)specification to study the robustness of 18 order selection methods. Furthermore, we compared the performance of these selection methods based on unpenalized and penalized estimates of the model parameters. The results indicate that order selection based on penalized estimates greatly improves the success rates of all order selection methods. The most successful methods were [...], but not one method was consistently good or best for all types of model (mis)specification.

Pages: 871-911

Page count: 41