Stepwise possibilistic c-regressions

被引:7
作者
Chang, Shao-Tung [1 ]
Lu, Kang-Ping [2 ]
Yang, Miin-Shen [3 ]
机构
[1] Natl Taiwan Normal Univ, Dept Math, Taipei, Taiwan
[2] Natl Taichung Univ Sci & Technol, Dept Appl Stat, Taichung, Taiwan
[3] Chung Yuan Christian Univ, Dept Appl Math, Chungli, Taiwan
关键词
Fuzzy c-means; Possibilistic clustering; Switching regressions; Fuzzy c-regressions; Possibilistic c-regressions (PCR); Stepwise PCR; LINEAR-REGRESSION; ESTIMATING MIXTURES; FUZZY; ALGORITHM; MULTIVARIATE; CONSTRAINTS; LIKELIHOOD; PLANTS; MODEL;
D O I
10.1016/j.ins.2015.11.042
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In 1993, Hathaway and Bezdek combined switching regressions with fuzzy c-means (FCM) to create fuzzy c-regressions (FCR). The FCR algorithm had been widely studied and applied in various areas. However, membership of the FCR does not always correspond to the degree of belonging and it can be inaccurate in a noisy environment Krishnapuram and Keller (1993) proposed a possibilistic c-means (PCM) whereby membership gives a much better explanation of the degree of belonging and it is more robust to noise and outliers than the FCM. Therefore, we incorporate possibilistic clustering into switching regression models and term it possibilistic c-regressions (PCR). Although a PCR ameliorates the problem of outliers and noisy points, a PCR still depends heavily on initial values. In this paper, we propose a schema for a nested stepwise procedure for a PCR, called a stepwise PCR (SPCR) method, which repeats the PCR on a series of nested subsets using the clustering results of the previous subset as good initial values for the PCR for the succeeding subset. When the smallest subset, D-1, is determined at the location where the c regression models separate sufficiently, the number of clusters and the cluster centers in D-1 are determined using a modified mountain method, so data in D-1 is properly partitioned, which provides good initial values for the following PCR. The proposed SPCR is unsupervised, does not require initialization and is unaffected by noise and outliers. Several experiments and real examples demonstrate the superiority and effectiveness of the proposed SPCR method. (C) 2015 Elsevier Inc. All rights reserved.
引用
收藏
页码:307 / 322
页数:16
相关论文
共 57 条
[22]   Robust clusterwise linear regression through trimming [J].
Garcia-Escudero, L. A. ;
Gordaliza, A. ;
Mayo-Iscar, A. ;
San Martin, R. .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2010, 54 (12) :3057-3069
[23]  
Grun B, 2008, Recent advances in linear models and related areas, P205, DOI [DOI 10.1007/978-3-7908-2064-5_11, DOI 10.1007/978-3-7908-2064-511.843,847]
[24]  
Hathaway R. J., 1993, IEEE Transactions on Fuzzy Systems, V1, P195, DOI 10.1109/91.236552
[25]   Fuzzy c-means clustering of incomplete data [J].
Hathaway, RJ ;
Bezdek, JC .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2001, 31 (05) :735-744
[26]   Mixture of Regression Models With Varying Mixing Proportions: A Semiparametric Approach [J].
Huang, Mian ;
Yao, Weixin .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2012, 107 (498) :711-724
[27]   Semiparametric mixtures of regressions [J].
Hunter, David R. ;
Young, Derek S. .
JOURNAL OF NONPARAMETRIC STATISTICS, 2012, 24 (01) :19-38
[28]   Estimating mixtures of regressions [J].
Hurn, M ;
Justel, A ;
Robert, CP .
JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2003, 12 (01) :55-79
[29]  
Justel A., 1996, Journal of Computational and Graphical Statistics, V5, P176, DOI [10.1080/10618600.1996.10474703, DOI 10.1080/10618600.1996.10474703]
[30]   The possibilistic C-means algorithm: Insights and recommendations [J].
Krishnapuram, R ;
Keller, JM .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 1996, 4 (03) :385-393