An Extended Regularized K-Means Clustering Approach for High-Dimensional Customer Segmentation With Correlated Variables

被引:5
作者
Zhao, Hong-Hao [1 ]
Luo, Xi-Chun [2 ]
Ma, Rui [3 ]
Lu, Xi [1 ]
机构
[1] Macau Univ Sci & Technol, Sch Business, Dept Decis Sci, Taipa 999078, Macao, Peoples R China
[2] Dongguan Univ Technol, Sch Intelligent Mfg, City Coll, Dongguan 523000, Peoples R China
[3] Zhuhai Coll Sci & Technol, Sch Comp Sci, Zhuhai 519000, Peoples R China
关键词
Customer segmentation; high-dimensional clustering; regularized K-means; correlated variables; SELECTION; MODEL; FRAMEWORK; SYSTEM;
D O I
10.1109/ACCESS.2021.3067499
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The omnichannel business has becomes a hot topic due to the fast development on e-commerce and the customers' acquaintance with multichannel shopping mode. Various business organizations have started to work on omnichannel business issue in order to satisfy the new trend of customer demand and tend to devote their efforts to both online and offline business. Thus, there is no doubt that understanding the shopping behavior for online customers is vital for the omnichannel business. The RFM (recency, frequency, monetary) model and the k-means clustering method are commonly used to extract customers' information and segment customers, respectively. To extend the RFM model, we divide the total frequency and monetary information into weekly level data, and as a consequence, the number of variables corresponding to one customer increases significantly, leading to the problem of high-dimensional analysis. To address this issue, in this paper we extend the regularized k-means clustering method with L-1-norm for independent case to the clustering method with elastic net penalty with a focus on correlated variables. Our simulation results show that the proposed method performs better than the standard k-means method by providing lower error rates and can select variables simultaneously under 4 different scenarios. A real example of an online retailer is presented to illustrate the use of the proposed method and highlight its high potential in clustering high-dimensional applications. In particular, the number of variables is reduced from 108 to 98 without any loss on clustering accuracy.
引用
收藏
页码:48405 / 48412
页数:8
相关论文
共 50 条
  • [1] Customer Segmentation using K-means Clustering
    Kansal, Tushar
    Bahuguna, Suraj
    Singh, Vishal
    Choudhury, Tanupriya
    PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON COMPUTATIONAL TECHNIQUES, ELECTRONICS AND MECHANICAL SYSTEMS (CTEMS), 2018, : 135 - 139
  • [2] Telecom Customer Segmentation with K-means Clustering
    Luo Ye
    Cai Qiu-ru
    Xi Hai-xu
    Liu Yi-jun
    Yu Zhi-min
    PROCEEDINGS OF 2012 7TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION, VOLS I-VI, 2012, : 648 - 651
  • [3] Robust and sparse k-means clustering for high-dimensional data
    Brodinova, Sarka
    Filzmoser, Peter
    Ortner, Thomas
    Breiteneder, Christian
    Rohm, Maia
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2019, 13 (04) : 905 - 932
  • [4] Explainable Customer Segmentation Using K-means Clustering
    Khan, Riyo Hayat
    Dofadar, Dibyo Fabian
    Alam, Md Golam Rabiul
    2021 IEEE 12TH ANNUAL UBIQUITOUS COMPUTING, ELECTRONICS & MOBILE COMMUNICATION CONFERENCE (UEMCON), 2021, : 639 - 643
  • [5] K-Means Clustering Approach for Intelligent Customer Segmentation Using Customer Purchase Behavior Data
    Tabianan, Kayalvily
    Velu, Shubashini
    Ravi, Vinayakumar
    SUSTAINABILITY, 2022, 14 (12)
  • [6] Application of Improved K-means Clustering Algorithm in Customer Segmentation
    Li, Gang
    INFORMATION TECHNOLOGY APPLICATIONS IN INDUSTRY II, PTS 1-4, 2013, 411-414 : 1081 - 1084
  • [7] A REGULARIZED K-MEANS AND MULTIPHASE SCALE SEGMENTATION
    Kang, Sung Ha
    Sandberg, Berta
    Yip, Andy M.
    INVERSE PROBLEMS AND IMAGING, 2011, 5 (02) : 407 - 429
  • [8] SPARSE k-MEANS WITH l∞/l0 PENALTY FOR HIGH-DIMENSIONAL DATA CLUSTERING
    Chang, Xiangyu
    Wang, Yu
    Li, Rongjian
    Xu, Zongben
    STATISTICA SINICA, 2018, 28 (03) : 1265 - 1284
  • [9] Customer Segmentation Using K-Means Clustering and the Hybrid Particle Swarm Optimization Algorithm
    Li, Yue
    Qi, Jianfang
    Chu, Xiaoquan
    Mu, Weisong
    COMPUTER JOURNAL, 2023, 66 (04) : 941 - 962
  • [10] Customer segmentation using K-means clustering and the adaptive particle swarm optimization algorithm
    Li, Yue
    Chu, Xiaoquan
    Tian, Dong
    Feng, Jianying
    Mu, Weisong
    APPLIED SOFT COMPUTING, 2021, 113