A comparative dimensionality reduction study in telecom customer segmentation using deep learning and PCA

被引:0
|
作者
Maha Alkhayrat
Mohamad Aljnidi
Kadan Aljoumaa
机构
[1] Higher Institute for Applied Sciences and Technology,Faculty of Information Technology
来源
Journal of Big Data | / 7卷
关键词
Autoencoder; PCA; Neural networks; Deep learning; Big data; Clustering; Data representation; Dimentionality reduction; Segmentation;
D O I
暂无
中图分类号
学科分类号
摘要
Telecom Companies logs customer’s actions which generate a huge amount of data that can bring important findings related to customer’s behavior and needs. The main characteristics of such data are the large number of features and the high sparsity that impose challenges to the analytics steps. This paper aims to explore dimensionality reduction on a real telecom dataset and evaluate customers’ clustering in reduced and latent space, compared to original space in order to achieve better quality clustering results. The original dataset contains 220 features that belonging to 100,000 customers. However, dimensionality reduction is an important data preprocessing step in the data mining process specially with the presence of curse of dimensionality. In particular, the aim of data reduction techniques is to filter out irrelevant features and noisy data samples. To reduce the high dimensional data, we projected it down to a subspace using well known Principal Component Analysis (PCA) decomposition and a novel approach based on Autoencoder Neural Network, performing in this way dimensionality reduction of original data. Then K-Means Clustering is applied on both-original and reduced data set. Different internal measures were performed to evaluate clustering for different numbers of dimensions and then we evaluated how the reduction method impacts the clustering task.
引用
收藏
相关论文
共 50 条
  • [1] A comparative dimensionality reduction study in telecom customer segmentation using deep learning and PCA
    Alkhayrat, Maha
    Aljnidi, Mohamad
    Aljoumaa, Kadan
    JOURNAL OF BIG DATA, 2020, 7 (01)
  • [2] Online Reviews Analysis for Customer Segmentation through Dimensionality Reduction and Deep Learning Techniques
    Nilashi, Mehrbakhsh
    Samad, Sarminah
    Minaei-Bidgoli, Behrouz
    Ghabban, Fahad
    Supriyanto, Eko
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2021, 46 (09) : 8697 - 8709
  • [3] Online Reviews Analysis for Customer Segmentation through Dimensionality Reduction and Deep Learning Techniques
    Mehrbakhsh Nilashi
    Sarminah Samad
    Behrouz Minaei-Bidgoli
    Fahad Ghabban
    Eko Supriyanto‬
    Arabian Journal for Science and Engineering, 2021, 46 : 8697 - 8709
  • [4] Hyperspectral Data Dimensionality Reduction: A Comparative Study Between PCA and Autoencoder Methods
    Motsch, Jean
    Bergeon, Yves
    Ktivanek, Vaclav
    MODELLING AND SIMULATION FOR AUTONOMOUS SYSTEMS, MESAS 2023, 2025, 14615 : 314 - 334
  • [5] SLAM using incremental probabilistic PCA and dimensionality reduction
    Brunskill, E
    Roy, N
    2005 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), VOLS 1-4, 2005, : 342 - 347
  • [6] Traffic Prediction in Telecom Systems Using Deep Learning
    Kaushik, Prashant
    Singh, Sajal
    Yadav, Pankaj
    2018 7TH INTERNATIONAL CONFERENCE ON RELIABILITY, INFOCOM TECHNOLOGIES AND OPTIMIZATION (TRENDS AND FUTURE DIRECTIONS) (ICRITO) (ICRITO), 2018, : 302 - 307
  • [7] Hyperspectral image segmentation using dimensionality reduction and classical segmentation approaches
    Myasnikov E.V.
    Computer Optics, 2017, 41 (04): : 564 - 572
  • [8] Deep Learning Models for Lung Nodule Segmentation: A Comparative Study
    Orazalina, Aliya
    Yoon, Heechul
    Choi, Sang-, II
    Yoon, Seokhyun
    JOURNAL OF ELECTRICAL ENGINEERING & TECHNOLOGY, 2025, 20 (01) : 829 - 843
  • [9] Textual data dimensionality reduction - a deep learning approach
    Neetu Kushwaha
    Millie Pant
    Multimedia Tools and Applications, 2020, 79 : 11039 - 11050
  • [10] A Study on Heart Segmentation Using Deep Learning Algorithm for MRI Scans
    Ibrahim, Shakeel Muhammad
    Ibrahim, Muhammad Sohail
    Usman, Muhammad
    Naseem, Imran
    Moinuddin, Muhammad
    2019 13TH INTERNATIONAL CONFERENCE ON MATHEMATICS, ACTUARIAL SCIENCE, COMPUTER SCIENCE AND STATISTICS (MACS-13), 2019,