Entropy-Based Mixed data transform model

被引:0
|
作者
Liu, Xingxing [1 ]
Chen, Shan [1 ]
Wang, Pan [2 ]
机构
[1] Wuhan Univ Technol, Sch Management, Wuhan, Peoples R China
[2] Wuhan Univ Technol, Sch Automat, Wuhan, Peoples R China
来源
2016 2ND INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS - COMPUTING TECHNOLOGY, INTELLIGENT TECHNOLOGY, INDUSTRIAL INFORMATION INTEGRATION (ICIICII) | 2016年
基金
中国国家自然科学基金;
关键词
Mixed data; Information entropy; conversion; probability; CATEGORICAL-DATA;
D O I
10.1109/ICIICII.2016.60
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In the era of big data, in-depth data mining is inevitable and urgent. Cluster analysis is one of the major data mining methods. Measuring similarity or distance between two objects is a key step for several data mining and knowledge discovery tasks. For abound unstructured free data, conversion between numeric data and categorical data matters most, while the notion of similarity for numeric data is relatively well-studied and for categorical data not satisfying. Learning from current clustering algorithm for categorical data and mixed data, several methods and corresponding features are explored and summarized. Results on a variety of data sets show that while no one measure dominates others for all types of problems, but some measures are able to be integrated into clustering process. Proposed method has the potential capability to deal with numeric and categorical features ( mixed features) of dataset.
引用
收藏
页码:123 / 126
页数:4
相关论文
共 50 条
  • [1] Entropy-based transform learning algorithms
    Parthasarathy, Gayatri
    Abhilash, G.
    IET SIGNAL PROCESSING, 2018, 12 (04) : 439 - 446
  • [2] On entropy-based data mining
    Holzinger, Andreas
    Hörtenhuber, Matthias
    Mayer, Christopher
    Bachler, Martin
    Wassertheurer, Siegfried
    Pinho, Armando J
    Koslicki, David
    1600, Springer Verlag (8401): : 209 - 226
  • [3] Data Entropy-Based Imbalanced Learning
    Fan, Yutao
    Huang, Heming
    RECENT ADVANCES IN NEXT-GENERATION DATA SCIENCE, SDSC 2024, 2024, 2158 : 95 - 109
  • [4] Entropy-based reduction of traffic data
    Pescape, Antonio
    IEEE COMMUNICATIONS LETTERS, 2007, 11 (02) : 191 - 193
  • [5] An Entropy-based Analytic Model for the Privacy-Preserving in Open Data
    Kim, Soo-Hyung
    Jung, Changwook
    Lee, Yoon-Joon
    2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 3676 - 3684
  • [6] Unsupervised Entropy-Based Selection of Data Sets for Improved Model Fitting
    Ferreira, Pedro M.
    2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 3330 - 3337
  • [7] AN ENTROPY-BASED MODAL SPLIT MODEL
    JORNSTEN, KO
    LUNDGREN, JT
    TRANSPORTATION RESEARCH PART B-METHODOLOGICAL, 1989, 23 (05) : 345 - 359
  • [8] An Entropy-Based Model for Hierarchical Learning
    Asadi, Amir R.
    JOURNAL OF MACHINE LEARNING RESEARCH, 2024, 25 : 1 - 45
  • [9] An Entropy-based Data Reduction Method for Data Preprocessing
    Cassandro, Rocco
    Li, Quing
    Li, Zhaojun Steven
    2023 IEEE INTERNATIONAL CONFERENCE ON PROGNOSTICS AND HEALTH MANAGEMENT, ICPHM, 2023, : 351 - 356
  • [10] Riemannian manifold on stream data: Fourier transform and entropy-based DDoS attacks detection method
    Liu, Zhen
    Hu, Changzhen
    Shan, Chun
    COMPUTERS & SECURITY, 2021, 109