Feature Selection-Based Clustering on Micro-blogging Data

被引:6
|
作者
Dutta, Soumi [1 ,2 ]
Ghatak, Sujata [1 ]
Das, Asit Kumar [2 ]
Gupta, Manan [1 ]
Dasgupta, Sayantika [1 ]
机构
[1] Inst Engn & Management, Kolkata 700091, India
[2] Indian Inst Engn Sci & Technol Shibpur, Howrah 711103, India
来源
COMPUTATIONAL INTELLIGENCE IN DATA MINING | 2019年 / 711卷
关键词
Clustering; Feature selection; Micro-blogs;
D O I
10.1007/978-981-10-8055-5_78
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The growing popularity of micro-blogging phenomena opens up a flexible platform for the public as communication media for the public. For any trending/non-trending topic, thousands of post are posted daily in micro-blogs. During any important event, such as natural calamity and election, and sports event, such as IPL and World Cup, a huge number of messages (micro-blogs) are posted. Due to fast and huge exchange of messages causes information overload, hence clustering or grouping similar messages is an effective way to reduce that. Less content and noisy nature of messages are challenging factor in micro-blog data clustering. Incremental huge data is another challenge to clustering. So, in this work, a novel clustering approach is proposed for micro-blogs combining feature selection technique. The proposed approach has been applied to several experimental dataset, and it is compared with several existing clustering techniques which results in better outcome than other methods.
引用
收藏
页码:885 / 895
页数:11
相关论文
共 50 条
  • [1] Curious Feature Selection-Based Clustering
    Moran M.
    Gordon G.
    IEEE Transactions on Artificial Intelligence, 2024, 5 (12): : 6146 - 6158
  • [2] A feature selection-based speaker clustering method for paralinguistic tasks
    Gábor Gosztolya
    László Tóth
    Pattern Analysis and Applications, 2018, 21 : 193 - 204
  • [3] A feature selection-based speaker clustering method for paralinguistic tasks
    Gosztolya, Gabor
    Toth, Laszlo
    PATTERN ANALYSIS AND APPLICATIONS, 2018, 21 (01) : 193 - 204
  • [4] FEATS: feature selection-based clustering of single-cell RNA-seq data
    Vans, Edwin
    Patil, Ashwini
    Sharma, Alok
    BRIEFINGS IN BIOINFORMATICS, 2021, 22 (04)
  • [5] Distance based feature selection for clustering microarray data
    Dash, Manoranjan
    Gopalkrishnan, Vivekanand
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2008, 4947 : 512 - 519
  • [6] PSO Based Feature Selection for Clustering Gene Expression Data
    Deepthi, P. S.
    Thampi, Sabu M.
    2015 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, INFORMATICS, COMMUNICATION AND ENERGY SYSTEMS (SPICES), 2015,
  • [7] A Feature Selection Framework Based on Supervised Data Clustering
    Liu, Hongzhi
    Fu, Bin
    Jiang, Zhengshen
    Wu, Zhonghai
    Hsu, D. Frank
    2016 IEEE 15TH INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS & COGNITIVE COMPUTING (ICCI*CC), 2016, : 316 - 321
  • [8] Feature selection based on partition clustering
    Liu, Shuang
    Zhao, Qiang
    Wu, Xiang
    INTERNATIONAL JOURNAL OF KNOWLEDGE-BASED AND INTELLIGENT ENGINEERING SYSTEMS, 2014, 18 (02) : 135 - 142
  • [9] A Feature Selection-based Ensemble Method for Arrhythmia Classification
    Namsrai, Erdenetuya
    Munkhdalai, Tsendsuren
    Li, Meijing
    Shin, Jung-Hoon
    Namsrai, Oyun-Erdene
    Ryu, Keun Ho
    JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2013, 9 (01): : 31 - 40
  • [10] Graph-based unsupervised feature selection and multiview clustering for microarray data
    Swarnkar, Tripti
    Mitra, Pabitra
    JOURNAL OF BIOSCIENCES, 2015, 40 (04) : 755 - 767