Mining Social Media Data Using Topological Data Analysis

被引:8
|
作者
Almgren, Khaled [1 ]
Kim, Minkyu [2 ]
Lee, Jeongkyu [1 ]
机构
[1] Univ Bridgeport, Comp Sci & Engn Dept, Bridgeport, CT 06614 USA
[2] ASML, Wilton, CT 06897 USA
来源
2017 IEEE 18TH INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION (IEEE IRI 2017) | 2017年
关键词
topological data analysis; social network analysis and mining; machine learning; clustering;
D O I
10.1109/IRI.2017.41
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Topological data analysis is a noble method to analyze high-dimensional qualitative data using a set of properties from topology. In this paper, we explore the feasibility of topological data analysis for mining social media data by investigating the problem of image popularity. We randomly crawl images from Instagram, convert their captions to 300 dimensional numerical vectors using Word2vec, calculate cosine distances to evaluate the similarities of the caption vectors, and then apply the distances to a topological data analysis algorithm called mapper. With caption vectors, the results show that topological data analysis is able to cluster the images related to the images' popularity. Moreover, the results show relationships between the clusters that are represented as a monotonic increase of popularity. This approach is compared with traditional clustering algorithms, including k-means and hierarchical clustering, and the results show that topological data analysis outperforms the others.
引用
收藏
页码:144 / 153
页数:10
相关论文
共 50 条
  • [31] Social User Mining: Survey on Mining Different Types of Social Media Data
    Eltaher, Mohammed
    Lee, Jeongkyu
    INTERNATIONAL JOURNAL OF MULTIMEDIA DATA ENGINEERING & MANAGEMENT, 2013, 4 (04): : 58 - 70
  • [32] Data Mining for Social Networks Open Data Analysis
    Spiridonov, Roman E.
    Cvetkov, Vladislav D.
    Yurchik, Oleg M.
    2017 IEEE II INTERNATIONAL CONFERENCE ON CONTROL IN TECHNICAL SYSTEMS (CTS), 2017, : 395 - 396
  • [33] Disaster impacts analysis using social media data
    Gangadhari, Rajan Kumar
    Khanzode, Vivek
    Murthy, Shankar
    2021 INTERNATIONAL CONFERENCE ON MAINTENANCE AND INTELLIGENT ASSET MANAGEMENT (ICMIAM), 2021,
  • [34] Frequent Itemset Mining for Big Data in social media using ClustBigFIM algorithm
    Gole, Sheela
    Tidke, Bharat
    2015 INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING (ICPC), 2015,
  • [35] Mining and analysis of opioid content in longitudinal data posted in a social media forum
    Sarker, Abeed
    Deroos, Francis
    Gonzalez-Hernandez, Graciela
    O'Connor, Karen
    Perrone, Jeanmarie
    CLINICAL TOXICOLOGY, 2020, 58 (06) : 571 - 571
  • [36] Mining urban perceptions from social media data
    Liu, Yu
    Yuan, Yihong
    Zhang, Fan
    JOURNAL OF SPATIAL INFORMATION SCIENCE, 2020, (20): : 51 - 55
  • [37] Social Media Data Mining in Public Relations Research
    Britt, Brian C.
    Hayes, Jameson L.
    Holiday, Steven
    Lyu, Yuanwei
    JOURNAL OF PUBLIC RELATIONS RESEARCH, 2024,
  • [38] InfoExtractor-A Tool for Social Media Data Mining
    File, Charles
    Shah, Chirag
    JOURNAL OF INFORMATION TECHNOLOGY & POLITICS, 2012, 9 (03) : 269 - 278
  • [39] Data Mining Cultural Aspects of Social Media Marketing
    Hochreiter, Ronald
    Waldhauser, Christoph
    ADVANCES IN DATA MINING: APPLICATIONS AND THEORETICAL ASPECTS, 2014, 8557 : 130 - 143
  • [40] An ontological artifact for classifying social media: Text mining analysis for financial data
    Alzamil, Zamil
    Appelbaum, Deniz
    Nehmer, Robert
    INTERNATIONAL JOURNAL OF ACCOUNTING INFORMATION SYSTEMS, 2020, 38