Mining Social Media Data Using Topological Data Analysis

被引:8
|
作者
Almgren, Khaled [1 ]
Kim, Minkyu [2 ]
Lee, Jeongkyu [1 ]
机构
[1] Univ Bridgeport, Comp Sci & Engn Dept, Bridgeport, CT 06614 USA
[2] ASML, Wilton, CT 06897 USA
来源
2017 IEEE 18TH INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION (IEEE IRI 2017) | 2017年
关键词
topological data analysis; social network analysis and mining; machine learning; clustering;
D O I
10.1109/IRI.2017.41
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Topological data analysis is a noble method to analyze high-dimensional qualitative data using a set of properties from topology. In this paper, we explore the feasibility of topological data analysis for mining social media data by investigating the problem of image popularity. We randomly crawl images from Instagram, convert their captions to 300 dimensional numerical vectors using Word2vec, calculate cosine distances to evaluate the similarities of the caption vectors, and then apply the distances to a topological data analysis algorithm called mapper. With caption vectors, the results show that topological data analysis is able to cluster the images related to the images' popularity. Moreover, the results show relationships between the clusters that are represented as a monotonic increase of popularity. This approach is compared with traditional clustering algorithms, including k-means and hierarchical clustering, and the results show that topological data analysis outperforms the others.
引用
收藏
页码:144 / 153
页数:10
相关论文
共 50 条
  • [41] Cyberbullying Detection and Prevention: Data Mining in Social Media
    Sultan, Daniyar
    Suliman, Azizah
    Toktarova, Aigerim
    Omarov, Batyrkhan
    Mamikov, Satmyrza
    Beissenova, Gulbakhram
    2021 11TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE & ENGINEERING (CONFLUENCE 2021), 2021, : 338 - 342
  • [42] Pattern Mining Approaches Used in Social Media Data
    Chaki, Jyotismita
    Dey, Nilanjan
    Panigrahi, B. K.
    Shi, Fuqian
    Fong, Simon James
    Sherratt, R. Simon
    INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2020, 28 (Supp02) : 123 - 152
  • [43] Mining Social Media Data: Discovering Contradictedness Quantities
    Yafi, Eiad
    Zuhairi, M.
    ACM IMCOM 2015, PROCEEDINGS, 2015,
  • [44] On fairness: User perspectives on social media data mining
    Kennedy, Helen
    Elgesem, Dag
    Miguel, Cristina
    CONVERGENCE-THE INTERNATIONAL JOURNAL OF RESEARCH INTO NEW MEDIA TECHNOLOGIES, 2017, 23 (03): : 270 - 288
  • [45] USING DATA MINING FOR ASSESSING THE IMPACT OF SOCIAL MEDIA IN HIGHER EDUCATION: THE CASE OF INTEGRATING SOCIAL MEDIA IN THE CURRICULUM
    Dafoulas, Georgios
    Loveday, Joanna
    Neilson, David
    ICERI2015: 8TH INTERNATIONAL CONFERENCE OF EDUCATION, RESEARCH AND INNOVATION, 2015, : 4684 - 4693
  • [46] Introduction to the Data Analytics, Data Mining and Machine Learning for Social Media Minitrack
    Haughton, Dominique M.
    Xu, Jennifer J.
    Yates, David J.
    Yan, Xiangbin
    PROCEEDINGS OF THE 51ST ANNUAL HAWAII INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES (HICSS), 2018, : 1741 - 1741
  • [47] Introduction to Data Analytics, Data Mining and Machine Learning for Social Media Minitrack
    Yates, David
    Xu, Jennifer
    Mentzer, Kevin
    PROCEEDINGS OF THE 52ND ANNUAL HAWAII INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES, 2019, : 2215 - 2215
  • [48] Introduction to data analytics, data mining and machine learning for social media minitrack
    Yates, David
    Xu, Jennifer
    Mentzer, Kevin
    Proceedings of the Annual Hawaii International Conference on System Sciences, 2019, 2019-January
  • [49] Mining Trajectory Data and Geotagged Data in Social Media for Road Map Inference
    Li, Jun
    Qin, Qiming
    Han, Jiawei
    Tang, Lu-An
    Lei, Kin Hou
    TRANSACTIONS IN GIS, 2015, 19 (01) : 1 - 18
  • [50] Enhancing decision-making support by mining social media data with social network analysis
    Manuela Freire
    Francisco Antunes
    João Paulo Costa
    Social Network Analysis and Mining, 13