A survey on text mining in social networks

被引:75
|
作者
Irfan, Rizwana [1 ]
King, Christine K. [1 ]
Grages, Daniel [1 ]
Ewen, Sam [1 ]
Khan, Samee U. [1 ]
Madani, Sajjad A. [2 ]
Kolodziej, Joanna [3 ]
Wang, Lizhe [4 ]
Chen, Dan [5 ]
Rayes, Ammar [6 ]
Tziritas, Nikolaos [4 ]
Xu, Cheng-Zhong [4 ]
Zomaya, Albert Y. [7 ]
Alzahrani, Ahmed Saeed [8 ]
Li, Hongxiang [9 ]
机构
[1] N Dakota State Univ, Fargo, ND 58102 USA
[2] COMSATS Inst Informat Technol, Islamabad 44000, Pakistan
[3] Cracow Univ Technol, PL-30001 Krakow, Poland
[4] Chinese Acad Sci, Beijing 100864, Peoples R China
[5] China Univ Geosci, Wuhan 430000, Peoples R China
[6] CISCO Syst, San Jose, CA 94089 USA
[7] Univ Sydney, Sydney, NSW 2006, Australia
[8] King Abdulaziz Univ, Jeddah 21589, Saudi Arabia
[9] Univ Louisville, Louisville, KY 40292 USA
关键词
CLASSIFICATION;
D O I
10.1017/S0269888914000277
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this survey, we review different text mining techniques to discover various textual patterns from the social networking sites. Social network applications create opportunities to establish interaction among people leading to mutual learning and sharing of valuable knowledge, such as chat, comments, and discussion boards. Data in social networking websites is inherently unstructured and fuzzy in nature. In everyday life conversations, people do not care about the spellings and accurate grammatical construction of a sentence that may lead to different types of ambiguities, such as lexical, syntactic, and semantic. Therefore, analyzing and extracting information patterns from such data sets are more complex. Several surveys have been conducted to analyze different methods for the information extraction. Most of the surveys emphasized on the application of different text mining techniques for unstructured data sets reside in the form of text documents, but do not specifically target the data sets in social networking website. This survey attempts to provide a thorough understanding of different text mining techniques as well as the application of these techniques in the social networking websites. This survey investigates the recent advancement in the field of text analysis and covers two basic approaches of text mining, such as classification and clustering that are widely used for the exploration of the unstructured text available on the Web.
引用
收藏
页码:157 / 170
页数:14
相关论文
共 50 条
  • [31] Text Mining for Security Threat Detection
    Suh-Lee, Candace
    Jo, Ju-Yeon
    Kim, Yoohwan
    2016 IEEE CONFERENCE ON COMMUNICATIONS AND NETWORK SECURITY (CNS), 2016, : 252 - 260
  • [32] Text mining in the classification of digital documents
    Contreras Barrera, Marcial
    BIBLIOS-REVISTA DE BIBLIOTECOLOGIA Y CIENCIAS DE LA INFORMACION, 2016, (64): : 33 - 43
  • [33] Selection criteria for text mining approaches
    Hashimi, Hussein
    Hafez, Alaaeldin
    Mathkour, Hassan
    COMPUTERS IN HUMAN BEHAVIOR, 2015, 51 : 729 - 733
  • [34] A review on authorship attribution in text mining
    Zheng, Wanwan
    Jin, Mingzhe
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2023, 15 (02)
  • [35] Text Mining: Techniques, Applications and Issues
    Talib, Ramzan
    Hanif, Muhammad Kashif
    Ayesha, Shaeela
    Fatima, Fakeeha
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2016, 7 (11) : 414 - 418
  • [36] A Comprehensive Science Mapping Analysis of Textual Emotion Mining in Online Social Networks
    Chawla, Shivangi
    Mehrotra, Monica
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (05) : 218 - 229
  • [37] Text Mining Method in the Field of Health
    Toplu, Selcuk Goksel
    Cangur, Sengul
    KONURALP TIP DERGISI, 2020, 12 (02): : 236 - 246
  • [38] Mining social media text for disaster resource management using a feature selection based on forest optimization
    Bhoi, Ashutosh
    Balabantaray, Rakesh Chandra
    Sahoo, Deepak
    Dhiman, Gaurav
    Khare, Manish
    Narducci, Fabio
    Kaur, Amandeep
    COMPUTERS & INDUSTRIAL ENGINEERING, 2022, 169
  • [39] Text Recognition in the Wild: A Survey
    Chen, Xiaoxue
    Jin, Lianwen
    Zhu, Yuanzhi
    Luo, Canjie
    Wang, Tianwei
    ACM COMPUTING SURVEYS, 2021, 54 (02)
  • [40] Text categorization with WEKA: A survey
    Merlini, Donatella
    Rossini, Martina
    MACHINE LEARNING WITH APPLICATIONS, 2021, 4