Towards a real-time processing framework based on improved distributed recurrent neural network variants with fastText for social big data analytics

Cited by: 60
Authors
Hammou, Badr Ait [1]
Lahcen, Ayoub Ait [1, 2]
Mouline, Salma [1]
Affiliations
[1] Mohammed V Univ, Rabat IT Ctr, Associated Unit CNRST URAC 29, Fac Sci,LRIT, Rabat, Morocco
[2] Ibn Tofail Univ, Natl Sch Appl Sci ENSA, LGS, Kenitra, Morocco
Keywords
Big data; FastText; Recurrent neural networks; LSTM; BiLSTM; GRU; Natural language processing; Sentiment analysis; Social big data analytics; Bidirectional LSTM; Classification
DOI
10.1016/j.ipm.2019.102122
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Big data generated by social media is a valuable source of information and offers an excellent opportunity to mine insights. In particular, user-generated content such as reviews, recommendations, and user behavior data is useful for supporting the marketing activities of many companies. Knowing what users say in social media reviews about the products they bought or the services they used is a key factor in decision-making. Sentiment analysis is one of the fundamental tasks in Natural Language Processing. Although deep learning for sentiment analysis has achieved great success and allowed several firms to analyze and extract relevant information from their textual data, a model that runs in a traditional environment cannot remain effective as the volume of data grows, which implies the importance of efficient distributed deep learning models for social big data analytics. Besides, social media analysis is known to be a complex process involving a set of complex tasks. Therefore, it is important to address the challenges and issues of social big data analytics and to enhance the classification accuracy of deep learning techniques in order to support better decisions. In this paper, we propose an approach for sentiment analysis that adopts fastText with Recurrent Neural Network (RNN) variants to represent textual data efficiently. It then employs the new representations to perform the classification task. Its main objective is to enhance the classification accuracy of well-known RNN variants and to handle large-scale data. In addition, we propose a distributed intelligent system for real-time social big data analytics. It is designed to ingest, store, process, index, and visualize huge amounts of information in real time.
The proposed system combines distributed machine learning with our proposed method to enhance decision-making processes. Extensive experiments conducted on two benchmark data sets demonstrate that our proposal for sentiment analysis outperforms well-known distributed recurrent neural network variants (i.e., Long Short-Term Memory (LSTM), Bidirectional Long Short-Term Memory (BiLSTM), and Gated Recurrent Unit (GRU)). Specifically, we tested the efficiency of our approach using these three deep learning models. The results show that our proposed approach is able to enhance the performance of all three models. The current work can provide several benefits for researchers and practitioners who want to collect, handle, analyze, and visualize several sources of information in real time. It can also contribute to a better understanding of public opinion and user behavior by using our proposed system with the improved variants of the most powerful distributed deep learning and machine learning algorithms. Furthermore, it is able to increase the classification accuracy of several existing works based on RNN models for sentiment analysis.
Pages: 15