Semantic analysis via application of deep learning using Naver movie review data

被引:1
作者
Kim, Sojin [1 ]
Song, Jongwoo [1 ]
机构
[1] Ewha Womans Univ, Dept Stat, 52 Ewhayeodae Gil, Seoul 03760, South Korea
关键词
semantic analysis; natural language processing (NLP); LSTM; recurrent neural network (RNN); movie review;
D O I
10.5351/KJAS.2022.35.1.019
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
With the explosive growth of social media, its abundant text-based data generated by web users has become an important source for data analysis. For example, we often witness online movie reviews from the 'Naver Movie' affecting the general public to decide whether they should watch the movie or not. This study has conducted analysis on the Naver Movie's text-based review data to predict the actual ratings. After examining the distribution of movie ratings, we performed semantics analysis using Korean Natural Language Processing. This research sought to find the best review rating prediction model by comparing machine learning and deep learning models. We also compared various regression and classification models in 2-class and multi-class cases. Lastly we explained the causes of review misclassification related to movie review data characteristics.
引用
收藏
页码:19 / 33
页数:15
相关论文
共 9 条
  • [1] Kharde V., 2016, International Journal of Computer Applications, DOI [10.5120/ijca2016908625, DOI 10.5120/IJCA2016908625]
  • [2] 박호연, 2019, [Journal of Intelligence and Information Systems, 지능정보연구], V25, P141
  • [3] Lee Jae Jun, 2018, [Journal of Information Technology Services, 한국IT서비스학회지], V17, P79, DOI 10.9716/KITS.2018.17.1.079
  • [4] Mikolov Tomas, 2013, ARXIV, DOI 10.48550/arXiv.1301.3781
  • [5] Nayak A., 2016, COMP STUDY NAIVE BAY
  • [6] Parmar Hitesh, 2014, INT C INF SCI
  • [7] Pennington Jeffrey, 2014, P 2014 C EMPIRICAL M, P1532, DOI [10.3115/v1/D14-1162, DOI 10.3115/V1/D14-1162]
  • [8] A Hybrid CNN-LSTM Model for Improving Accuracy of Movie Reviews Sentiment Analysis
    Rehman, Anwar Ur
    Malik, Ahmad Kamran
    Raza, Basit
    Ali, Waqar
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (18) : 26597 - 26613
  • [9] Yeongtaek Oh, 2019, Journal of KIISE, V46, P45, DOI 10.5626/JOK.2019.46.1.45