MapReduce Functions to Analyze Sentiment Information from Social Big Data

被引:15
作者
Ha, Ilkyu [1 ]
Back, Bonghyun [2 ]
Ahn, Byoungchul [2 ]
机构
[1] Kyungil Univ, Dept Comp Engn, Gyongsan 712701, South Korea
[2] Yeungnam Univ, Dept Comp Engn, Gyongsan 712749, South Korea
关键词
Data handling - File organization - Data mining - Social networking (online) - MapReduce;
D O I
10.1155/2015/417502
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Opinion mining, which extracts meaningful opinion information from large amounts of social multimedia data, has recently arisen as a research area. In particular, opinion mining has been used to understand the true meaning and intent of social networking site users. It requires efficient techniques to collect a large amount of social multimedia data and extract meaningful information from them. Therefore, in this paper, we propose a method to extract sentiment information from various types of unstructured social media text data from social networks by using a parallel Hadoop Distributed File System (HDFS) to save social multimedia data and using MapReduce functions for sentiment analysis. The proposed method has stably performed data gathering and data loading and maintained stable load balancing of memory and CPU resources during data processing by the HDFS system. The proposed MapReduce functions have effectively performed sentiment analysis in the experiments. Finally, the sentiment analysis results of the proposed system are very close to those of manual processes.
引用
收藏
页数:11
相关论文
共 22 条
[1]  
[Anonymous], 2010, P 23 INT C COMPUTATI
[2]  
[Anonymous], 2011, HBase: the definitive guide: random access to your planetsize data
[3]  
[Anonymous], BIG DAT NEXT FRONT I
[4]  
[Anonymous], 2009, Sentiment140
[5]  
[Anonymous], 2013, MongoDB: The Definitive Guide
[6]  
[Anonymous], 2012, P 27 ANN ACM S APPL, DOI DOI 10.1145/2245276.2245364
[7]  
Bautin M., 2010, Proceedings of the 19th International Conference on World Wide Web, P1229
[8]  
Beomil Kang, 2013, [Journal of the Korean Library and Information Science Society, 한국문헌정보학회지], V47, P315, DOI 10.4275/KSLIS.2013.47.4.315
[9]  
Bo Pang, 2008, Foundations and Trends in Information Retrieval, V2, P1, DOI 10.1561/1500000001
[10]   Bigtable: A distributed storage system for structured data [J].
Chang, Fay ;
Dean, Jeffrey ;
Ghemawat, Sanjay ;
Hsieh, Wilson C. ;
Wallach, Deborah A. ;
Burrows, Mike ;
Chandra, Tushar ;
Fikes, Andrew ;
Gruber, Robert E. .
ACM TRANSACTIONS ON COMPUTER SYSTEMS, 2008, 26 (02)