Semi-supervised learning for big social data analysis

被引:159
作者
Hussain, Amir [1 ]
Cambria, Erik [2 ]
机构
[1] Univ Stirling, Sch Nat Sci, Stirling FK9 4LA, Scotland
[2] Nanyang Technol Univ, Sch Comp Sci & Engn, 50 Nanyang Ave, Singapore 639798, Singapore
基金
英国工程与自然科学研究理事会;
关键词
Semi-supervised learning; Big social data analysis; Sentiment analysis;
D O I
10.1016/j.neucom.2017.10.010
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In an era of social media and connectivity, web users are becoming increasingly enthusiastic about interacting, sharing, and working together through online collaborative media. More recently, this collective intelligence has spread to many different areas, with a growing impact on everyday life, such as in education, health, commerce and tourism, leading to an exponential growth in the size of the social Web. However, the distillation of knowledge from such unstructured Big data is, an extremely challenging task. Consequently, the semantic and multimodal contents of the Web in this present day are, whilst being well suited for human use, still barely accessible to machines. In this work, we explore the potential of a novel semi-supervised learning model based on the combined use of random projection scaling as part of a vector space model, and support vector machines to perform reasoning on a knowledge base. The latter is developed by merging a graph representation of commonsense with a linguistic resource for the lexical representation of affect. Comparative simulation results show a significant improvement in tasks such as emotion recognition and polarity detection, and pave the way for development of future semi-supervised learning approaches to big social data analytics. (c) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:1662 / 1673
页数:12
相关论文
共 55 条
[1]   Database-friendly random projections: Johnson-Lindenstrauss with binary coins [J].
Achlioptas, D .
JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2003, 66 (04) :671-687
[2]   Faster Dimension Reduction [J].
Ailon, Nir ;
Chazelle, Bernard .
COMMUNICATIONS OF THE ACM, 2010, 53 (02) :97-104
[3]  
[Anonymous], 2013, Advances in neural information processing systems (NIPS)
[4]  
[Anonymous], 1989, Building large knowledge-based systems: Representation and inference in the Cyc project
[5]  
[Anonymous], 2006, Commonsense Reasoning
[6]  
[Anonymous], 2016, arXiv preprint arXiv:1610.08815
[7]  
[Anonymous], 1998, WordNet, DOI DOI 10.7551/MITPRESS/7287.001.0001
[8]  
[Anonymous], ARXIV13106536
[9]  
[Anonymous], J FRANKL I
[10]  
[Anonymous], 2015, SENTIC COMPUTING COM