Text classification algorithms for mining unstructured data: a SWOT analysis

被引:31
作者
Kumar A. [1 ]
Dabas V. [2 ]
Hooda P. [1 ]
机构
[1] Department of Computer Science and Engineering, Delhi Technological University, New Delhi
[2] Department of Computer Science, College of Computing and Informatics, University of North Carolina, Charlotte
关键词
Sentiment analysis; Social media; SWOT; Text classification; Text mining; Web mining;
D O I
10.1007/s41870-017-0072-1
中图分类号
学科分类号
摘要
It has become increasingly crucial and imperative to facilitate knowledge extraction for decision support and deliver targeted information to analysts that span wide application domains. Interestingly, the buzzing term “big data” which is estimated to be 90% unstructured further makes it difficult to tap and analyze information with traditional tools. Text mining entails defining a process which transforms and substitutes this unstructured data into a structured one to discover knowledge. Use of classification algorithms to intelligently mine text has been studied extensively across literature. This study predominantly surveys the text classification algorithms employed in the process of mining unstructured data to report a conclusive analysis on the trend of their use in terms of their respective strengths, weaknesses, opportunities and threats (SWOT). The scope of these algorithms is then explored apropos the application area of sentiment analysis, a typical text classification task. A mapping which determines the unexplored social media technologies and the extent of use of these algorithms within respective social media is proffered to give an insight to the amount of work that has been done in the domain of machine learning based sentiment analysis on social media. © 2018, Bharati Vidyapeeth's Institute of Computer Applications and Management.
引用
收藏
页码:1159 / 1169
页数:10
相关论文
共 35 条
[1]  
Analysis of unstructured data: Applications of text analytics and sentiment mining, SAS. Retrieved, 24, (2016)
[2]  
Aggarwal C.C., Zhai C.-X., A survey of text classification algorithms, book chapter in mining text data, (2012)
[3]  
Rajman M., Besancon R., Text mining—knowledge extraction from unstructured textual data, Proceeding of 6Th Conference of International Federation of Classification Societies (IFCS-98), pp. 473-480, (1998)
[4]  
Bhatia M.P.S., Kumar A., Beniwal R., SWOT analysis of ontology driven software engineering, Indian J Sci Technol, 9, 38, (2016)
[5]  
The Data Mining Encyclopedia, (2006)
[6]  
Raghu R., Gehrke J., Database management systems, (2000)
[7]  
Kuhlen R., Information and pragmatic value-adding: language games and information science, Comput Humanit, 25, pp. 93-101, (1991)
[8]  
Nunes S., State of the art in web information retrieval. Technical Report, (2006)
[9]  
Kosala R., Blockeel H., Web mining research: a survey, SIGKDD Explor, 2, 1, pp. 1-15, (2000)
[10]  
Bhatia M.P.S., Kumar A., Information retrieval and machine learning: supporting technologies for web mining research and practice, Webology, 5, 2, (2008)