A Pattern-Based Approach for Sarcasm Detection on Twitter

Cited by: 112
Authors
Bouazizi, Mondher [1]
Otsuki, Tomoaki [1]
Affiliations
[1] Keio Univ, Grad Sch Sci & Technol, Yokohama, Kanagawa 2238522, Japan
Keywords
Twitter; sentiment analysis; sarcasm detection; machine learning; irony
DOI
10.1109/ACCESS.2016.2594194
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Sarcasm is a sophisticated form of irony widely used in social networks and microblogging websites. It is usually used to convey implicit information within the message a person transmits. Sarcasm might be used for different purposes, such as criticism or mockery. However, it is hard even for humans to recognize. Recognizing sarcastic statements can therefore be very useful for improving automatic sentiment analysis of data collected from microblogging websites or social networks. Sentiment analysis refers to the identification and aggregation of attitudes and opinions expressed by Internet users toward a specific topic. In this paper, we propose a pattern-based approach to detect sarcasm on Twitter. We propose four sets of features that cover the different types of sarcasm we defined, and we use them to classify tweets as sarcastic or non-sarcastic. Our proposed approach reaches an accuracy of 83.1% with a precision of 91.1%. We also study the importance of each of the proposed sets of features and evaluate its added value to the classification. In particular, we emphasize the importance of pattern-based features for the detection of sarcastic statements.
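The abstract reports two different metrics, accuracy and precision. As a minimal sketch of how they relate, the following computes both from confusion-matrix counts for a binary sarcastic/non-sarcastic classifier. The counts and the names `Confusion`, `accuracy`, and `precision` are hypothetical, chosen only for illustration; they are not taken from the paper and do not reproduce its results.

```python
from dataclasses import dataclass


@dataclass
class Confusion:
    """Confusion-matrix counts, with 'sarcastic' as the positive class."""
    tp: int  # sarcastic tweets predicted sarcastic
    fp: int  # non-sarcastic tweets predicted sarcastic
    tn: int  # non-sarcastic tweets predicted non-sarcastic
    fn: int  # sarcastic tweets predicted non-sarcastic


def accuracy(c: Confusion) -> float:
    """Share of all tweets that were classified correctly."""
    total = c.tp + c.fp + c.tn + c.fn
    return (c.tp + c.tn) / total if total else 0.0


def precision(c: Confusion) -> float:
    """Share of tweets predicted sarcastic that really are sarcastic."""
    predicted_positive = c.tp + c.fp
    return c.tp / predicted_positive if predicted_positive else 0.0


# Hypothetical counts (an assumption for illustration, not the paper's data).
counts = Confusion(tp=80, fp=10, tn=90, fn=20)

print(f"accuracy  = {accuracy(counts):.3f}")
print(f"precision = {precision(counts):.3f}")
```

With counts like these, precision exceeds accuracy because the classifier rarely labels a non-sarcastic tweet as sarcastic but still misses some sarcastic ones.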
Pages: 5477-5488
Number of pages: 12