Machine Learning and feature engineering-based study into sarcasm and irony classification with application to cyberbullying detection

被引:49
作者
Chia, Zheng Lin [1 ]
Ptaszynski, Michal [1 ]
Masui, Fumito [1 ]
Leliwa, Gniewosz [2 ]
Wroczynski, Michal [2 ]
机构
[1] Kitami Inst Technol, Dept Comp Sci, Kitami, Hokkaido, Japan
[2] Samurailabs, Gdansk, Poland
关键词
Irony detection; Sarcasm detection; Machine Learning;
D O I
10.1016/j.ipm.2021.102600
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Irony and sarcasm detection is considered a complex task in Natural Language Processing. This paper set out to explore the sarcasm and irony on Twitter, using Machine Learning and Feature Engineering techniques. First we review and clarify the definition of irony and sarcasm by discussing various studies focusing on the terms. Next the first experiment is conducted comparing between various types of classification methods including some popular classifiers for text classification task. For the second experiment, different types of data preprocessing methods were compared and analyzed. Finally, the relationship between irony, sarcasm, and cyberbullying are discussed. The results are interesting as we observed high similarity between them.
引用
收藏
页数:12
相关论文
共 64 条
  • [1] Abrams M.H., 2009, GLOSSARY LIT TERMS
  • [2] Amir I, 2016, P 20 SIGNLL C COMP N
  • [3] [Anonymous], 2017, IJCAI 2017 3 WORKSHO
  • [4] [Anonymous], 2011, P 49 ANN M ASS COMP
  • [5] [Anonymous], 2009, P ACL IJCNLP 2009 C
  • [6] [Anonymous], 1998, Metaphor and Symbol, DOI [10.1207/s15327868ms1301_1, DOI 10.1207/S15327868MS1301_1]
  • [7] Attardo S, 1999, J PRAGMATICS
  • [8] Barbieri F, 2017, MACHINE LEARNING MET
  • [9] BAZIOTIS C, 2018, P 12 INT WORKSH SEM, P613
  • [10] Research-paper recommender systems: a literature survey
    Beel, Joeran
    Gipp, Bela
    Langer, Stefan
    Breitinger, Corinna
    [J]. INTERNATIONAL JOURNAL ON DIGITAL LIBRARIES, 2016, 17 (04) : 305 - 338