On the cost-effectiveness of neural and non-neural approaches and representations for text classification: A comprehensive comparative study

被引:45
作者
Cunha, Washington [1 ]
Mangaravite, Vitor [1 ]
Gomes, Christian [1 ]
Canuto, Sergio [1 ]
Resende, Elaine [1 ]
Nascimento, Cecilia [1 ]
Viegas, Felipe [1 ]
Franca, Celso [1 ]
Martins, Wellington Santos [3 ]
Almeida, Jussara M. [1 ]
Rosa, Thierson [3 ]
Rocha, Leonardo [2 ]
Goncalves, Marcos Andre [1 ]
机构
[1] Univ Fed Minas Gerais, Belo Horizonte, MG, Brazil
[2] Univ Fed Sao Joao del Rei, Sao Joao Del Rei, MG, Brazil
[3] Univ Fed Goias, Goiania, Go, Brazil
关键词
Text classification; Comparative study; Systematic review;
D O I
10.1016/j.ipm.2020.102481
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article brings two major contributions. First, we present the results of a critical analysis of recent scientific articles about neural and non-neural approaches and representations for automatic text classification (ATC). This analysis is focused on assessing the scientific rigor of such studies. It reveals a profusion of potential issues related to the experimental procedures including: (i) use of inadequate experimental protocols, including no repetitions for the sake of assessing variability and generalization; (ii) lack of statistical treatment of the results; (iii) lack of details on hyperparameter tuning, especially of the baselines; (iv) use of inadequate measures of classification effectiveness (e.g., accuracy with skewed distributions). Second, we provide some organization and ground to the field by performing a comprehensive and scientifically sound comparison of recent neural and non-neural ATC solutions. Our study provides a more complete picture by looking beyond classification effectiveness, taking the trade-off between model costs (i.e., training time) into account. Our evaluation is guided by scientific rigor, which, as our literature review shows, is missing in a large body of work. Our experimental results, based on more than 1500 measurements, reveal that in the smaller datasets, the simplest and cheaper non-neural methods are among the best performers. In the larger datasets, neural Transformers perform better in terms of classification effectiveness. However, when compared to the best (properly tuned) non-neural solutions, the gains in effectiveness are not very expressive, especially considering the much longer training times (up to 23x slower). Our findings call for a self-reflection of best practices in the field, from the way experiments are conducted and analyzed to the choice of proper baselines for each situation and scenario.
引用
收藏
页数:24
相关论文
共 61 条
[1]   A Fast Similarity Search kNN for Textual Datasets [J].
Amorim, Leonardo Afonso ;
Freitas, Mateus F. ;
da Silva, Paulo Henrique ;
Martins, Wellington S. .
2018 SYMPOSIUM ON HIGH PERFORMANCE COMPUTING SYSTEMS (WSCAD 2018), 2018, :229-236
[2]  
[Anonymous], 2007, BIOSTAT ANAL
[3]  
[Anonymous], 2011, Acm T. Intel. Syst. Tec., DOI DOI 10.1145/1961189.1961199
[4]  
Armstrong T. G., 2009, P 18 ACM C INF KNOWL, P601, DOI DOI 10.1145/1645953.1646031
[5]  
Bojanowski P., 2017, Trans. Assoc. Comput. Linguist, V5, P135, DOI [DOI 10.1162/TACL_A_00051, 10.1162/tacla00051]
[6]   Stacking Bagged and Boosted Forests for Effective Automated Classification [J].
Campos, Raphael ;
Canuto, Sergio ;
Salles, Thiago ;
de Sa, Clebson C. A. ;
Goncalves, Marcos Andre .
SIGIR'17: PROCEEDINGS OF THE 40TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2017, :105-114
[7]   Similarity-Based Synthetic Document Representations for Meta-Feature Generation in Text Classification [J].
Canuto, Sergio ;
Salles, Thiago ;
Rosa, Thierson C. ;
Goncalves, Marcos A. .
PROCEEDINGS OF THE 42ND INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '19), 2019, :355-364
[8]   A Thorough Evaluation of Distance-Based Meta-Features for Automated Text Classification [J].
Canuto, Sergio ;
Sousa, Daniel Xavier ;
Goncalves, Marcos Andre ;
Rosa, Thierson Couto .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2018, 30 (12) :2242-2256
[9]   Exploiting New Sentiment-Based Meta-level Features for Effective Sentiment Analysis [J].
Canuto, Sergio ;
Goncalves, Marcos Andre ;
Benevenuto, Fabricio .
PROCEEDINGS OF THE NINTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING (WSDM'16), 2016, :53-62
[10]  
Conneau A, 2017, 15TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2017), VOL 1: LONG PAPERS, P1107