Combination of loss functions for deep text classification

Cited by: 13
Authors
Hajiabadi, Hamideh [1]
Molla-Aliod, Diego [2]
Monsefi, Reza [1]
Yazdi, Hadi Sadoghi [1]
Affiliations
[1] FUM, Dept Comp, Mashhad, Razavi Khorasan, Iran
[2] Macquarie Univ, Sydney, NSW 2109, Australia
Keywords
Loss Function; Convolutional neural network (CNN); Ensemble method; Multi-class classifier; ENSEMBLE; CORRENTROPY;
DOI
10.1007/s13042-019-00982-x
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104; 0812; 0835; 1405;
Abstract
Ensemble methods have been shown to improve the results of statistical classifiers by combining multiple single learners into a strong one. In this paper, we explore the use of ensemble methods at the level of the objective function of a deep neural network. We propose a novel objective function that is a linear combination of single losses, and we integrate it into a deep neural network so that the weights of the linear combination are learned by backpropagation during training. We study the impact of this ensemble loss function on state-of-the-art convolutional neural networks for text classification. Comprehensive experiments on text classification show the effectiveness of the approach: the results demonstrate a significant improvement over the conventional state-of-the-art methods in the literature.
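The idea of a loss built as a learnable linear combination of single losses can be illustrated with a minimal sketch. The abstract does not say which base losses are combined or how the combination weights are constrained, so everything below is an assumption made for illustration: the three base losses, the softmax parameterization that keeps the weights positive and summing to one, and all function names are hypothetical and are not taken from the paper.

```python
import math


def softmax(zs):
    """Numerically stable softmax over a list of floats."""
    m = max(zs)
    exps = [math.exp(z - m) for z in zs]
    total = sum(exps)
    return [e / total for e in exps]


# Three illustrative base losses on predicted class probabilities.
# These choices are assumptions, not the losses used in the paper.
def cross_entropy(probs, target):
    return -math.log(probs[target])


def squared_error(probs, target):
    return sum((p - (1.0 if k == target else 0.0)) ** 2
               for k, p in enumerate(probs))


def margin_loss(probs, target, margin=1.0):
    return sum(max(0.0, margin - probs[target] + p)
               for k, p in enumerate(probs) if k != target)


BASE_LOSSES = [cross_entropy, squared_error, margin_loss]


def ensemble_loss(mix_logits, probs, target):
    """Weighted sum of the base losses.

    The mixing weights are softmax(mix_logits), an assumed
    parameterization that keeps them positive and normalized.
    Returns (combined loss, weights, individual loss values).
    """
    weights = softmax(mix_logits)
    values = [loss(probs, target) for loss in BASE_LOSSES]
    combined = sum(w * v for w, v in zip(weights, values))
    return combined, weights, values


def mix_gradient(weights, values):
    """Gradient of the combined loss with respect to mix_logits.

    For L = sum_i w_i * v_i with w = softmax(z):
    dL/dz_j = w_j * (v_j - L).
    """
    combined = sum(w * v for w, v in zip(weights, values))
    return [w * (v - combined) for w, v in zip(weights, values)]


if __name__ == "__main__":
    probs = softmax([2.0, 0.5, -1.0])   # hypothetical network output
    mix_logits = [0.0, 0.0, 0.0]        # start from equal weights
    loss, weights, values = ensemble_loss(mix_logits, probs, target=0)
    grads = mix_gradient(weights, values)
    # One plain gradient-descent step on the mixing parameters only.
    mix_logits = [z - 0.1 * g for z, g in zip(mix_logits, grads)]
    print(loss, ensemble_loss(mix_logits, probs, target=0)[0])
```

In a real network the same gradient would be produced by automatic differentiation together with the gradients of the network parameters. Note that minimizing an unconstrained weighted sum simply shifts weight toward whichever base loss is currently smallest, so a practical formulation needs some constraint or regularizer on the weights; the softmax used here is only one possible choice.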
Pages: 751-761
Number of pages: 11