Generative Multi-Task Learning for Text Classification

被引:9
作者
Zhao, Wei [1 ,2 ]
Gao, Hui [1 ]
Chen, Shuhui [1 ]
Wang, Nan [3 ]
机构
[1] Natl Univ Def Technol, Coll Comp Sci, Changsha 410073, Peoples R China
[2] Hunan Police Acad, Changsha 410138, Peoples R China
[3] Natl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha 410073, Peoples R China
关键词
Generative model; hierarchical classification; multi-label classification; multi-task learning;
D O I
10.1109/ACCESS.2020.2991337
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multi-task learning leverages potential correlations among related tasks to extract common features and yield performance gains. In this paper, a generative multi-task learning (MTL) approach for text classification and categorization is proposed, which is composed of a shared encoder, a multi-label classification decoder and a hierarchical categorization decoder. In the two decoders, a label-order-independent multi-label classification loss function and a hierarchical structure mask matrix are introduced. Experiments conducted on the real-world public security dataset show that the proposed approach has obvious advantages over the baseline approaches and can enhance the semantic association between the results of the classification and categorization tasks.
引用
收藏
页码:86380 / 86387
页数:8
相关论文
共 29 条
[1]  
[Anonymous], INT JOINT C NEUR NET
[2]  
[Anonymous], 2014, NEURIPS
[3]  
[Anonymous], INT J DISTRIB SENSOR
[4]  
[Anonymous], P 48 ANN M ASS COMP
[5]  
[Anonymous], 2018, ARXIV180201697
[6]  
[Anonymous], 2017, ARXIV170301619
[7]  
[Anonymous], 1997, Hierarchically classifying documents using very few words
[8]   A model of inductive bias learning [J].
Baxter, J .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2000, 12 :149-198
[9]   Multitask learning [J].
Caruana, R .
MACHINE LEARNING, 1997, 28 (01) :41-75
[10]   Reduction strategies for hierarchical multi-label classification in protein function prediction [J].
Cerri, Ricardo ;
Barros, Rodrigo C. ;
de Carvalho, Andre C. P. L. F. ;
Jin, Yaochu .
BMC BIOINFORMATICS, 2016, 17