Dynamic Embedding Projection-Gated Convolutional Neural Networks for Text Classification

被引:73
作者
Tan, Zhipeng [1 ,2 ]
Chen, Jing [1 ,2 ]
Kang, Qi [1 ,2 ]
Zhou, MengChu [3 ,4 ]
Abusorrah, Abdullah [4 ,5 ]
Sedraoui, Khaled [4 ,5 ]
机构
[1] Tongji Univ, Coll Elect & Informat Engn, Dept Control Sci & Engn, Shanghai 201804, Peoples R China
[2] Tongji Univ, Shanghai Inst Intelligent Sci & Technol, Shanghai 200092, Peoples R China
[3] New Jersey Inst Technol, Dept Elect & Comp Engn, Newark, NJ 07102 USA
[4] King Abdulaziz Univ, Ctr Res Excellence Renewable Energy & Power Syst, Jeddah 21481, Saudi Arabia
[5] King Abdulaziz Univ, Dept Elect & Comp Engn, Jeddah 21481, Saudi Arabia
关键词
Road transportation; Logic gates; Computer architecture; Convolution; Training; Standards; Electronic mail; Convolutional neural network (CNN); dynamic embedding projection gate; multi-class and multi-label text classification; natural language processing (NLP); MODEL;
D O I
10.1109/TNNLS.2020.3036192
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text classification is a fundamental and important area of natural language processing for assigning a text into at least one predefined tag or category according to its content. Most of the advanced systems are either too simple to get high accuracy or centered on using complex structures to capture the genuinely required category information, which requires long time to converge during their training stage. In order to address such challenging issues, we propose a dynamic embedding projection-gated convolutional neural network (DEP-CNN) for multi-class and multi-label text classification. Its dynamic embedding projection gate (DEPG) transforms and carries word information by using gating units and shortcut connections to control how much context information is incorporated into each specific position of a word-embedding matrix in a text. To our knowledge, we are the first to apply DEPG over a word-embedding matrix. The experimental results on four known benchmark datasets display that DEP-CNN outperforms its recent peers.
引用
收藏
页码:973 / 982
页数:10
相关论文
共 52 条
[1]  
Adhikari A, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4046
[2]  
[Anonymous], 1998, 98 IDSIA
[3]  
[Anonymous], 2016, arXiv preprint arXiv:1602.02410
[4]   AUTOMATED LEARNING OF DECISION RULES FOR TEXT CATEGORIZATION [J].
APTE, C ;
DAMERAU, F ;
WEISS, SM .
ACM TRANSACTIONS ON INFORMATION SYSTEMS, 1994, 12 (03) :233-251
[5]  
Bishop C.M., 1995, NEURAL NETWORKS PATT
[6]  
Bojanowski P., 2017, Trans. Assoc. Comput. Linguistics, V5, P135, DOI [DOI 10.1162/TACLA00051, 10.1162/tacl_a_00051, DOI 10.1162/TACL_A_00051]
[7]   Unsupervised Domain Adaptation With Adversarial Residual Transform Networks [J].
Cai, Guanyu ;
Wang, Yuqin ;
He, Lianghua ;
Zhou, MengChu .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (08) :3073-3086
[8]  
Chung J, 2014, CORR, P1
[9]  
Dauphin YN, 2017, PR MACH LEARN RES, V70
[10]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171