Novel Efficient RNN and LSTM-Like Architectures: Recurrent and Gated Broad Learning Systems and Their Applications for Text Classification

被引:142
作者
Du, Jie [1 ]
Vong, Chi-Man [2 ]
Chen, C. L. Philip [3 ,4 ]
机构
[1] Shenzhen Univ, Sch Biomed Engn,Hlth Sci Ctr, Natl Reg Key Technol Engn Lab Med Ultrasound, Guangdong Key Lab Biomed Measurements & Ultrasoun, Shenzhen 518060, Peoples R China
[2] Univ Macau, Dept Comp & Informat Sci, Macau, Peoples R China
[3] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou 510641, Peoples R China
[4] Univ Macau, Fac Sci & Technol, Macau, Peoples R China
基金
中国国家自然科学基金;
关键词
Logic gates; Learning systems; Training; Computer architecture; Recurrent neural networks; Task analysis; Broad learning system (BLS); sequence information; simultaneous learning; text classification; word importance; NEURAL-NETWORKS; MACHINE;
D O I
10.1109/TCYB.2020.2969705
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
High accuracy of text classification can be achieved through simultaneous learning of multiple information, such as sequence information and word importance. In this article, a kind of flat neural networks called the broad learning system (BLS) is employed to derive two novel learning methods for text classification, including recurrent BLS (R-BLS) and long short-term memory (LSTM)-like architecture: gated BLS (G-BLS). The proposed two methods possess three advantages: 1) higher accuracy due to the simultaneous learning of multiple information, even compared to deep LSTM that extracts deeper but single information only; 2) significantly faster training time due to the noniterative learning in BLS, compared to LSTM; and 3) easy integration with other discriminant information for further improvement. The proposed methods have been evaluated over 13 real-world datasets from various types of text classification. From the experimental results, the proposed methods achieve higher accuracies than LSTM while taking significantly less training time on most evaluated datasets, especially when the LSTM is in deep architecture. Compared to R-BLS, G-BLS has an extra forget gate to control the flow of information (similar to LSTM) to further improve the accuracy on text classification so that G-BLS is more effective while R-BLS is more efficient.
引用
收藏
页码:1586 / 1597
页数:12
相关论文
共 40 条
[1]  
[Anonymous], 2016, INT JOINT C ARTIFICI
[2]  
[Anonymous], 2008, P ICML
[3]  
[Anonymous], 2014, EMNLP, DOI DOI 10.3115/V1
[4]   Feed-forward neural networks [J].
Bebis, George ;
Georgiopoulos, Michael .
IEEE Potentials, 1994, 13 (04) :27-31
[5]  
Bird Steven, 2009, Natural language processing with Python: analyzing text with the natural language toolkit
[6]   Broad Learning System: An Effective and Efficient Incremental Learning System Without the Need for Deep Architecture [J].
Chen, C. L. Philip ;
Liu, Zhulin .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (01) :10-24
[7]  
Chung J., 2014, NIPS WORKSH DEEP LEA
[8]  
Collobert R, 2011, J MACH LEARN RES, V12, P2493
[9]  
Conneau Alexis, 2016, arXiv:1606.01781
[10]  
Devlin, 2018, CORR