Deep Refinement: capsule network with attention mechanism-based system for text classification

被引:30
作者
Jain, Deepak Kumar [1 ]
Jain, Rachna [2 ]
Upadhyay, Yash [2 ]
Kathuria, Abhishek [2 ]
Lan, Xiangyuan [3 ]
机构
[1] Chongqing Univ Posts & Telecommun, Key Lab Intelligent Air Ground Cooperat Control U, Coll Automat, Chongqing, Peoples R China
[2] Bharati Vidyapeeths Coll Engn, Dept Comp Sci & Engn, New Delhi, India
[3] Hong Kong Baptist Univ, Dept Comp Sci, Kowloon, Hong Kong, Peoples R China
关键词
Text classification; Capsule; Attention; LSTM; GRU; Neural network; NLP;
D O I
10.1007/s00521-019-04620-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most of the text in the questions of community question-answering systems does not consist of a definite mechanism for the restriction of inappropriate and insincere content. A given piece of text can be insincere if it asserts false claims or assumes something which is debatable or has a non-neutral or exaggerated tone about an individual or a group. In this paper, we propose a pipeline called Deep Refinement which utilizes some of the state-of-the-art methods for information retrieval from highly sparse data such as capsule network and attention mechanism. We have applied the Deep Refinement pipeline to classify the text primarily into two categories, namely sincere and insincere. Our novel approach 'Deep Refinement' provides a system for the classification of such questions in order to ensure enhanced monitoring and information quality. The database used to understand the real concept of what actually makes up sincere and insincere includes quora insincere question dataset. Our proposed question classification method outperformed previously used text classification methods, as evident from the F1 score of 0.978.
引用
收藏
页码:1839 / 1856
页数:18
相关论文
共 42 条
[1]  
[Anonymous], 2019, QUORA INSINCERE QUES
[2]  
[Anonymous], IEEE ACCESS
[3]  
[Anonymous], ARXIV180400538
[4]  
[Anonymous], 2010 2 INT C COMP RE
[5]  
[Anonymous], ARXIV151108630
[6]  
[Anonymous], ARXIV190403100
[7]  
[Anonymous], ARXIV180400968
[8]  
[Anonymous], ARXIV150406580
[9]  
[Anonymous], 2019, IEEE T CYBERNETICS, DOI DOI 10.1109/TCYB.2018.2831447
[10]  
[Anonymous], 2018, 6 INT C LEARN REPR I