Sentence Embedding Approach using LSTM Auto-encoder for Discussion Threads Summarization

被引:3
作者
Khan, Abdul Wali [1 ]
Al-Obeidat, Feras [2 ]
Khalid, Afsheen [1 ]
Amin, Adnan [1 ]
Moreira, Fernando [3 ,4 ]
机构
[1] Inst Management Sci Peshawar, Ctr Excellence Informat Technol, Peshawar, Pakistan
[2] Zayed Univ Abu Dhabi, Coll Technol Innovat, Abu Dhabi, U Arab Emirates
[3] Univ Portucalense, IJP, REMIT, Porto, Portugal
[4] Univ Aveiro Portugal, IEETA, Aveiro, Portugal
关键词
Sentence embedding; LSTM Auto-encoder; CBOW; Deep learning; Machine learning; NLP; DOCUMENT;
D O I
10.2298/CSIS221210055K
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Online discussion forums are repositories of valuable information where users interact and articulate their ideas and opinions, and share experiences about numerous topics. These online discussion forums are internet-based online communities where users can ask for help and find the solution to a problem. A new user of online discussion forums becomes exhausted from reading the significant number of irrelevant replies in a discussion. An automated discussion thread summarizing system (DTS) is necessary to create a candid view of the entire discussion of a query. Most of the previous approaches for automated DTS use the continuous bag of words (CBOW) model as a sentence embedding tool, which is poor at capturing the overall meaning of the sentence and is unable to grasp word dependency. To overcome these limitations, we introduce the LSTM Auto-encoder as a sentence embedding technique to improve the performance of DTS. The empirical result in the context of the proposed approach's average precision, recall, and F-measure with respect to ROGUE-1 and ROUGE-2 of two standard experimental datasets demonstrates the effectiveness and efficiency of the proposed approach and outperforms the state-of-the-art CBOW model in sentence embedding tasks and boost the performance of the automated DTS model.
引用
收藏
页码:1367 / 1387
页数:21
相关论文
共 44 条
[41]   Text summarization using unsupervised deep learning [J].
Yousefi-Azar, Mahmood ;
Hamey, Len .
EXPERT SYSTEMS WITH APPLICATIONS, 2017, 68 :93-105
[42]   Single-document and multi-document summarization techniques for email threads using sentence compression [J].
Zajic, David M. ;
Dorr, Bonnie J. ;
Lin, Jimmy .
INFORMATION PROCESSING & MANAGEMENT, 2008, 44 (04) :1600-1610
[43]   Automatic Twitter Topic Summarization With Speech Acts [J].
Zhang, Renxian ;
Li, Wenjie ;
Gao, Dehong ;
Ouyang, You .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (03) :649-658
[44]  
Zhou LW, 2006, PROCEEDINGS OF THE 7TH INTERNATIONAL SCIENTIFIC CONFERENCE ELECTRIC POWER ENGINEERING 2006, P237