Extending The Performance of Extractive Text Summarization By Ensemble Techniques

被引:0
作者
Bharadwaj, Aprameya [1 ]
Srinivasan, Arvind [1 ]
Kasi, Anish [1 ]
Das, Bhaskarjyoti [1 ]
机构
[1] PES Univ, Dept CSE, Bangalore, Karnataka, India
来源
2019 11TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING (ICOAC 2019) | 2019年
关键词
Text Summarization; Ensemble; Soft Voting; ROUGE; LexRank; TextRank; Luhn; LSA; KL;
D O I
10.1109/icoac48765.2019.246854
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Text summarization techniques help in automatically shortening the length of text data as well as fluently and accurately passing on the intended message. Extractive Text. summarization methods have been well-researched but each such algorithm produces a potentially different summary. There is no standalone "best" algorithm using this method. The algorithm proposed in this paper gives a way to combine the strengths of all the existing algorithms to make text summarization more robust. Though ensemble optimization is a popular technique in machine learning, it has not been tried exhaustively in text summarization. Specifically, soft voting has not been attempted so far. In this paper, we extends the concept of voting classifiers in Machine Learning in text domain and propose a novel optimized ensemble based approach to text summarization. The quality of the summary generated is evaluated based on the Recall Oriented Understudy for Gisting Evaluation (ROUGE) metric. As seen in the results, the proposed model outperforms baseline extractive text summarization models such as textrank, lexrank, LSA, Luhn and KL summarizers by more than 15%.
引用
收藏
页码:282 / 288
页数:7
相关论文
共 25 条
[1]   COSUM: Text summarization based on clustering and optimization [J].
Alguliyev, Rasim M. ;
Aliguliyev, Ramiz M. ;
Isazade, Nijat R. ;
Abdi, Asad ;
Idris, Norisma .
EXPERT SYSTEMS, 2019, 36 (01)
[2]  
Amrieh E.A., 2016, International Journal of Database Theory and Application, V9, P119, DOI [DOI 10.14257/IJDTA.2016.9.8.13, 10.14257/ijdta.2016.9.8.13]
[3]  
Ashraf Mudasir, 2019, Emerging Technologies in Data Mining and Information Security. Proceedings of IEMIS 2018. Advances in Intelligent Systems and Computing (AISC 813), P321, DOI 10.1007/978-981-13-1498-8_29
[4]  
Cheng Jianpeng, 2016, Long Papers, DOI [10.18653/V1/P16-1046, DOI 10.18653/V1/P16-1046]
[5]   Ensemble Algorithms for Microblog Summarization [J].
Dutta, Soumi ;
Chandra, Vibhash ;
Mehra, Kanav ;
Das, Asit Kumar ;
Chakraborty, Tanmoy ;
Ghosh, Saptarshi .
IEEE INTELLIGENT SYSTEMS, 2018, 33 (03) :4-14
[6]   LexRank: Graph-based lexical centrality as salience in text summarization [J].
Erkan, G ;
Radev, DR .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2004, 22 :457-479
[7]   Word-sentence co-ranking for automatic extractive text summarization [J].
Fang, Changjian ;
Mu, Dejun ;
Deng, Zhenghong ;
Wu, Zhiang .
EXPERT SYSTEMS WITH APPLICATIONS, 2017, 72 :189-195
[8]   Assessing sentence scoring techniques for extractive text summarization [J].
Ferreira, Rafael ;
Cabral, Luciano de Souza ;
Lins, Rafael Dueire ;
Pereira e Silva, Gabriel ;
Freitas, Fred ;
Cavalcanti, George D. C. ;
Lima, Rinaldo ;
Simske, Steven J. ;
Favaro, Luciano .
EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (14) :5755-5764
[9]  
Gong Y., 2001, SIGIR 2001, P19, DOI DOI 10.1145/383952.383955
[10]  
Gupta Vishal, 2010, Journal of Emerging Technologies in Web Intelligence, V2, P258, DOI 10.4304/jetwi.2.3.258-268