A New Approach for Multi-Document Summarization based on Latent Semantic Analysis

被引:3
作者
Xiong, Shuchu [1 ]
Luo, Yihui [1 ]
机构
[1] Hunan Univ Commerce, Coll Comp & Informat, Changsha, Hunan, Peoples R China
来源
2014 SEVENTH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID 2014), VOL 1 | 2014年
关键词
multi-document summrization; latent semantic analysis; singular value decomposition; forward selection;
D O I
10.1109/ISCID.2014.27
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-document summary plays an increasingly important role with the exponential document growth on the web. Among many traditional multi-document summarization techniques, the latent semantic analysis (LSA) is a unique duo to its using latent semantic information instead of original feature, which results in a better performance. However, since those approaches based on LSA evaluate and select sentence individually, none of them is able to remove the redundant sentences. In this paper, we propose a new method to evaluate a sentence subset based on its capacity to reproduce term projections on right singular vectors. Finally, the experiments on DUC2002 and DUC2004 datasets validate the effectiveness of our proposed methods.
引用
收藏
页码:177 / 180
页数:4
相关论文
共 11 条
[1]   GenDocSum plus MCLR: Generic document summarization based on maximum coverage and less redundancy [J].
Alguliev, Rasim M. ;
Aliguliyev, Ramiz M. ;
Hajirahimova, Makrufa S. .
EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (16) :12460-12473
[2]  
Carbonell J., 1998, Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, P335, DOI 10.1145/290941.291025
[3]  
Erkan G., 2004, Proceedings of EMNLP, V4
[4]  
Gong Y.H., 2002, P ACM SIGIR NEW ORL, P19
[5]   Automatic generic document summarization based on non-negative matrix factorization [J].
Lee, Ju-Hong ;
Park, Sun ;
Ahn, Chan-Min ;
Kim, Daeho .
INFORMATION PROCESSING & MANAGEMENT, 2009, 45 (01) :20-34
[6]  
McDonald R, 2007, LECT NOTES COMPUT SC, V4425, P557
[7]   Centroid-based summarization of multiple documents [J].
Radev, DR ;
Jing, HY ;
Stys, M ;
Tam, D .
INFORMATION PROCESSING & MANAGEMENT, 2004, 40 (06) :919-938
[8]  
Shen D, 2007, 20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P2868
[9]  
Steinberger J., 2004, Proc. ISIM, V8, P93
[10]   Two uses of anaphora resolution in summarization [J].
Steinberger, Josef ;
Poesio, Massimo ;
Kabadjov, Mijail A. ;
Jezek, Karel .
INFORMATION PROCESSING & MANAGEMENT, 2007, 43 (06) :1663-1680