SRL-GSM: A hybrid approach based on semantic role labeling and general statistic method for text summarization

被引:7
作者
Suanmali L. [1 ]
Salim N. [2 ]
Binwahlan M.S. [3 ]
机构
[1] Faculty of Computer Science and Technology, Suan Dusit Rajabhat University, Dusit Bangkok
[2] Faculty of Computer Science and Information Systems, Universiti Teknologi Malaysia, 81310 Skudai, Johor
[3] Faculty of Applied Science, Hadhramout University of Science and Technology, Seiyun, Hadhramout
关键词
Semantic similarity; Text summarization sentence extrzction;
D O I
10.3923/jas.2010.166.173
中图分类号
学科分类号
摘要
Sentence extraction techniques are commonly used to produce extraction summaries. The goal of text summarization based on extraction approach is to identify the most important set of sentences for the overall understanding of a given document. One of the methods to obtain suitable sentences is to assign some numerical measure of a sentence for summary called sentence weighting and then select the best ones. In this study, we propose Semantic Role Labeling (SRL) approach to improve the quality of the summary created by the general statistic method. We calculate a couple of sentence semantic similarity based on the similarity of the pair of words using WordNet thesaurus to discover the word relationship between sentences. We perform text summarization based on General Statistic Method (GSM) and then combine it with the SRL method. We compare our results with the baseline summarizer and Microsoft Word 2007 summarizers. The results show that SRL-GSM and GSM give the best average precision, recall and f-measure for creation of summaries. © 2010 AsianNetwork for Scientific Information.
引用
收藏
页码:166 / 173
页数:7
相关论文
共 25 条
  • [1] Aliruliyev R.M., A new sentence similarity measure and sentence based extractive technique for automatic text summarization, Expert Syst. Appl. Int. J. Arch, 36, pp. 7764-7772, (2009)
  • [2] Barzilay R., Elhadad M., Using lexical chains for text summarization, Proceedings of The Intelligent Scalable Text Summarization Workshop, pp. 10-17, (1997)
  • [3] Workshop On Text Summarization, (2002)
  • [4] Dao T., Simpson T., Measuring similarity between sentences, Proceedings of The Document Understanding Conference, (2005)
  • [5] Edmundson H.P., New methods in automatic extracting, J. Assoc. Comput. Machinery, 16, pp. 264-285, (1969)
  • [6] Fattah M.A., Ren F., GA, MR, FFNN, PNN and GMM based models for automatic text summarization, Comput. Speech Language, 23, pp. 126-144, (2009)
  • [7] Fellbaum C., Wordnet: An Electronic Lexical Database, (1998)
  • [8] Gong Y., Liu X., Generic text summarization using relevance measure and latent semantic analysis, Proceedings of The 24th Annual International Acm Sigir Conference On Research and Development In Information Retrieval, pp. 9-25, (2001)
  • [9] Ko Y., Seo J., An effective sentence-extraction technique using contextual information and statistical approaches for text summarization, Pattern Recognition Lett. Arch, 29, pp. 1366-1371, (2008)
  • [10] Kupiec J., Pedersen J., Chen F., A trainable document summarizer, Proceedings of The 18th Annual International Acm Sigir Conference On Research and Development In Information Retrieval, pp. 68-73, (1995)