System of automatic chinese webpage summarization based on the random walk algorithm of dynamic programming

被引:0
作者
Wang, Feng [1 ,4 ]
Qin, XiaoMing [2 ,5 ]
Wang, Yizhen [3 ]
Wei, Xinjiang [1 ,4 ]
机构
[1] School of Mathermatics and Statistics Science, Ludong University, Yantai
[2] College of Computer and Information Engineering, Jiaozuo Teachers College, Jiaozuo
[3] Tilburg University, Tilburg
[4] Key Laboratory of Language Resource Development and Application of Shandong Province, Yantai
[5] Institute of Computer Application Technology, Jiaozuo Teachers College, Jiaozuo
来源
Open Cybernetics and Systemics Journal | 2015年 / 9卷
基金
中国国家自然科学基金;
关键词
Automatic summarization; Random walk; Semantic relatedness; Seme based graph; Webpage;
D O I
10.2174/1874110X01509011315
中图分类号
学科分类号
摘要
As the Internet becomes more and more deeply connected with our life, the Internet has brought together mass text material, and it is still in explosive growth. In order to quickly and accurately to help users find the required content, the traditional solution is to use a search engine. However, the results of existing automatic webpage summarization systems for search engine are of low quality. Because they just based on statistical method, gather some sentences in the web document beside the search phrases. Neither symbolizes the subject of the document, nor take into account the user search phrases. According to the shortages, An automatic webpage summarization systems is realized. On the basis of the work done, this paper proposed an automatic text summarization method based on relation graph and text structure analysis. This method firstly segment text into semantic paragraphs. For each semantic paragraph, a subject term discover method based on relation graph analysis is proposed. At last, both search phrase and document subject are take into account, it extracts summary according to the guidance of the subject terms. © Wang et al.
引用
收藏
页码:1315 / 1322
页数:7
相关论文
共 8 条
[1]  
Banerjee S., Pedersen T., Extended gloss overlaps as a measure of semantic relatedness, IJCAI, 3, pp. 805-810, (2003)
[2]  
Gabrilovich E., Markovitch S., Computing semantic relatedness using wikipedia-based explicit semantic analysis, IJCAI, 7, pp. 1606-1611, (2007)
[3]  
Liu Q., Li S., Word similarity computing based on how-net, Computational Linguistics and Chinese Language Processing, 7, 2, pp. 59-76, (2002)
[4]  
Berkhin P., A survey on pagerank computing, Internet Mathematics, 2, 1, pp. 73-120, (2005)
[5]  
Yi W., Xiaolin W., Algorithm for words' semantic relevancy based on modified algorithm for sememes' relevancy, Jouranal of The China Society For Scientific And Technical Information, 31, 12, pp. 1271-1275, (2012)
[6]  
Yun Xu X., Zhang F.F., Semantic relevancy computing based on hownet, Transactions of Beijing Institute of Technology, 25, 5, pp. 411-414, (2005)
[7]  
Hughes T., Ramage D., Lexical Semantic Relatedness with Random Graph Walks, pp. 581-589, (2007)
[8]  
Shuqin Z., Yangyang W., The model of words relation computing based on the HowNet, Microcomputer and ITS Applications, 31, 8, pp. 77-80, (2012)