Ontology-enriched multi-document summarization in disaster management using submodular function

被引:15
作者
Wu, Keshou [1 ]
Li, Lei [2 ]
Li, Jingxuan [2 ]
Li, Tao [2 ]
机构
[1] Xiamen Univ Technol, Dept Comp Sci & Technol, Xiamen 361024, Peoples R China
[2] Florida Int Univ, Sch Comp & Informat Sci, Miami, FL 33199 USA
基金
美国国家科学基金会; 中国国家自然科学基金;
关键词
Multi-document summarization; Ontology; Submodularity;
D O I
10.1016/j.ins.2012.10.019
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In disaster management, a myriad of news and reports relevant to the disaster may be recorded in the form of text document. A challenging problem is how to provide concise and informative reports from a large collection of documents, to help domain experts analyze the trend of the disaster. In this paper, we explore the feasibility of using a domain-specific ontology to facilitate the summarization task, and propose TELESUM, an ontology-enriched multi-document summarization approach, where the submodularity hidden in among ontological concepts is investigated. Empirical experiments on the collection of press releases by Miami-Dade County Department of Emergency Management during Hurricane Wilma in 2005 demonstrate the efficacy and effectiveness Of TELESUM in disaster management. Further, our proposed framework can be extended to summarizing general documents by employing public ontologies, e.g., Wikipedia. Extensive evaluation on the generalized framework is conducted on DUC04-05 datasets, and shows that our method is competitive with other approaches. (C) 2012 Elsevier Inc. All rights reserved.
引用
收藏
页码:118 / 129
页数:12
相关论文
共 36 条
[1]  
[Anonymous], 2008, Proceedings of SIGIR, DOI 10.1145/1390334.1390384
[2]  
[Anonymous], 2000, Speech and language processing: An introduction to natural language processing, computational linguistics, and speech recognition
[3]   A complex network approach to text summarization [J].
Antiqueira, Lucas ;
Oliveira, Osvaldo N., Jr. ;
Costa, Luciano da Fontoura ;
Volpe Nunes, Maria das Gracas .
INFORMATION SCIENCES, 2009, 179 (05) :584-599
[4]  
Bollegala D., 2012, INFORM SCI
[5]   A spectral analysis approach to document summarization: Clustering and ranking sentences simultaneously [J].
Cai, Xiaoyan ;
Li, Wenjie .
INFORMATION SCIENCES, 2011, 181 (18) :3816-3827
[6]  
Carbonell J., 1998, Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, P335, DOI 10.1145/290941.291025
[7]  
Daumé H, 2006, COLING/ACL 2006, VOLS 1 AND 2, PROCEEDINGS OF THE CONFERENCE, P305
[8]  
Erkan G., 2004, Proceedings of EMNLP, V4
[9]  
Filatova Elena., 2006, Proceed- ings of the COLING/ACL on Main conference poster ses- sions, P207
[10]  
Goldstein J., 2000, NAACL ANLP 2000 WORK