IKDSumm: Incorporating key-phrases into BERT for extractive disaster tweet summarization

被引:6
作者
Garg, Piyush Kumar [1 ]
Chakraborty, Roshni [2 ]
Gupta, Srishti [1 ]
Dandapat, Sourav Kumar [1 ]
机构
[1] Indian Inst Technol Patna, Dept Comp Sci & Engn, Dayalpur Daulatpur, Bihar, India
[2] Univ Tartu, Inst Comp Sci, Tartu, Estonia
关键词
Social media; Disaster events; Tweet summarization; Key-phrase extraction; INFORMATION; ONTOLOGY;
D O I
10.1016/j.csl.2024.101649
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Online social media platforms, such as Twitter, are one of the most valuable sources of information during disaster events. Humanitarian organizations, government agencies, and volunteers rely on a concise compilation of such information for effective disaster management. Existing methods to make such compilations are mostly generic summarization approaches that do not exploit domain knowledge. In this paper, we propose a disaster-specific tweet summarization framework, IKDSumm, , which initially identifies the crucial and important information from each tweet related to a disaster through key-phrases of that tweet. We identify these key- phrases by utilizing the domain knowledge (using existing ontology) of disasters without any human intervention. Further, we utilize these key-phrases to automatically generate a summary of the tweets. Therefore, given tweets related to a disaster, IKDSumm ensures fulfillment of the summarization key objectives, such as information coverage, relevance, and diversity in summary without any human intervention. We evaluate the performance of IKDSumm with 8 state-of-the-art techniques on 12 disaster datasets. The evaluation results show that IKDSumm outperforms existing techniques by approximately 2 - 79% in terms of ROUGE-N F1-score.
引用
收藏
页数:15
相关论文
共 66 条
[1]  
Ahuja O, 2022, PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), P6494
[2]  
Alam F, 2021, Proceedings of the International AAAI Conference on Web and Social Media, V15, P933, DOI [10.1609/icwsm.v15i1.18116, DOI 10.1609/ICWSM.V15I1.18116, 10.1609/icwsm.v15i1.18116]
[3]   Creating corroborated crisis reports from social media data through formal concept analysis [J].
Andrews, Simon ;
Gibson, Helen ;
Domdouzis, Konstantinos ;
Akhgar, Babak .
JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2016, 47 (02) :287-312
[4]   A Graph is Worth a Thousand Words: Telling Event Stories using Timeline Summarization Graphs [J].
Ansah, Jeffery ;
Liu, Lin ;
Kang, Wei ;
Kwashie, Selasi ;
Liu, Jixue ;
Li, Jiuyong .
WEB CONFERENCE 2019: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2019), 2019, :2565-2571
[5]  
Bhattacharya Paheli, 2021, ICAIL '21: Proceedings of the Eighteenth International Conference on Artificial Intelligence and Law, P22, DOI 10.1145/3462757.3466092
[6]  
Bhattacharya P, 2019, LECT NOTES COMPUT SC, V11437, P413, DOI 10.1007/978-3-030-15712-8_27
[7]  
Castillo C, 2016, BIG CRISIS DATA: SOCIAL MEDIA IN DISASTERS AND TIME-CRITICAL SITUATIONS
[8]  
Chakraborty R, 2022, Arxiv, DOI arXiv:2201.07472
[9]   Tweet Summarization of News Articles: An Objective Ordering-Based Perspective [J].
Chakraborty, Roshni ;
Bhavsar, Maitry ;
Dandapat, Sourav Kumar ;
Chandra, Joydeep .
IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2019, 6 (04) :761-777
[10]   A Network Based Stratification Approach for Summarizing Relevant Comment Tweets of News Articles [J].
Chakraborty, Roshni ;
Bhavsar, Maitry ;
Dandapat, Sourav ;
Chandra, Joydeep .
WEB INFORMATION SYSTEMS ENGINEERING, WISE 2017, PT I, 2017, 10569 :33-48