Deep Learning-Based Short Text Summarization: An Integrated BERT and Transformer Encoder-Decoder Approach

被引:0
作者
Ghanem, Fahd A. [1 ,2 ]
Padma, M. C. [1 ]
Abdulwahab, Hudhaifa M. [3 ]
Alkhatib, Ramez [4 ]
机构
[1] Univ Mysore, PES Coll Engn, Dept Comp Sci & Engn, Mandya 571401, India
[2] Hodeidah Univ, Coll Educ Zabid, Dept Comp Sci, POB 3114, Hodeidah, Yemen
[3] VTU, Ramaiah Inst Technol, Dept Comp Applicat, Bangalore 560054, India
[4] BMB Nord, Res Ctr Borstel, Pk Allee 35, D-23845 Borstel, Germany
关键词
attention mechanism; BERT; deep learning; short text summarization; transformer-based encoder-decoder; Twitter summarization; TWITTER;
D O I
10.3390/computation13040096
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
The field of text summarization has evolved from basic extractive methods that identify key sentences to sophisticated abstractive techniques that generate contextually meaningful summaries. In today's digital landscape, where an immense volume of textual data is produced every day, the need for concise and coherent summaries is more crucial than ever. However, summarizing short texts, particularly from platforms like Twitter, presents unique challenges due to character constraints, informal language, and noise from elements such as hashtags, mentions, and URLs. To overcome these challenges, this paper introduces a deep learning framework for automated short text summarization on Twitter. The proposed approach combines bidirectional encoder representations from transformers (BERT) with a transformer-based encoder-decoder architecture (TEDA), incorporating an attention mechanism to improve contextual understanding. Additionally, long short-term memory (LSTM) networks are integrated within BERT to effectively capture long-range dependencies in tweets and their summaries. This hybrid model ensures that generated summaries remain informative, concise, and contextually relevant while minimizing redundancy. The performance of the proposed framework was assessed using three benchmark Twitter datasets-Hagupit, SHShoot, and Hyderabad Blast-with ROUGE scores serving as the evaluation metric. Experimental results demonstrate that the model surpasses existing approaches in accurately capturing key information from tweets. These findings underscore the framework's effectiveness in automated short text summarization, offering a robust solution for efficiently processing and summarizing large-scale social media content.
引用
收藏
页数:21
相关论文
共 53 条
[51]   State-of-the-art approach to extractive text summarization: a comprehensive review [J].
Yadav, Avaneesh Kumar ;
Ranvijay ;
Yadav, Rama Shankar ;
Maurya, Ashish Kumar .
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (19) :29135-29197
[52]  
Zhang HP, 2023, FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), P10644
[53]  
Zhao Y, 2025, Arxiv, DOI arXiv:2502.19723