A survey of text summarization and Headline Generation methods in Arabic

被引:0
作者
Shaibani, Arwa [1 ]
Elnagar, Ashraf [1 ]
机构
[1] Univ Sharjah, Dept Comp Sci, Sharjah, U Arab Emirates
来源
PROCEEDINGS OF THE 2024 9TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING TECHNOLOGIES, ICMLT 2024 | 2024年
关键词
DEEP LEARNING; HEADLINES GENERATION; EVALUATION METRICS;
D O I
10.1145/3674029.3674078
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
HEADLINE GENERATION IS CONSIDERED ONE OF THE CHALLENGING TASKS IN TEXT SUMMARIZATION. THE INCREASING NUMBER OF SUCCESSFUL MODELS IN THE TEXT SUMMARIZATION FIELD HAS SHOWN THE ABILITY TO AUTOMATICALLY GENERATE INFORMATIVE HEADLINES FOR ARTICLES INSTEAD OF HAVING CATCHY, MISLEADING HEADLINES WRITTEN BY JOURNALISTS. HOWEVER, MOST OF THE CURRENT WORK IN HEADLINE GENERATION FOCUSES ON GENERATING ENGLISH HEADLINES ONLY, WITH VERY FEW PAPERS PROPOSING ARABIC HEADLINE GENERATION AND USING SMALL-SIZED DATASETS COMPARED TO ENGLISH DATASETS. THIS PAPER DESCRIBES A LITERATURE SURVEY OF TEXT SUMMARIZATION AND HEADLINE GENERATION METHODS IN ARABIC. SUMMARIZATION SYSTEMS, IN GENERAL, CAN BE MONOLINGUAL OR MULTILINGUAL SYSTEMS WHERE SUMMARIZATION METHODS CAN BE CLASSIFIED ACCORDING TO SEVERAL CATEGORIES, SUCH AS LINGUISTIC OR STATISTICAL AND EXTRACTIVE OR ABSTRACTIVE. THEREFORE, WE WILL HIGHLIGHT THE AVAILABLE RESEARCH AND APPROACHES USED IN ARABIC. WE WILL DISCUSS THE APPROACH ARCHITECTURE, USED DATASETS, EVALUATION METRICS, AND THE RESULTS FOR EACH APPROACH. FINALLY, IN CONCLUSION, WE WILL DISCUSS THE POSSIBLE FUTURE WORK FOR HEADLINE GENERATION IN ARABIC.
引用
收藏
页码:317 / 323
页数:7
相关论文
共 31 条
  • [1] Al-Debsi Ridhwan, 2019, Nadia: News articles dataset in arabic for multi-label text categorization
  • [2] Al-Radaideh Qasem A., 2018, A hybrid approach for arabic text summarization using domain knowledge and genetic algorithms, V10, P4
  • [3] Alami Nabil, 2018, Using unsupervised deep learning for automatic summarization of Arabic documents, V43, P12
  • [4] Alhagri Mazen A., 2015, Saudi Newspapers Arabic Corpus (SaudiNewsNet)
  • [5] Alotaiby Fahad, 2012, New approaches to automatic headline generation for Arabic documents, V3, P1
  • [6] Alshanqiti Abdullah, 2021, Leveraging DistilBERT for summarizing Arabic text: an extractive dual-stage approach, V9
  • [7] Azmi Aqil M., 2018, An abstractive Arabic text summarizer with user controlled granularity, V54, P6
  • [8] Chouigui Amina, 2017, ANT corpus: an Arabic news text collection for textual classification
  • [9] Einea Omar, 2019, SANAD: Single-Label Arabic News Articles Dataset for Automatic Text Categorization
  • [10] El-Kassas Wafaa S., 2021, Automatic text summarization: A comprehensive survey, V165