Using Unsupervised Deep Learning for Automatic Summarization of Arabic Documents

Cited by: 17
Authors
Alami, Nabil [1]
En-nahnahi, Noureddine [1]
Ouatik, Said Alaoui [1]
Meknassi, Mohammed [1]
Affiliations
[1] Sidi Mohamed Ben Abdellah Univ, Fac Sci Dhar EL Mahraz, LIM, Fes, Morocco
Keywords
Arabic text summarization; Deep learning; Unsupervised feature learning; Variational auto-encoder; Graph-based summarization; Query-based summarization; RECOGNITION; CLASSIFICATION; ALGORITHM;
DOI
10.1007/s13369-018-3198-y
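Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification code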
07 ; 0710 ; 09 ;
Abstract
Traditional Arabic text summarization (ATS) systems are based on the bag-of-words representation, which produces sparse, high-dimensional input data. Dimensionality reduction is therefore needed to increase the discriminative power of the features. In this paper, we present a new method for ATS that uses a variational auto-encoder (VAE) model to learn a feature space from high-dimensional input data. We explore several input representations, including term frequency (tf), tf-idf, and both local and global vocabularies. All sentences are ranked according to the latent representation produced by the VAE. We investigate the impact of using the VAE with two summarization approaches: graph-based and query-based. Experiments on two benchmark datasets specifically designed for ATS show that the VAE using the tf-idf representation of global vocabularies clearly provides a more discriminative feature space and improves the recall of other models. Experimental results confirm that the proposed method performs better than most state-of-the-art extractive summarization approaches for both graph-based and query-based summarization.
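As a rough illustration of the pipeline the abstract describes, the following minimal sketch builds tf-idf vectors for tokenized sentences and ranks them with a simple graph-based score (the sum of each sentence's cosine similarities to all other sentences). It is not the authors' implementation: the function names, the whitespace-style pre-tokenized input, and the degree-style centrality score are all assumptions made for illustration, and the VAE step is omitted entirely, so sentences are compared in the raw tf-idf space rather than in a learned latent space.

```python
import math
from collections import Counter


def tfidf_vectors(sentences):
    """Return one sparse tf-idf dict per tokenized sentence (hypothetical helper)."""
    n = len(sentences)
    # Document frequency: number of sentences in which each term occurs.
    df = Counter(term for sent in sentences for term in set(sent))
    vectors = []
    for sent in sentences:
        tf = Counter(sent)
        vectors.append(
            {t: (c / len(sent)) * math.log(n / df[t]) for t, c in tf.items()}
        )
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def rank_sentences(sentences):
    """Return sentence indices ordered from most to least central.

    Assumption: centrality is the sum of similarities to all other sentences.
    In the paper's setting the vectors would be VAE latent codes instead.
    """
    vecs = tfidf_vectors(sentences)
    scores = [
        sum(cosine(u, v) for j, v in enumerate(vecs) if j != i)
        for i, u in enumerate(vecs)
    ]
    return sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)


if __name__ == "__main__":
    docs = [
        ["summarization", "selects", "important", "sentences"],
        ["graph", "ranking", "selects", "important", "sentences"],
        ["weather", "is", "sunny"],
    ]
    print(rank_sentences(docs))
```

In this toy input the third sentence shares no terms with the others, so it receives a score of zero and is ranked last. A query-based variant could, under the same assumptions, score each sentence by its cosine similarity to a query vector instead of to the other sentences.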
Pages: 7803-7815
Number of pages: 13