Automatic Summarization of the Arabic Documents using NMF: A Preliminary Study

Cited by: 0
Authors
Mohamed, A. A. [1 ]
Affiliations
[1] Prince Sattam bin Abdulaziz Univ, Al Kharj, Saudi Arabia
Source
PROCEEDINGS OF 2016 11TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING & SYSTEMS (ICCES) | 2016
Keywords
Arabic Text Summarization; Text Mining; Information Retrieving; Natural Language Processing (NLP); Non-negative Matrix Factorization (NMF); TEXT;
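DOI
Not available
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812 ;
Abstract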
The exponential growth of the Internet has produced a huge number of documents online. Finding the desired documents among these resources is a difficult task. This problem is known as "Information Overloading". Automatic Text Summarization (ATS) techniques try to solve this problem by extracting the essential sentences that cover most of the main issues in the document, so that the user spends less time and effort identifying the main ideas of the document. Research in this field for the Arabic language is relatively new compared with the available research for English. This paper presents a preliminary study that investigates the effectiveness of using the Non-negative Matrix Factorization (NMF) algorithm to summarize Arabic documents. The author manually built an Arabic corpus of 150 documents and conducted extensive experiments using different sentence scoring algorithms and term weighting schemes. The performance of the proposed algorithm has been measured, and the experiments have shown that the NMF algorithm yields promising results.
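The abstract does not say which sentence scoring algorithm or term weighting scheme performed best, so the following is only a minimal sketch of how NMF-based extractive summarization can work in general, not the paper's implementation. It assumes whitespace tokenization, raw term frequency as the weighting scheme, multiplicative-update NMF, and a sentence score that weights each row of H by its share of the total mass. The function names (`term_sentence_matrix`, `nmf`, `summarize`) and the parameters `r` (number of latent topics) and `k` (summary length) are hypothetical.

```python
import random

def term_sentence_matrix(sentences):
    """Build a raw term-frequency matrix A (terms x sentences). Assumed weighting."""
    tokenized = [s.lower().split() for s in sentences]
    vocab = sorted({t for toks in tokenized for t in toks})
    index = {t: i for i, t in enumerate(vocab)}
    A = [[0.0] * len(sentences) for _ in vocab]
    for j, toks in enumerate(tokenized):
        for t in toks:
            A[index[t]][j] += 1.0
    return A

def transpose(X):
    return [list(col) for col in zip(*X)]

def matmul(X, Y):
    Yt = transpose(Y)
    return [[sum(a * b for a, b in zip(row, col)) for col in Yt] for row in X]

def nmf(A, r, iters=200, seed=0, eps=1e-9):
    """Factor A (m x n) into non-negative W (m x r) and H (r x n)
    using multiplicative updates for the Frobenius-norm objective."""
    rng = random.Random(seed)
    m, n = len(A), len(A[0])
    W = [[rng.random() + eps for _ in range(r)] for _ in range(m)]
    H = [[rng.random() + eps for _ in range(n)] for _ in range(r)]
    for _ in range(iters):
        # H <- H * (W^T A) / (W^T W H)
        Wt = transpose(W)
        num, den = matmul(Wt, A), matmul(matmul(Wt, W), H)
        H = [[H[i][j] * num[i][j] / (den[i][j] + eps) for j in range(n)]
             for i in range(r)]
        # W <- W * (A H^T) / (W H H^T)
        Ht = transpose(H)
        num, den = matmul(A, Ht), matmul(W, matmul(H, Ht))
        W = [[W[i][c] * num[i][c] / (den[i][c] + eps) for c in range(r)]
             for i in range(m)]
    return W, H

def summarize(sentences, r=2, k=2):
    """Return the k highest-scoring sentences in their original order."""
    A = term_sentence_matrix(sentences)
    _, H = nmf(A, r)
    total = sum(sum(row) for row in H) + 1e-12
    topic_weight = [sum(row) / total for row in H]  # relative weight of each topic
    scores = [sum(topic_weight[i] * H[i][j] for i in range(r))
              for j in range(len(sentences))]
    ranked = sorted(range(len(sentences)), key=lambda j: -scores[j])
    return [sentences[j] for j in sorted(ranked[:k])]

if __name__ == "__main__":
    docs = ["a b c", "a b", "d e f", "d e"]  # toy stand-ins for sentences
    print(summarize(docs, r=2, k=2))
```

Because NMF keeps every entry of W and H non-negative, each column of H can be read as the additive contribution of the latent topics to one sentence, which is what makes a simple weighted sum usable as a sentence score. A real Arabic pipeline would also need language-specific preprocessing such as normalization and stemming, which this sketch omits.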
Pages: 235-240
Number of pages: 6
Related papers
39 in total
  • [21] ArA*summarizer: An Arabic text summarization system based on subtopic segmentation and using an A* algorithm for reduction
    Bahloul, Belahcene
    Aliane, Hassina
    Benmohammed, Mohamed
    EXPERT SYSTEMS, 2020, 37 (02)
  • [22] Karcı summarization: A simple and effective approach for automatic text summarization using Karcı entropy
    Hark, Cengiz
    Karci, Ali
    INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (03)
  • [23] Automatic classification of academic documents using text mining techniques
    Nunez, Haydemar
    Ramos, Esmeralda
    2012 XXXVIII CONFERENCIA LATINOAMERICANA EN INFORMATICA (CLEI), 2012,
  • [24] Automatic Bangla Text Summarization Using Term Frequency and Semantic Similarity Approach
    Sarkar, Avik
    Hossen, Md Sharif
    2018 21ST INTERNATIONAL CONFERENCE OF COMPUTER AND INFORMATION TECHNOLOGY (ICCIT), 2018,
  • [25] Automatic keyphrase annotation of scientific documents using Wikipedia and genetic algorithms
    Joorabchi, Arash
    Mahdi, Abdulhussain E.
    JOURNAL OF INFORMATION SCIENCE, 2013, 39 (03) : 410 - 426
  • [26] Cross Language Information Retrieval Model For Discovering WSDL Documents Using Arabic Language Query
    Sultan, Torkey I.
    Khedr, Ayman E.
    Alsheref, Fahad Kamal
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2013, 4 (08) : 118 - 129
  • [27] Automatic Translation of Continuous and Fixed Arabic Frozen Expressions Using the NooJ Platform
    Kourtin, Asmaa
    Mbarki, Samir
    FORMALIZING NATURAL LANGUAGES: APPLICATIONS TO NATURAL LANGUAGE PROCESSING AND DIGITAL HUMANITIES, NOOJ 2023, 2024, 1816 : 213 - 224
  • [28] Efficient Voting-Based Extractive Automatic Text Summarization Using Prominent Feature Set
    Meena, Yogesh Kumar
    Gopalani, Dinesh
    IETE JOURNAL OF RESEARCH, 2016, 62 (05) : 581 - 590
  • [29] Automatic Keyphrase Extraction from Persian Scientific Documents Using Semantic Relations
    Farahani, Bahare Davoodabadi
    Fatemi, Seied Omid
    Ghorbani, Mohsen
    2019 27TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE 2019), 2019, : 1972 - 1978
  • [30] A Preliminary Study on Semi-automatic Construction of Vietnamese Ontology
    Bao An Nguyen
    Yang, Don-Lin
    2011 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2011, : 3403 - 3408