Automatic Summarization of the Arabic Documents using NMF: A Preliminary Study

被引：0

作者：

Mohamed, A. A. ^{[1
]}

机构：

[1] Prince Sattam bin Abdulaziz Univ, Al Kharj, Saudi Arabia

来源：

PROCEEDINGS OF 2016 11TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING & SYSTEMS (ICCES) | 2016年

关键词：

Arabic Text Summarization; Text Mining; Information Retrieving; Natural Language Processing (NLP); \on negative Matrix Factorization (NMT); TEXT;

D O I：

暂无

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

The exponential growth of the Internet produces a huge amount of documents online. Finding the desired documents from amongst these huge resources is a difficult task. This problem is known as "Information Overloading". Automatic Text Summarization techniques (ATS) try to solve this problem by extracting the essential sentences that cover most of the main issues in the document. So the user will spend less time and effort to identify the main ideas of the document. Research in this field in the Arabic language is relatively new compared with the available research in English. This paper presents a preliminary study that investigates the effectiveness of using Non negative Matrix Factorization (NMF) algorithm to summarize the Arabic documents. The researcher of the present study has built an Arabic corpus of 150 documents manually and conducted extensive experiments by using different sentences scoringalgorithms and term weighting schemes. The performance of the proposed algorithm has been measured, and the extensive experiments have shown that the NMF algorithm yields promising results.

引用

页码：235 / 240

页数：6

共 39 条

[21] ArA*summarizer: An Arabic text summarization system based on subtopic segmentation and using an A* algorithm for reduction
Bahloul, Belahcene
Aliane, Hassina
Benmohammed, Mohamed
EXPERT SYSTEMS, 2020, 37 (02)
[22] Karc1 summarization: A simple and effective approach for automatic text summarization using Karc1 entropy
Hark, Cengiz
Karci, Ali
INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (03)
[23] Automatic classification of academic documents using text mining techniques
Nunez, Haydemar
Ramos, Esmeralda
2012 XXXVIII CONFERENCIA LATINOAMERICANA EN INFORMATICA (CLEI), 2012,
[24] Automatic Bangla Text Summarization Using Term Frequency and Semantic Similarity Approach
Sarkar, Avik
Hossen, Md Sharif
2018 21ST INTERNATIONAL CONFERENCE OF COMPUTER AND INFORMATION TECHNOLOGY (ICCIT), 2018,
[25] Automatic keyphrase annotation of scientific documents using Wikipedia and genetic algorithms
Joorabchi, Arash
Mahdi, Abdulhussain E.
JOURNAL OF INFORMATION SCIENCE, 2013, 39 (03) : 410 - 426
[26] Cross Language Information Retrieval Model For Discovering WSDL Documents Using Arabic Language Query
Sultan, Torkey I.
Khedr, Ayman E.
Alsheref, Fahad Kamal
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2013, 4 (08) : 118 - 129
[27] Automatic Translation of Continuous and Fixed Arabic Frozen Expressions Using the NooJ Platform
Kourtin, Asmaa
Mbarki, Samir
FORMALIZING NATURAL LANGUAGES: APPLICATIONS TO NATURAL LANGUAGE PROCESSING AND DIGITAL HUMANITIES, NOOJ 2023, 2024, 1816 : 213 - 224
[28] Efficient Voting-Based Extractive Automatic Text Summarization Using Prominent Feature Set
Meena, Yogesh Kumar
Gopalani, Dinesh
IETE JOURNAL OF RESEARCH, 2016, 62 (05) : 581 - 590
[29] Automatic Keyphrase Extraction from Persian Scientific Documents Using Semantic Relations
Farahani, Bahare Davoodabadi
Fatemi, Seied Omid
Ghorbani, Mohsen
2019 27TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE 2019), 2019, : 1972 - 1978
[30] A Preliminary Study on Semi-automatic Construction of Vietnamese Ontology
Bao An Nguyen
Yang, Don-Lin
2011 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2011, : 3403 - 3408

← 1 2 3 4 →