Automatic Text Summarization in Natural Language Processing

被引:0
作者
Desai, M. R. [1 ]
Gachhinakatti, Bhagyashree [1 ]
Balaganur, Pooja [1 ]
Rajeshwari, Y. [1 ]
Rathod, Laxmi [1 ]
机构
[1] BLDEAs VP Dr PG Halakatti Coll Engn & Technol, Dept Comp Sci & Engn, Vijayapur, India
来源
2021 IEEE INTERNATIONAL CONFERENCE ON MOBILE NETWORKS AND WIRELESS COMMUNICATIONS (ICMNWC) | 2021年
关键词
Natural Language Processing; Text Summarization; Sentence Ranking; Word Frequency; ROUGE;
D O I
10.1109/ICMNWC52512.2021.9688499
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
Text summarization is one of the challenging and engrossing problems in the field of Natural Language Processing. The goal of Text Summarization is to read, understand and extract meaningful information from the unstructured data. The idea behind summarization is to get concise and precise summary from the original document. We used Sentence Ranking method based on Word Frequency to determine the important sentences in original text. The Summarizer uses three features to determine important Sentence in a document. The Term Frequency, similarity with the Title and Sentence score are the features considered. Values are assigned to each word and a sentence in a document and create a score for each word and a sentence. Normalize the scores to avoid raw scores that skew the results. The Scores of top 'N' sentences are taken for summarization. Text cleaning, Sentence tokenization, Word tokenization, Normalized Word frequency table generation, and Summarization are steps used in the Sentence Ranking Method. The method is implemented on WikiHow dataset. The results of proposed model are evaluated with ROUGE toolkit which has three evaluation metrics- Precision, Recall and F-Score.
引用
收藏
页数:5
相关论文
共 18 条
  • [1] [Anonymous], 2020, 2 INT C COMP NETW CO
  • [2] Ansary Md Siam, 2021, Proceedings of 2021 International Conference on Information and Communication Technology for Sustainable Development (ICICT4SD), P11, DOI 10.1109/ICICT4SD50815.2021.9396855
  • [3] Natural Language Processing (NLP) based Text Summarization - A Survey
    Awasthi, Ishitva
    Gupta, Kuntal
    Bhogal, Prabjot Singh
    Anand, Sahejpreet Singh
    Soni, Piyush Kumar
    [J]. PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTATION TECHNOLOGIES (ICICT 2021), 2021, : 1310 - 1317
  • [4] Deep Extractive Text Summarization
    Bhargava, Rupal
    Sharma, Yashvardhan
    [J]. INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND DATA SCIENCE, 2020, 167 : 138 - 146
  • [5] Chen PY, 2018, AAAI CONF ARTIF INTE, P10
  • [6] Issam Kalliath Abdul Rasheed, 2021, ARXIV PREPRINT ARXIV
  • [7] Krishnan D, 2019, PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICCS), P521, DOI [10.1109/ICCS45141.2019.9065651, 10.1109/iccs45141.2019.9065651]
  • [8] Madhuri J., 2019, Proceedings of the 2019 international conference on data science and communication (IconDSC), P1, DOI DOI 10.1109/ICONDSC.2019.8817040
  • [9] Miller Derek, 2019, ARXIVABS190604165
  • [10] Payak Avinash, 2020, 2020 International Conference on Electronics and Sustainable Communication Systems (ICESC). Proceedings, P98, DOI 10.1109/ICESC48915.2020.9155852