Related Blogs' Summarization With Natural Language Processing

Cited by: 0
Authors
Baliyan, Niyati [1]
Sharma, Aarti [1]
Affiliations
[1] Indira Gandhi Delhi Tech Univ Women, Dept Informat Technol, Church Rd, New Delhi 110006, India
Keywords
topic modelling; tokenization; stop words; stemming; vectorization; summarization
DOI
10.1093/comjnl/bxaa110
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Subject classification code
0812
Abstract
There is a plethora of information on the web about any given topic, in different forms such as blogs, articles and websites. However, not all of it is useful. Going through all of this information to understand a topic is tiresome and time-consuming, and we often end up reading content that we later realize was not important to us. Because humans have limited capacity to absorb vast quantities of information, relevant and crisp summaries are always desirable. Therefore, in this paper, we focus on generating a new blog entry containing the summary of multiple blogs on the same topic. Different approaches to clustering, modelling, content generation and summarization are applied to reach this goal. The system also eliminates repetitive content, saving time and reducing volume, thereby making learning more comfortable and effective. Overall, the proposed novel methodology yields a significant reduction in the number of words in the new blog generated by the system.
Pages: 347-357
Page count: 11