Topic Modeling, Sentiment Analysis and Text Summarization for Analyzing News Headlines and Articles

被引:0
|
作者
Thakur, Omswroop [1 ]
Saritha, Sri Khetwat [1 ]
Jain, Sweta [1 ]
机构
[1] Maulana Azad Natl Inst Technol, Dept CSE, Bhopal 481001, India
关键词
NLP; COVID-19; XLNet; BERTopic; Topic modeling; SOCIAL MEDIA; COVID-19; NETWORKS; SYSTEM; CLASSIFICATION; CORONAVIRUS; TWEETS; NLP;
D O I
10.1007/978-3-031-24352-3_18
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Newspapers and News Websites have become a part and a crucial medium in society. They provide information regarding the events that are happening around and how society is getting influenced by these events. For example, a pandemic like Covid-19 has raised the importance of these mediums. They have been giving detailed news to society on a variety of topics, such as how to detect the strains of the coronavirus, reasons for lockdown along with what are the other restrictions to be followed during the pandemic. They also provided information about the government policies which were built to be taken care of in case of pandemics and so on and they kept updated with the details about the development of the vaccines. Due to this lot of information on Covid-19 is generated. Examining the different topics/themes/issues and the sentiments expressed by different countries will aid in the understanding of the covid-19. This paper discusses the various models which were built to identify the topics, sentiments, and summarization of news headlines and articles regarding Covid-19. The proposed topic model has achieved a Silhouette score of 0.6407036, 0.6645274, 0.6262914, and 0.6234863 for 4 countries like South Korea, Japan, the UK, India on the news articles dataset, and it was found that the United Kingdom was the worst-hit, and it had the largest percentage of negative sentiments. The proposed XlNet sentiment classification model obtained a validation accuracy of 93.75%.
引用
收藏
页码:220 / 239
页数:20
相关论文
共 50 条
  • [11] A Novel Approach to Categorize News Articles From Headlines and Short Text
    Abyaad, Rafid
    Kabir, Md Rayhan
    Hasan, Sakib
    2020 IEEE REGION 10 SYMPOSIUM (TENSYMP) - TECHNOLOGY FOR IMPACTFUL SUSTAINABLE DEVELOPMENT, 2020, : 162 - 165
  • [12] Language Model-Driven Topic Clustering and Summarization for News Articles
    Yang, Peng
    Li, Wenhan
    Zhao, Guangzhen
    IEEE ACCESS, 2019, 7 : 185506 - 185519
  • [13] Ontology-based text summarization for business news articles
    Wu, CW
    Liu, CL
    COMPUTERS AND THEIR APPLICATIONS, 2003, : 389 - 392
  • [14] Sentiment-topic modeling in text mining
    Lin, Chenghua
    Ibeke, Ebuka
    Wyner, Adam
    Guerin, Frank
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2015, 5 (05) : 246 - 254
  • [15] Two-level Text Summarization from Online News Sources with Sentiment Analysis
    Mirani, Tarun B.
    Sasi, Sreela
    2017 INTERNATIONAL CONFERENCE ON NETWORKS & ADVANCES IN COMPUTATIONAL TECHNOLOGIES (NETACT), 2017, : 19 - 24
  • [16] Topic Modeling and Sentiment Analysis of Arabic News Headlines for a Societal Well-Being Scoring and Monitoring System: Moroccan Use Case
    Jannani, Ayoub
    Bouhsissin, Soukaina
    Sael, Nawal
    Benabbou, Faouzia
    IEEE ACCESS, 2025, 13 : 26345 - 26364
  • [17] A dataset for Sentiment analysis of Entities in News headlines (SEN)
    Baraniak, Katarzyna
    Sydow, Marcin
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS (KSE 2021), 2021, 192 : 3627 - 3636
  • [18] Statistical Topic Modeling for Urdu Text Articles
    Rehman, Anwar Ur
    Rehman, Zobia
    Akram, Junaid
    Ali, Waqar
    Shah, Munam Ali
    Salman, Muhammad
    2018 24TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATION AND COMPUTING (ICAC' 18), 2018, : 62 - 67
  • [19] Hierarchical Topic Modeling for Urdu Text Articles
    Rehman, Anwar Ur
    Khan, Ali Haider
    Aftab, Mustansar
    Rehman, Zobia
    Shah, Munam Ali
    2019 25TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATION AND COMPUTING (ICAC), 2019, : 464 - 469
  • [20] Sentiment Analysis on News Articles for Stocks
    Kalyanaraman, Vaanchitha
    Kazi, Sarah
    Tondulkar, Rohan
    Oswal, Sangeeta
    ASIA MODELLING SYMPOSIUM 2014 (AMS 2014), 2014, : 10 - 15