Topic Modeling, Sentiment Analysis and Text Summarization for Analyzing News Headlines and Articles

Cited by: 0
Authors
Thakur, Omswroop [1]
Saritha, Sri Khetwat [1]
Jain, Sweta [1]
Affiliations
[1] Maulana Azad Natl Inst Technol, Dept CSE, Bhopal 481001, India
Keywords
NLP; COVID-19; XLNet; BERTopic; Topic modeling; SOCIAL MEDIA; NETWORKS; SYSTEM; CLASSIFICATION; CORONAVIRUS; TWEETS
DOI
10.1007/978-3-031-24352-3_18
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Newspapers and news websites have become a crucial medium in society. They report on current events and on how those events influence society. The Covid-19 pandemic, for example, has raised the importance of these media. They have provided detailed news on a variety of topics, such as how to detect strains of the coronavirus, the reasons for lockdowns, and the other restrictions to be followed during the pandemic. They have also reported on the government policies devised for handling pandemics and kept readers updated on the development of vaccines. As a result, a large amount of information on Covid-19 has been generated. Examining the different topics, themes, and issues, and the sentiments expressed across different countries, will aid in understanding Covid-19. This paper discusses the models built to identify topics, classify sentiment, and summarize news headlines and articles regarding Covid-19. On the news articles dataset, the proposed topic model achieved Silhouette scores of 0.6407036, 0.6645274, 0.6262914, and 0.6234863 for four countries (South Korea, Japan, the UK, and India). The United Kingdom was found to be the worst hit and had the largest percentage of negative sentiment. The proposed XLNet sentiment classification model obtained a validation accuracy of 93.75%.
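For readers unfamiliar with the Silhouette score reported in the abstract, the following is a minimal sketch of how the metric is commonly computed: for each point, a is the mean distance to the other members of its own cluster, b is the smallest mean distance to the members of any other cluster, and the point's coefficient is (b - a) / max(a, b); the score is the mean over all points. The abstract does not describe the paper's embeddings, distance measure, or clustering pipeline, so the Euclidean distance, the function name `silhouette_score`, and the toy points below are all assumptions made purely for illustration.

```python
import math


def silhouette_score(points, labels):
    """Mean silhouette coefficient over all points (Euclidean distance assumed)."""
    # Group point indices by cluster label.
    clusters = {}
    for idx, label in enumerate(labels):
        clusters.setdefault(label, []).append(idx)
    if len(clusters) < 2:
        raise ValueError("the silhouette score needs at least two clusters")

    def mean_dist(i, members):
        # Mean distance from point i to the given members, excluding i itself.
        others = [j for j in members if j != i]
        return sum(math.dist(points[i], points[j]) for j in others) / len(others)

    scores = []
    for i, label in enumerate(labels):
        own = clusters[label]
        if len(own) == 1:
            scores.append(0.0)  # common convention: singleton clusters score 0
            continue
        a = mean_dist(i, own)  # cohesion: mean distance within own cluster
        b = min(mean_dist(i, members)  # separation: nearest other cluster
                for other, members in clusters.items() if other != label)
        denom = max(a, b)
        scores.append((b - a) / denom if denom > 0 else 0.0)
    return sum(scores) / len(scores)


# Hypothetical toy data: two tight, well-separated groups of 2-D points.
toy_points = [(0.0, 0.0), (0.0, 1.0), (10.0, 0.0), (10.0, 1.0)]
good_score = silhouette_score(toy_points, [0, 0, 1, 1])  # sensible grouping
bad_score = silhouette_score(toy_points, [0, 1, 0, 1])   # mixed-up grouping
print(good_score, bad_score)
```

The score ranges from -1 to 1. Values near 1 indicate compact, well-separated clusters, while values near 0 or below indicate overlapping or misassigned clusters, which is the scale on which the reported values of roughly 0.62 to 0.66 can be read.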
Pages: 220-239
Number of pages: 20