Multimodal Hinglish Tweet Dataset for Deep Pragmatic Analysis

被引:1
作者
Pratibha [1 ]
Kaur, Amandeep [1 ]
Khurana, Meenu [2 ]
Damasevicius, Robertas [3 ]
机构
[1] Chitkara Univ, Inst Engn & Technol, Rajpura 140601, Punjab, India
[2] Chitkara Univ, Sch Engn & Technol, Baddi 173205, Himachal Prades, India
[3] Vytautas Magnus Univ, Dept Appl Informat, LT-53361 Kaunas, Lithuania
关键词
hinglish; pragmatic analysis; sentiment analysis; tweet dataset; SENTIMENT ANALYSIS; MODEL;
D O I
10.3390/data9020038
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Wars, conflicts, and peace efforts have become inherent characteristics of regions, and understanding the prevailing sentiments related to these issues is crucial for finding long-lasting solutions. Twitter/'X', with its vast user base and real-time nature, provides a valuable source to assess the raw emotions and opinions of people regarding war, conflict, and peace. This paper focuses on collecting and curating hinglish tweets specifically related to wars, conflicts, and associated taxonomy. The creation of said dataset addresses the existing gap in contemporary literature, which lacks comprehensive datasets capturing the emotions and sentiments expressed by individuals regarding wars, conflicts, and peace efforts. This dataset holds significant value and application in deep pragmatic analysis as it enables future researchers to identify the flow of sentiments, analyze the information architecture surrounding war, conflict, and peace effects, and delve into the associated psychology in this context. To ensure the dataset's quality and relevance, a meticulous selection process was employed, resulting in the inclusion of explanable 500 carefully chosen search filters. The dataset currently has 10,040 tweets that have been validated with the help of human expert to make sure they are correct and accurate.
引用
收藏
页数:19
相关论文
共 49 条
[1]  
Agarwal N S., 2022, Exploring Public Opinion Dynamics on the Verge of World War III using Russia-Ukraine war-Tweets Dataset"
[2]   Arabic Offensive and Hate Speech Detection Using a Cross-Corpora Multi-Task Learning Model [J].
Aldjanabi, Wassen ;
Dahou, Abdelghani ;
Al-qaness, Mohammed A. A. ;
Abd Elaziz, Mohamed ;
Helmi, Ahmed Mohamed ;
Damasevicius, Robertas .
INFORMATICS-BASEL, 2021, 8 (04)
[3]   Machine learning techniques for emotion detection and sentiment analysis: current state, challenges, and future directions [J].
Alslaity, Alaa ;
Orji, Rita .
BEHAVIOUR & INFORMATION TECHNOLOGY, 2024, 43 (01) :139-164
[4]  
Askasnr S., 2012, End of US-Afghan War Tweet Data
[5]   Hindu Nationalism Online: Twitter as Discourse and Interface [J].
Bhatia, Kiran Vinod .
RELIGIONS, 2022, 13 (08)
[6]   Hope speech detection in YouTube comments [J].
Chakravarthi, Bharathi Raja .
SOCIAL NETWORK ANALYSIS AND MINING, 2022, 12 (01)
[7]   State of the art: a review of sentiment analysis based on sequential transfer learning [J].
Chan, Jireh Yi-Le ;
Bea, Khean Thye ;
Leow, Steven Mun Hong ;
Phoong, Seuk Wai ;
Cheng, Wai Khuen .
ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (01) :749-780
[8]  
Chen E, 2023, Arxiv, DOI [arXiv:2203.07488, 10.48550/ARXIV.2203.07488, DOI 10.48550/ARXIV.2203.07488]
[9]   Termite: Visualization Techniques for Assessing Textual Topic Models [J].
Chuang, Jason ;
Manning, Christopher D. ;
Heer, Jeffrey .
PROCEEDINGS OF THE INTERNATIONAL WORKING CONFERENCE ON ADVANCED VISUAL INTERFACES, 2012, :74-77
[10]   Survey on sentiment analysis: evolution of research methods and topics [J].
Cui, Jingfeng ;
Wang, Zhaoxia ;
Ho, Seng-Beng ;
Cambria, Erik .
ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (08) :8469-8510