Audio-Based Hate Speech Classification from Online Short-Form Videos

被引：4

作者：

Ibanez, Michael ^{[1
]}

Sapinit, Ranz ^{[1
]}

Reyes, Lloyd Antonie ^{[1
]}

Hussien, Mohammed ^{[1
]}

Imperial, Joseph Marvin ^{[1
]}

Rodriguez, Ramon ^{[1
]}

机构：

[1] Natl Univ, Manila, Philippines

来源：

2021 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP) | 2021年

关键词：

hate speech; tiktok; audio classification; machine learning; speech processing;

D O I：

10.1109/IALP54817.2021.9675250

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this study, we pioneer the development of an audio-based hate speech classifier from online, short-form TikTok videos using traditional machine learning algorithms such as Logistic Regression, Random Forest, and Support Vector Machines. We scraped over 4746 videos using the TikTok API tool and extracted audio-based features such as MFCCs, Spectral Centroid, Rolloff, Bandwidth, Zero-Crossing Rate, and Chroma values as primary feature sets. Results show that using the extracted predictors for hate speech detection can obtain up to 78.5% accuracy on an optimized Random Forest model, crossing the 50% benchmark for models in this task. In addition, comparing the Information Gain scores and globally learned model weights identified that Spectral Rolloff and MFCCs are top predictors in discriminating hate speech for the Filipino language.

引用

页码：72 / 77

页数：6

共 47 条

[31] Hate speech in the Internet context: Unpacking the roles of Internet penetration, online legal regulation, and online opinion polarization from a transnational perspective
Liu, Zikun
Luo, Chen
Lu, Jia
INFORMATION DEVELOPMENT, 2024, 40 (04) : 533 - 549
[32] Un-Compromised Credibility: Social Media Based Multi-Class Hate Speech Classification for Text
Qureshi, Khubaib Ahmed
Sabih, Muhammad
IEEE ACCESS, 2021, 9 : 109465 - 109477
[33] Gender-Based Hate Speech: Contributions to the Global Policy Debate From Latin America
Godinez, Paulina
Rico, Stephanie
Sarikakis, Katharine
INTERNATIONAL JOURNAL OF COMMUNICATION, 2022, 16 : 4758 - 4778
[34] Federated-Learning Topic Modeling Based Text Classification Regarding Hate Speech During COVID-19 Pandemic
Kamran, Muhammad
Saeed, Ammar
Almaghthawi, Ahmed
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (11) : 560 - 567
[35] The Role of Empathy in Reducing Hate Speech Proliferation. Two Contact-Based Interventions in Online and Off-line Settings
Soral, Wiktor
Malinowska, Katarzyna
Bilewicz, Michal
PEACE AND CONFLICT-JOURNAL OF PEACE PSYCHOLOGY, 2022, 28 (03) : 361 - 371
[36] An attention Long Short-Term Memory based system for automatic classification of speech intelligibility
Fernandez-Diaz, Miguel
Gallardo-Antolin, Ascension
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2020, 96
[37] AI-based removal of hate speech from digital social networks: chances and risks for freedom of expression
Frank Dietrich
AI and Ethics, 2025, 5 (3): : 2943 - 2953
[38] Targets of Online Hate Speech in Context. A Comparative Digital Social Science Analysis of Comments on Public Facebook Pages from Romania and Hungary
Meza, Radu
Vincze, Hanna Orsolya
Mogos, Andreea
INTERSECTIONS-EAST EUROPEAN JOURNAL OF SOCIETY AND POLITICS, 2018, 4 (04): : 26 - 50
[39] A Text Classification Based Method for Context Extraction from Online Reviews
Zahra Lahlou, Fatima
Mountassir, Asmaa
Benbrahim, Houda
Kassou, Ismail
2013 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS: THEORIES AND APPLICATIONS (SITA), 2013,
[40] QoE Estimation of WebRTC-based Audio-visual Conversations from Facial and Speech Features
Bingol, Gulnaziye
Porcu, Simone
Floris, Alessandro
Atzori, Luigi
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (05)

← 1 2 3 4 5 →