Audio-Based Hate Speech Classification from Online Short-Form Videos

被引：4

作者：

Ibanez, Michael ^{[1
]}

Sapinit, Ranz ^{[1
]}

Reyes, Lloyd Antonie ^{[1
]}

Hussien, Mohammed ^{[1
]}

Imperial, Joseph Marvin ^{[1
]}

Rodriguez, Ramon ^{[1
]}

机构：

[1] Natl Univ, Manila, Philippines

来源：

2021 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP) | 2021年

关键词：

hate speech; tiktok; audio classification; machine learning; speech processing;

D O I：

10.1109/IALP54817.2021.9675250

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this study, we pioneer the development of an audio-based hate speech classifier from online, short-form TikTok videos using traditional machine learning algorithms such as Logistic Regression, Random Forest, and Support Vector Machines. We scraped over 4746 videos using the TikTok API tool and extracted audio-based features such as MFCCs, Spectral Centroid, Rolloff, Bandwidth, Zero-Crossing Rate, and Chroma values as primary feature sets. Results show that using the extracted predictors for hate speech detection can obtain up to 78.5% accuracy on an optimized Random Forest model, crossing the 50% benchmark for models in this task. In addition, comparing the Information Gain scores and globally learned model weights identified that Spectral Rolloff and MFCCs are top predictors in discriminating hate speech for the Filipino language.

引用

页码：72 / 77

页数：6

共 47 条

[41] Is Speech the New Blood? Recent Progress in AI-Based Disease Detection From Audio in a Nutshell
Milling, Manuel
Pokorny, Florian B. B.
Bartl-Pokorny, Katrin D. D.
Schuller, Bjorn W.
FRONTIERS IN DIGITAL HEALTH, 2022, 4
[42] Machine Learning-Based Detection and Classification of Neurodevelopmental Disorders from Speech Patterns
Mouad, El Omari
Hanae, Belmajdoub
Khalid, Minaoui
ENGINEERING APPLICATIONS OF NEURAL NETWORKS, EANN 2024, 2024, 2141 : 235 - 246
[43] A Machine Learning Approach to Recognize Speakers Region of the United Kingdom from Continuous Speech Based on Accent Classification
Hossain, Md Fahad
Hasan, Md Mehedi
Ali, Hasmot
Sarker, Md Rahmatul Kabir Rasel
Hassan, Md Toukirul
PROCEEDINGS OF 2020 11TH INTERNATIONAL CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (ICECE), 2020, : 210 - 213
[44] <bold>ARTICULATORY FEATURE-BASED METHODS FOR ACOUSTIC AND AUDIO-VISUAL SPEECH RECOGNITION: SUMMARY FROM THE 2006 JHU SUMMERWORKSHOP</bold>
Livescu, Karen
Cetin, Oezguer
Hasegawa-Johnson, Mark
King, Simon
Bartels, Chris
Borges, Nash
Kantor, Arthur
Lal, Partha
Yung, Lisa
Bezman, Ari
Dawson-Haggerty, Stephen
Woods, Bronwyn
Frankel, Joe
Magimai-Doss, Mathew
Saenko, Kate
2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 621 - +
[45] Machine learning-based classification of Parkinson's disease using acoustic features: Insights from multilingual speech tasks
Department of AI & Informatics, Graduate School, Sangmyung University, Hongjimun 2-Gil 20, Jongno-gu, Seoul
03016, Korea, Republic of
不详
07061, Korea, Republic of
不详
03016, Korea, Republic of
不详
03080, Korea, Republic of
Comput. Biol. Med.,
[46] Sarcastic user behavior classification and prediction from social media data using firebug swarm optimization-based long short-term memory
E. Karthik
T. Sethukarasi
The Journal of Supercomputing, 2022, 78 : 5333 - 5357
[47] Sarcastic user behavior classification and prediction from social media data using firebug swarm optimization-based long short-term memory
Karthik, E.
Sethukarasi, T.
JOURNAL OF SUPERCOMPUTING, 2022, 78 (04) : 5333 - 5357

← 1 2 3 4 5 →