Audio-Based Hate Speech Classification from Online Short-Form Videos

被引:4
作者
Ibanez, Michael [1 ]
Sapinit, Ranz [1 ]
Reyes, Lloyd Antonie [1 ]
Hussien, Mohammed [1 ]
Imperial, Joseph Marvin [1 ]
Rodriguez, Ramon [1 ]
机构
[1] Natl Univ, Manila, Philippines
来源
2021 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP) | 2021年
关键词
hate speech; tiktok; audio classification; machine learning; speech processing;
D O I
10.1109/IALP54817.2021.9675250
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this study, we pioneer the development of an audio-based hate speech classifier from online, short-form TikTok videos using traditional machine learning algorithms such as Logistic Regression, Random Forest, and Support Vector Machines. We scraped over 4746 videos using the TikTok API tool and extracted audio-based features such as MFCCs, Spectral Centroid, Rolloff, Bandwidth, Zero-Crossing Rate, and Chroma values as primary feature sets. Results show that using the extracted predictors for hate speech detection can obtain up to 78.5% accuracy on an optimized Random Forest model, crossing the 50% benchmark for models in this task. In addition, comparing the Information Gain scores and globally learned model weights identified that Spectral Rolloff and MFCCs are top predictors in discriminating hate speech for the Filipino language.
引用
收藏
页码:72 / 77
页数:6
相关论文
共 47 条
  • [21] Enhanced Seagull Optimization with Natural Language Processing Based Hate Speech Detection and Classification
    Halawani, Hanan T.
    Alghamdi, Hanan M.
    Hamza, Saadia Hassan Abdalaha
    Abdel-Khalek, Sayed
    Mansour, Romany F.
    APPLIED SCIENCES-BASEL, 2022, 12 (16):
  • [22] Handling Imbalance Issue in Hate Speech Classification using Sampling-based Methods
    Rathpisey, Heng
    Adji, Teguh Bharata
    2019 5TH INTERNATIONAL CONFERENCE ON SCIENCE ININFORMATION TECHNOLOGY (ICSITECH): EMBRACING INDUSTRY 4.0 - TOWARDS INNOVATION IN CYBER PHYSICAL SYSTEM, 2019, : 193 - 198
  • [23] "Short is the Road that Leads from Fear to Hate": Fear Speech in Indian WhatsApp Groups
    Saha, Punyajoy
    Mathew, Binny
    Garimella, Kiran
    Mukherjee, Animesh
    PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2021 (WWW 2021), 2021, : 1110 - 1121
  • [24] 'I want to record and share my wonderful journey': Chinese Millennials' production and sharing of short-form travel videos on TikTok or Douyin
    Du, Xin
    Liechty, Toni
    Santos, Carla A.
    Park, Jeongeun
    CURRENT ISSUES IN TOURISM, 2022, 25 (21) : 3412 - 3424
  • [26] Real-World Speech/Non-Speech Audio Classification Based on Sparse Representation Features and GPCs
    Shi, Ziqiang
    Han, Jiqing
    Zheng, Tieran
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2412 - 2415
  • [27] Learning from online hate speech and digital racism: From automated to diffractive methods in social media analysis
    Giraud, Eva Haifa
    Poole, Elizabeth
    de Quincey, Ed
    Richardson, John E.
    SOCIOLOGICAL REVIEW, 2025,
  • [28] MetaHate: AI-based hate speech detection for secured online gaming in metaverse using blockchain
    Sanghvi, Harshil
    Bhavsar, Rushir
    Hundlani, Vini
    Gohil, Lata
    Vyas, Tarjni
    Nair, Anuja
    Desai, Shivani
    Jadav, Nilesh Kumar
    Tanwar, Sudeep
    Sharma, Ravi
    Yamsani, Nagendar
    SECURITY AND PRIVACY, 2024, 7 (02)
  • [29] From prejudice to marginalization: Tracing the forms of online hate speech targeting LGBTQ plus and Muslim communities
    Unlu, Ali
    Truong, Sophie
    Sawhney, Nitin
    Tammi, Tuukka
    Kotonen, Tommi
    NEW MEDIA & SOCIETY, 2025,
  • [30] The impact of content characteristics of Short-Form video ads on consumer purchase Behavior: Evidence from TikTok
    Meng, Lu
    Kou, Sining
    Duan, Shen
    Bie, Yongyue
    JOURNAL OF BUSINESS RESEARCH, 2024, 183