An audio-based anger detection algorithm using a hybrid artificial neural network and fuzzy logic model

Cited by: 0
Authors
Arihant Surana
Manish Rathod
Shilpa Gite
Shruti Patil
Ketan Kotecha
Ganeshsree Selvachandran
Shio Gai Quek
Ajith Abraham
Affiliations
[1] Symbiosis Institute of Technology, Symbiosis International (Deemed University)
[2] Symbiosis Centre for Applied Artificial Intelligence, School of Business
[3] Symbiosis International (Deemed University), Institute of Actuarial Science and Data Analytics
[4] Monash University Malaysia, School of Computer Science Engineering & Technology
[5] UCSI University
[6] Bennett University
Source
Multimedia Tools and Applications | 2024 / Vol. 83
Keywords
Audio Emotion Recognition; Variable Audio Sources; Audio Classification; ANN; Fuzzy Logic;
DOI
Not available
Abstract
Audio Emotion Recognition (AER) is an important component of human emotion analysis, with or without accompanying visual cues. Audio carries several modulating parameters, such as rhythm, tone, and pitch. Emotions, however, are highly complex; humans instantly recognize the emotion conveyed in a voice, an ability refined over thousands of years of human evolution. Artificial intelligence (AI)-enabled AER has captured worldwide attention in the last couple of years and has gained importance among AI researchers in various fields. Its importance has grown especially since the start of the Covid-19 pandemic, when large-scale lockdowns and movement control orders around the world led to working from home, online schooling, and online learning on a mass scale. The audio quality on online platforms differs from device to device and depends on the quality and bandwidth of the Internet connection used. As the world recovers from the Covid-19 pandemic, an algorithm for anger detection can help maintain public security and general safety, and can also help in the early detection of mental health or anger management issues. The presence of an angry person in public can pose a threat to the people nearby and a risk of damage to public property. Detecting anger through voices in public places can therefore serve as a first line of defense against public nuisance or even violent crime. Moreover, the more pronounced a person's anger, the more attention public security forces must give to that person.
This study uses audio files from the CREMA-D dataset as input: 364 audio files from 91 actors, each of whom delivers three degrees of anger and one neutral reading. All audio files in this collection use the script “It’s eleven o’clock”. A hybrid algorithm combining an artificial neural network (ANN) with fuzzy logic is introduced, along with a preprocessing technique dedicated to audio files. The results are discussed and analyzed in detail, and the proposed algorithm is compared with the other audio classification algorithms in the literature, many of which merely deploy a ready-made, general-purpose neural network. This brute-force reliance on an overly complicated computational structure proves inefficient, as the number of nodes involved in the computation far surpasses the number of preprocessed inputs. In addition, the descriptions of preprocessing procedures for audio classification in recent works are found to be unclear. Finally, the limitations of the experimental setup, suggestions for improving it, and the potential applications of the findings are discussed in the conclusion of this study.
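The data selection described above (91 actors, three anger intensities plus one neutral reading, one sentence) can be sketched as a filename filter. This is an illustrative sketch only, not the authors' preprocessing: it assumes the commonly distributed CREMA-D naming scheme `ActorID_Sentence_Emotion_Level.wav`, with `IEO` for "It's eleven o'clock", `ANG`/`NEU` as emotion codes, and `LO`/`MD`/`HI`/`XX` as intensity codes. The function names and the 0 to 3 label mapping are hypothetical and should be checked against the dataset's own documentation.

```python
from collections import Counter

# Assumed label mapping (hypothetical): 0 = neutral, 1-3 = rising anger intensity.
ANGER_LEVELS = {"LO": 1, "MD": 2, "HI": 3}


def parse_crema_d_name(filename):
    """Split 'ActorID_Sentence_Emotion_Level.wav' into its four fields.

    Returns None when the name does not have exactly four underscore-separated parts.
    """
    stem = filename.rsplit(".", 1)[0]
    parts = stem.split("_")
    if len(parts) != 4:
        return None
    actor, sentence, emotion, level = parts
    return {"actor": actor, "sentence": sentence, "emotion": emotion, "level": level}


def select_anger_subset(filenames, sentence="IEO"):
    """Keep neutral clips and LO/MD/HI anger clips of one sentence.

    Returns a list of (filename, label) pairs using the assumed mapping above.
    """
    selected = []
    for name in filenames:
        fields = parse_crema_d_name(name)
        if fields is None or fields["sentence"] != sentence:
            continue
        if fields["emotion"] == "NEU":
            selected.append((name, 0))
        elif fields["emotion"] == "ANG" and fields["level"] in ANGER_LEVELS:
            selected.append((name, ANGER_LEVELS[fields["level"]]))
    return selected


def label_counts(subset):
    """Count how many clips fall under each label."""
    return Counter(label for _, label in subset)
```

With four clips kept per actor, 91 actors yield 91 × 4 = 364 files, which matches the count stated in the abstract.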
Pages: 38909-38929
Page count: 20