An audio-based anger detection algorithm using a hybrid artificial neural network and fuzzy logic model

被引:0
|
作者
Arihant Surana
Manish Rathod
Shilpa Gite
Shruti Patil
Ketan Kotecha
Ganeshsree Selvachandran
Shio Gai Quek
Ajith Abraham
机构
[1] Symbiosis Institute of Technology,Symbiosis International (Deemed University)
[2] Symbiosis Centre for Applied Artificial Intelligence,School of Business
[3] Symbiosis International (Deemed University),Institute of Actuarial Science and Data Analytics
[4] Monash University Malaysia,School of Computer Science Engineering & Technology
[5] UCSI University,undefined
[6] Bennett University,undefined
来源
Multimedia Tools and Applications | 2024年 / 83卷
关键词
Audio Emotion Recognition; Variable Audio Sources; Audio Classification; ANN; Fuzzy Logic;
D O I
暂无
中图分类号
学科分类号
摘要
Audio Emotion Recognition (AER) is an important factor for Human Emotion Analysis with or without any visual aiding components. Such audio has different modular parameters, such as rhythm, tone, and pitch. However, emotions are highly complex, and the way they get delivered to human ears with preconceived emotions are then instantly understood by humans, and this is something that has been perfected after thousands of years of human evolution. Artificial intelligence (AI) enabled AER has captured worldwide attention in the last couple of years and has gained increasing importance amongst AI researchers in various fields. It has become increasingly important in recent years, especially after the start of the Covid-19 pandemic that has resulted in work from home, online schooling, and online learning on a mass scale due to large-scale lockdowns and movement control orders around the world. The audio quality on online platforms differs from device to device and is dependent on the quality or the bandwidth of the Internet connection used in such applications. Therefore, as the world is recovering from the Covid-19 pandemic, an algorithm for anger detection proves necessary in maintaining public security and general safety and can also help in the early detection of mental health issues or anger management issues. This is because the presence of an angry person in public can pose a threat to the people around and may also impose a risk of damage to public property. As a result, detecting the presence of anger emotion through voices in all public places proves to be the first line of defense against any outbreaks of public nuisance or even violent crimes. Moreover, the more prominent the anger emotion of a person, the more amount of attention must be given to the person by the public security forces. This study uses a collection of audio files from the CREMA-D dataset as the input, where a collection of 364 audio files from 91 actors, each with three degrees of showing anger and a neutral emotion were used. All audio files in this collection use the script “It’s eleven o’clock”. A hybrid algorithm of artificial neural network (ANN) and fuzzy logic, along with a dedicated preprocessing technique specifically for handling audio files were introduced. A comprehensive discussion and analysis of the results was presented in which the proposed algorithm was compared with all the other audio classification algorithms that exist in literature, many of which merely deployed a readily made general purpose neural network-based algorithm. This brute force method of relying on an overly complicated computational structure proves too low in efficiency as the number of nodes involved in the computational process far surpasses the number of preprocessed inputs. On top of this, descriptions about preprocessing procedures for audio classification among all recent works are found to be unclear. Finally, the limitations and suggestions for improvements of the experimental setup, and the potential applications of the findings are also discussed and analyzed in the conclusion of this study.
引用
收藏
页码:38909 / 38929
页数:20
相关论文
共 50 条
  • [1] An audio-based anger detection algorithm using a hybrid artificial neural network and fuzzy logic model
    Surana, Arihant
    Rathod, Manish
    Gite, Shilpa
    Patil, Shruti
    Kotecha, Ketan
    Selvachandran, Ganeshsree
    Quek, Shio Gai
    Abraham, Ajith
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (13) : 38909 - 38929
  • [2] Audio-based snore detection using deep neural networks
    Xie, Jiali
    Aubert, Xavier
    Long, Xi
    Dijk, Johannes van
    Arsenali, Bruno
    Fonseca, Pedro
    Overeem, Sebastiaan
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2021, 200 (200)
  • [3] Event detection in an audio-based sensor network
    Smeaton, Alan F.
    McHugh, Michael
    MULTIMEDIA SYSTEMS, 2006, 12 (03) : 179 - 194
  • [4] Event detection in an audio-based sensor network
    Alan F. Smeaton
    Michael McHugh
    Multimedia Systems, 2006, 12 : 179 - 194
  • [5] Fuzzy logic and Artificial Neural Network approaches in odor detection
    Meegahapola, Lasantha
    Karunadasa, J. P.
    Sandasiri, Kasun
    Tharanga, Damith
    Jayasekara, Dammika
    2006 INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION, 2007, : 92 - 97
  • [6] Detection of Audio-based Emergency Situations using Perception Sensor Network
    Quang Nguyen
    Yun, Sang-Seok
    Choi, JongSuk
    2016 13TH INTERNATIONAL CONFERENCE ON UBIQUITOUS ROBOTS AND AMBIENT INTELLIGENCE (URAI), 2016, : 763 - 766
  • [7] Spam detection using hybrid Artificial Neural Network and Genetic Algorithm
    Arram, Anas
    Mousa, Hisham
    Zainal, Anazida
    2013 13TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS (ISDA), 2013, : 336 - 340
  • [8] MPPT Algorithm Based on Fuzzy Logic and Artificial Neural Network (ANN) for a Hybrid Solar/Wind Power Generation System
    Elaissaoui, Hayat
    Zerouali, Mohammed
    El Ougli, Abdelghani
    Tidhaf, Belkassem
    2020 FOURTH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING IN DATA SCIENCES (ICDS), 2020,
  • [9] Image Enhancement using Artificial Neural Network and Fuzzy Logic
    Narnaware, Shweta
    Khedgaonkar, Roshni
    2015 INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION, EMBEDDED AND COMMUNICATION SYSTEMS (ICIIECS), 2015,
  • [10] Fault Prediction Using Artificial Neural Network and Fuzzy Logic
    Virk, Shafqat M.
    Muhammad, Aslam
    Martinez-Enriquez, A. M.
    PROCEEDINGS OF THE SPECIAL SESSION OF THE SEVENTH MEXICAN INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE - MICAI 2008, 2008, : 149 - +