An audio-based anger detection algorithm using a hybrid artificial neural network and fuzzy logic model

被引:0
作者
Arihant Surana
Manish Rathod
Shilpa Gite
Shruti Patil
Ketan Kotecha
Ganeshsree Selvachandran
Shio Gai Quek
Ajith Abraham
机构
[1] Symbiosis Institute of Technology,Symbiosis International (Deemed University)
[2] Symbiosis Centre for Applied Artificial Intelligence,School of Business
[3] Symbiosis International (Deemed University),Institute of Actuarial Science and Data Analytics
[4] Monash University Malaysia,School of Computer Science Engineering & Technology
[5] UCSI University,undefined
[6] Bennett University,undefined
来源
Multimedia Tools and Applications | 2024年 / 83卷
关键词
Audio Emotion Recognition; Variable Audio Sources; Audio Classification; ANN; Fuzzy Logic;
D O I
暂无
中图分类号
学科分类号
摘要
Audio Emotion Recognition (AER) is an important factor for Human Emotion Analysis with or without any visual aiding components. Such audio has different modular parameters, such as rhythm, tone, and pitch. However, emotions are highly complex, and the way they get delivered to human ears with preconceived emotions are then instantly understood by humans, and this is something that has been perfected after thousands of years of human evolution. Artificial intelligence (AI) enabled AER has captured worldwide attention in the last couple of years and has gained increasing importance amongst AI researchers in various fields. It has become increasingly important in recent years, especially after the start of the Covid-19 pandemic that has resulted in work from home, online schooling, and online learning on a mass scale due to large-scale lockdowns and movement control orders around the world. The audio quality on online platforms differs from device to device and is dependent on the quality or the bandwidth of the Internet connection used in such applications. Therefore, as the world is recovering from the Covid-19 pandemic, an algorithm for anger detection proves necessary in maintaining public security and general safety and can also help in the early detection of mental health issues or anger management issues. This is because the presence of an angry person in public can pose a threat to the people around and may also impose a risk of damage to public property. As a result, detecting the presence of anger emotion through voices in all public places proves to be the first line of defense against any outbreaks of public nuisance or even violent crimes. Moreover, the more prominent the anger emotion of a person, the more amount of attention must be given to the person by the public security forces. This study uses a collection of audio files from the CREMA-D dataset as the input, where a collection of 364 audio files from 91 actors, each with three degrees of showing anger and a neutral emotion were used. All audio files in this collection use the script “It’s eleven o’clock”. A hybrid algorithm of artificial neural network (ANN) and fuzzy logic, along with a dedicated preprocessing technique specifically for handling audio files were introduced. A comprehensive discussion and analysis of the results was presented in which the proposed algorithm was compared with all the other audio classification algorithms that exist in literature, many of which merely deployed a readily made general purpose neural network-based algorithm. This brute force method of relying on an overly complicated computational structure proves too low in efficiency as the number of nodes involved in the computational process far surpasses the number of preprocessed inputs. On top of this, descriptions about preprocessing procedures for audio classification among all recent works are found to be unclear. Finally, the limitations and suggestions for improvements of the experimental setup, and the potential applications of the findings are also discussed and analyzed in the conclusion of this study.
引用
收藏
页码:38909 / 38929
页数:20
相关论文
共 50 条
[41]   An edge detection method by combining fuzzy logic and neural network [J].
Wang, R ;
Gao, LQ ;
Yang, S ;
Liu, YC .
Proceedings of 2005 International Conference on Machine Learning and Cybernetics, Vols 1-9, 2005, :4539-4543
[42]   Fuzzy logic and neural network based gender classification using three features [J].
Meena, K. ;
Subramaniam, K. R. ;
Gomathy, M. .
INTERNATIONAL JOURNAL OF SIGNAL AND IMAGING SYSTEMS ENGINEERING, 2014, 7 (02) :75-82
[43]   Performance Analysis of Brain Tumor Detection based on Fuzzy Logic and Neural Network Classifier [J].
Anbumozhi, Selladurai .
CURRENT MEDICAL IMAGING, 2016, 12 (04) :304-312
[44]   Perovskite lattice constant prediction framework using optimized artificial neural network and fuzzy logic models by metaheuristic algorithms [J].
Bouzateur, Inas ;
Ouali, Mohammed Assam ;
Bennacer, Hamza ;
Ladjal, Mohamed ;
Khmaissia, Fadoua ;
Rahman, Mohd Amiruddin Abd ;
Boukortt, Abdelkader .
MATERIALS TODAY COMMUNICATIONS, 2023, 37
[45]   Perovskite lattice constant prediction framework using optimized artificial neural network and fuzzy logic models by metaheuristic algorithms [J].
Bouzateur, Inas ;
Ouali, Mohammed Assam ;
Bennacer, Hamza ;
Ladjal, Mohamed ;
Khmaissia, Fadoua ;
Abd Rahman, Mohd Amiruddin ;
Boukortt, Abdelkader .
MATERIALS TODAY COMMUNICATIONS, 2023, 37
[46]   Edge Detection of Images Using Improved Fuzzy C-Means and Artificial Neural Network Technique [J].
Dhivya, R. ;
Prakash, R. .
JOURNAL OF MEDICAL IMAGING AND HEALTH INFORMATICS, 2019, 9 (06) :1284-1293
[47]   A Comparative Study of Three Artificial Intelligence Techniques: Genetic Algorithm, Neural Network, and Fuzzy Logic, on Scheduling Problem [J].
Ansari, Abdollah ;
Abu Bakar, Azuraliza .
PROCEEDINGS 2014 4TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE WITH APPLICATIONS IN ENGINEERING AND TECHNOLOGY ICAIET 2014, 2014, :31-36
[48]   Applicability of Fuzzy Logic and Artificial Neural Network for Unpaved Airfield Surface Bearing Strength Prediction [J].
Cicmanec, Ludek .
SENSORS, 2021, 21 (10)
[49]   Hybrid Intelligent System for Disease Diagnosis Based on Artificial Neural Networks, Fuzzy Logic, and Genetic Algorithms [J].
Al-Absi, Hamada R. H. ;
Abdullah, Azween ;
Hassan, Mahamat Issa ;
Shaban, Khaled Bashir .
INFORMATICS ENGINEERING AND INFORMATION SCIENCE, PT II, 2011, 252 :128-+
[50]   Image Edge Detection Algorithm Based on Fuzzy Logic [J].
Zhao, Jian .
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTER, NETWORKS AND COMMUNICATION ENGINEERING (ICCNCE 2013), 2013, 30 :530-532