Multi-Resolution Feature Extraction Algorithm in Emotional Speech Recognition

Cited by: 2
Authors
Zelenik, Ales [1]
Kacic, Zdravko [2]
Affiliations
[1] NXP Semicond Gratkorn GmbH, A-8101 Gratkorn, Austria
[2] Fac Elect Engn & Comp Sci, Maribor 2000, Slovenia
Keywords
Speech; emotion recognition; segmentation; multi-resolution;
DOI
10.5755/j01.eee.21.5.13328
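Chinese Library Classification (CLC)
TM [Electrical engineering]; TN [Electronic technology, communication technology];
Discipline classification code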
0808 ; 0809 ;
Abstract
This paper presents a new approach to recognizing emotional speech from audio recordings. Choosing the processing window width for feature extraction involves a trade-off between time and frequency resolution, and this choice affects the achievable recognition rate. We define a new procedure that combines the advantages of narrower and wider windows by dynamically adjusting the time and frequency resolution of individual features. To achieve higher recognition rates, two further procedures are added to the multi-resolution feature-extraction concept: the exclusion of features calculated on different processing window widths, and the use of only those parts of recordings with the most explicit emotions. To confirm the benefits of the algorithm, audio recordings from the emotional speech database Interface were evaluated with four different classifiers. The highest emotion recognition rate of the multi-resolution approach exceeded that of the best single-resolution approach by 3.5 %, with an average improvement of 1.5 % in absolute terms.
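The general idea of extracting features at several window widths can be illustrated with a minimal sketch. It is not the authors' algorithm: the window widths (10, 25, 50 ms), the 50 % overlap, the two descriptors (log energy and zero-crossing rate), and all function names are assumptions chosen for illustration, and the feature-exclusion and segment-selection steps mentioned in the abstract are not modeled.

```python
import math
import statistics

def frame_signal(samples, width, hop):
    """Split samples into overlapping frames of `width` samples, advancing by `hop`."""
    return [samples[i:i + width] for i in range(0, len(samples) - width + 1, hop)]

def frame_features(frame):
    """Two simple per-frame descriptors: log energy and zero-crossing rate."""
    energy = sum(x * x for x in frame) / len(frame)
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a < 0) != (b < 0))
    return math.log(energy + 1e-12), crossings / (len(frame) - 1)

def multi_resolution_features(samples, sample_rate, widths_ms=(10, 25, 50)):
    """Concatenate per-recording statistics computed at several window widths."""
    vector = []
    for width_ms in widths_ms:
        width = int(sample_rate * width_ms / 1000)
        hop = max(1, width // 2)  # 50 % overlap (assumed, not from the paper)
        feats = [frame_features(f) for f in frame_signal(samples, width, hop)]
        # Summarize each descriptor over all frames by its mean and spread.
        for column in zip(*feats):
            vector.append(statistics.fmean(column))
            vector.append(statistics.pstdev(column))
    return vector

# A synthetic 0.5 s, 200 Hz tone sampled at 8 kHz stands in for a recording.
sample_rate = 8000
signal = [math.sin(2 * math.pi * 200 * n / sample_rate) for n in range(4000)]
features = multi_resolution_features(signal, sample_rate)
# 3 widths x 2 descriptors x 2 statistics = 12 values
print(len(features))
```

Narrow windows track fast changes in the signal, while wide windows average over more samples and resolve frequency content more finely. Concatenating both lets a downstream classifier draw on either resolution per feature.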
Pages: 54-58
Number of pages: 5
Related papers
50 in total
[31]   An electromagnetic interference source imaging algorithm of multi-resolution partitions [J].
Du, Xin ;
Xie, Shuguo ;
Hao, Xuchun ;
Wang, Chao .
Qiangjiguang Yu Lizishu/High Power Laser and Particle Beams, 2015, 27 (10)
[32]   Multi-resolution analysis for region of interest extraction in thermographic, nondestructive evaluation [J].
Ortiz-Jaramillo, B. ;
Fandino-Toro, H. A. ;
Benitez-Restrepo, H. D. ;
Orjuela-Vargas, S. A. ;
Castellanos-Dominguez, G. ;
Philips, W. .
IMAGE PROCESSING: ALGORITHMS AND SYSTEMS X AND PARALLEL PROCESSING FOR IMAGING APPLICATIONS II, 2012, 8295
[33]   On multi-resolution and variable-resolution [J].
Li, ZN .
INFORMATION INTELLIGENCE AND SYSTEMS, VOLS 1-4, 1996, :719-724
[34]   Utilizing oscillator neural networks to realize multi-resolution pattern recognition [J].
Lu, ZD ;
Yan, PF .
8TH INTERNATIONAL CONFERENCE ON NEURAL INFORMATION PROCESSING, VOLS 1-3, PROCEEDING, 2001, :192-196
[35]   Combining Multi-resolution Wavelets with Principal Component Analysis for Face Recognition [J].
Farhan, Hameed R. ;
Abbas, Hawraa H. ;
Shahadi, Haider I. .
INTERNATIONAL CONFERENCE OF INFORMATION AND COMMUNICATION TECHNOLOGY (ICICT 2019), 2019, :154-159
[36]   A MULTI-DILATION AND MULTI-RESOLUTION FULLY CONVOLUTIONAL NETWORK FOR SINGING MELODY EXTRACTION [J].
Gao, Ping ;
You, Cheng-You ;
Chi, Tai-Shih .
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, :551-555
[37]   Multi-Resolution Feature Embedded Level Set Model for Crosshatched Texture Segmentation [J].
Prabhakar, K. ;
Sadyojatha, K. M. .
INTERNATIONAL JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING SYSTEMS, 2023, 14 (04) :371-379
[38]   A GPU-based multi-resolution algorithm for simulation of seed dispersal [J].
Fan, Jing ;
Ji, Hai-feng ;
Guan, Xin-xin ;
Tang, Ying .
JOURNAL OF ZHEJIANG UNIVERSITY-SCIENCE C-COMPUTERS & ELECTRONICS, 2012, 13 (11) :816-827
[39]   A GPU-based multi-resolution algorithm for simulation of seed dispersal [J].
Jing Fan ;
Hai-feng Ji ;
Xin-xin Guan ;
Ying Tang .
Journal of Zhejiang University SCIENCE C, 2012, 13 :816-827
[40]   Development of a watershed algorithm for multi-resolution, multi-dimensional clustering of hyperspectral data [J].
Jellison, GP ;
Hemmer, TH ;
Wilson, DG .
ALGORITHMS AND TECHNOLOGIES FOR MULTISPECTRAL, HYPERSPECTRAL, AND ULTRASPECTRAL IMAGERY VIII, 2002, 4725 :290-301