Attentive Convolutional Recurrent Neural Network Using Phoneme-Level Acoustic Representation for Rare Sound Event Detection

被引:3
作者
Upadhyay, Shreya G. [1 ,2 ]
Su, Bo-Hao [1 ,2 ]
Lee, Chi-Chun [1 ,2 ]
机构
[1] Natl Tsing Hua Univ, Dept Elect Engn, Hsinchu, Taiwan
[2] MOST Joint Res Ctr AI Technol & All Vista Healthc, Hsinchu, Taiwan
来源
INTERSPEECH 2020 | 2020年
关键词
sound event detection; convolution recurrent neural network; attention; automatic speech recognition; CLASSIFICATION;
D O I
10.21437/Interspeech.2020-2585
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
A well-trained Acoustic Sound Event Detection system captures the patterns of the sound to accurately detect events of interest in an auditory scene, which enables applications across domains of multimedia, smart living, and even health monitoring. Due to the scarcity and the weak labelling nature of the sound event data, it is often challenging to train an accurate and robust acoustic event detection model directly, especially for those rare occurrences. In this paper, we proposed an architecture which takes the advantage of integrating ASR network representations as additional input when training a sound event detector. Here we used the convolutional bi-directional recurrent neural network (CBRNN), which includes both spectral and temporal attentions, as the SED classifier and further combined the ASR feature representations when performing the end-to-end CBRNN training. Our experiments on the TUT 2017 rare sound event detection dataset showed that with the inclusion of ASR features, the overall discriminative performance of the end-to-end sound event detection system has improved; the average performance of our proposed framework in terms of f-score and error rates are 97 % and 0.05 % respectively.
引用
收藏
页码:3102 / 3106
页数:5
相关论文
共 34 条
[21]   Speech signal-based accurate neurological disorders detection using convolutional neural network and recurrent neural network based deep network [J].
Soylu, Emel ;
Guel, Sema ;
Koca, Kuebra Aslan ;
Tuerkoglu, Muammer ;
Terzi, Murat ;
Senguer, Abdulkadir .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 149
[22]   A study on the waveform-based end-to-end deep convolutional neural network for weakly supervised sound event detection [J].
Lee, Seokjin ;
Kim, Minhan ;
Jeong, Youngho .
JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2020, 39 (01) :24-31
[23]   Automatic Detection of Power Quality Disturbance Using Convolutional Neural Network Structure with Gated Recurrent Unit [J].
Yigit, Enes ;
Ozkaya, Umut ;
Ozturk, Saban ;
Singh, Dilbag ;
Gritli, Hassene .
MOBILE INFORMATION SYSTEMS, 2021, 2021
[24]   Sound Event Detection in Real Life Audio using Perceptual Linear Predictive Feature with Neural Network [J].
Feroze, Khizer ;
Maud, Abdur Rahman .
PROCEEDINGS OF 2018 15TH INTERNATIONAL BHURBAN CONFERENCE ON APPLIED SCIENCES AND TECHNOLOGY (IBCAST), 2018, :377-382
[25]   A Multi-Task Learning Framework for Sound Event Detection using High-level Acoustic Characteristics of Sounds [J].
Khandelwal, Tanmay ;
Das, Rohan Kumar .
INTERSPEECH 2023, 2023, :1214-1218
[26]   DIFFCRNN: A NOVEL APPROACH FOR DETECTING SOUND EVENTS IN SMART HOME SYSTEMS USING DIFFUSION-BASED CONVOLUTIONAL RECURRENT NEURAL NETWORK [J].
Al Dabel, Maryam M. .
SCALABLE COMPUTING-PRACTICE AND EXPERIENCE, 2024, 25 (05) :3796-3811
[27]   Using multi-stream hierarchical deep neural network to extract deep audio feature for acoustic event detection [J].
Li, Yanxiong ;
Zhang, Xue ;
Jin, Hai ;
Li, Xianku ;
Wang, Qin ;
He, Qianhua ;
Huang, Qian .
MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (01) :897-916
[28]   An Efficient Hybrid Model for Acute Myeloid Leukaemia detection using Convolutional Bi-LSTM based Recurrent Neural Network [J].
Ramya, V. Jeya ;
Lakshmi, S. .
COMPUTER METHODS IN BIOMECHANICS AND BIOMEDICAL ENGINEERING-IMAGING AND VISUALIZATION, 2023, 11 (03) :413-424
[29]   An intelligent skin cancer detection system using two-level multi-column convolutional neural network architecture [J].
Sivakumar, Akash ;
Vedhapriyavadhana, R. ;
Ganapathy, Sannasi .
Neural Computing and Applications, 2024, 36 (30) :19191-19207
[30]   Deep Learning-Based Plant Leaf Disease Detection Using Scaled Immutable Feature Selection Using Adaptive Deep Convolutional Recurrent Neural Network [J].
Jayashree S. ;
Sumalatha V. .
SN Computer Science, 4 (5)