A Study on Transfer Learning for Acoustic Event Detection in a Real Life Scenario

被引:0
作者
Arora, Prerna [1 ]
Haeb-Umbach, Reinhold [1 ]
机构
[1] Paderborn Univ, Dept Commun Engn, Paderborn, Germany
来源
2017 IEEE 19TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP) | 2017年
关键词
KNOWLEDGE TRANSFER;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this work, we address the limited availability of large annotated databases for real-life audio event detection by utilizing the concept of transfer learning. This technique aims to transfer knowledge from a source domain to a target domain, even if source and target have different feature distributions and label sets. We hypothesize that all acoustic events share the same inventory of basic acoustic building blocks and differ only in the temporal order of these acoustic units. We then construct a deep neural network with convolutional layers for extracting the acoustic units and a recurrent layer for capturing the temporal order. Under the above hypothesis, transfer learning from a source to a target domain with a different acoustic event inventory is realized by transferring the convolutional layers from the source to the target domain. The recurrent layer is, however, learnt directly from the target domain. Experiments on the transfer from a synthetic source database to the real life target database of DCASE 2016 demonstrate that transfer learning leads to improved detection performance on average. However, the successful transfer to detect events which are very different from what was seen in the source domain, could not be verified.
引用
收藏
页数:6
相关论文
共 35 条
  • [1] [Anonymous], ARXIV170301127
  • [2] [Anonymous], 2014, IEEE C COMP VIS PATT
  • [3] [Anonymous], PROC CVPR IEEE
  • [4] [Anonymous], 2001, DETECTING SOUND EVEN
  • [5] [Anonymous], ARXIV170206286
  • [6] [Anonymous], HETEROGENEOUS TRANSF
  • [7] [Anonymous], 2008, Advances in neural information processing systems, DOI DOI 10.5555/2981780.2981825
  • [8] [Anonymous], 26 IEEE INT S ROB HU
  • [9] [Anonymous], 2017, IEEE INT C AC SPEECH
  • [10] [Anonymous], P EUR