INFORMED EGO-NOISE SUPPRESSION USING MOTOR DATA-DRIVEN DICTIONARIES

被引:0
作者
Schmidt, Alexander [1 ]
Kellermann, Walter [1 ]
机构
[1] Friedrich Alexander Univ Erlangen Nurnberg, Multimedia Commun & Signal Proc, Cauerstr 7, D-91058 Erlangen, Germany
来源
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2019年
关键词
Ego-noise suppression; robot; motor data; dictionary learning; nonnegative matrix factorization; NONNEGATIVE MATRIX FACTORIZATION;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The suppression of ego-noise for (humanoid) robots is typically addressed by learning-based techniques. In this paper, we propose a novel approach which models significant parts of ego-noise spectrograms based on motor data and does not require a prior training step. Accordingly, the intrinsic harmonic structure of ego noise is taken into account by introducing a nonnegative matrix factorization (NMF) framework with motor data-driven dictionaries. Limited improvement was observed by employing an additional pre-trained small-sized dictionary accounting for the residual ego-noise. The presented approach exhibits comparable suppression performance to an audio only-based approach trained specifically to the scenario, while the number of dictionary elements which require prior learning can be reduced by a factor of two. For ego-noise resulting from previously unseen movements, the proposed method shows consistently superior suppression results while the audio only-based approach degrades drastically.
引用
收藏
页码:116 / 120
页数:5
相关论文
共 22 条
  • [1] K-SVD: An algorithm for designing overcomplete dictionaries for sparse representation
    Aharon, Michal
    Elad, Michael
    Bruckstein, Alfred
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2006, 54 (11) : 4311 - 4322
  • [2] [Anonymous], 2001, ADV NEURAL INFORM PR
  • [3] [Anonymous], 2013, TECHNOLOGY BINAURAL
  • [4] An audio-visual corpus for speech perception and automatic speech recognition (L)
    Cooke, Martin
    Barker, Jon
    Cunningham, Stuart
    Shao, Xu
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2006, 120 (05) : 2421 - 2424
  • [5] Deleforge A, 2015, INT CONF ACOUST SPEE, P355, DOI 10.1109/ICASSP.2015.7177990
  • [6] Fevotte C., 2005, Tech. Rep. 1706
  • [7] Algorithms for Nonnegative Matrix Factorization with the β-Divergence
    Fevotte, Cedric
    Idier, Jerome
    [J]. NEURAL COMPUTATION, 2011, 23 (09) : 2421 - 2456
  • [8] Nonnegative Matrix Factorization with the Itakura-Saito Divergence: With Application to Music Analysis
    Fevotte, Cedric
    Bertin, Nancy
    Durrieu, Jean-Louis
    [J]. NEURAL COMPUTATION, 2009, 21 (03) : 793 - 830
  • [9] Mechatronic design of NAO humanoid
    Gouaillier, David
    Hugel, Vincent
    Blazevic, Pierre
    Kilner, Chris
    Monceaux, Jerome
    Lafourcade, Pascal
    Marnier, Brice
    Serre, Julien
    Maisonnier, Bruno
    [J]. ICRA: 2009 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-7, 2009, : 2124 - +
  • [10] Hennequin R., 2010, P 13 INT C DIG AUD E, P8