END-TO-END SOURCE SEPARATION WITH ADAPTIVE FRONT-ENDS

被引:0
|
作者
Venkataramani, Shrikant [1 ]
Casebeer, Jonah [1 ]
Smaragdis, Paris [2 ]
机构
[1] Univ Illinois, Champaign, IL 61820 USA
[2] Univ Illinois, Adobe Res, Champaign, IL USA
关键词
Auto-encoders; adaptive transforms; source separation; deep learning;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Source separation and other audio applications have traditionally relied on the use of short-time Fourier transforms as a front-end frequency domain representation step. The unavailability of a neural network equivalent to forward and inverse transforms hinders the implementation of end-to-end learning systems for these applications. We develop an auto-encoder neural network that can act as an equivalent to short-time front-end transforms. We demonstrate the ability of the network to learn optimal, real-valued basis functions directly from the raw waveform of a signal and further show how it can be used as an adaptive front-end for supervised source separation. In terms of separation performance, these transforms significantly outperform their Fourier counterparts. Finally, we also propose and interpret a novel source to distortion ratio based cost function for end-to-end source separation.
引用
收藏
页码:684 / 688
页数:5
相关论文
共 50 条
  • [1] End-to-end simulation: The front end
    Haber, I
    Bieniosek, FM'
    Celata, BM
    Friedman, A
    Grote, DP
    Henestroza, E
    Vay, JL
    Bernal, S
    Kishek, RA
    O'Shea, PG
    Reiser, M
    Herrmannsfeldt, WB
    LASER AND PARTICLE BEAMS, 2002, 20 (03) : 431 - 433
  • [2] End-to-end music source separation: is it possible in the waveform domain?
    Lluis, Francesc
    Pons, Jordi
    Serra, Xavier
    INTERSPEECH 2019, 2019, : 4619 - 4623
  • [3] END-TO-END SOUND SOURCE SEPARATION CONDITIONED ON INSTRUMENT LABELS
    Slizovskaia, Olga
    Kim, Leo
    Haro, Gloria
    Gomez, Emilia
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 306 - 310
  • [4] FRONT-ENDS FOR AIR SEPARATION PLANTS - THE COLD FACTS
    KERRY, FG
    CHEMICAL ENGINEERING PROGRESS, 1991, 87 (08) : 48 - 54
  • [5] END-TO-END NON-NEGATIVE AUTOENCODERS FOR SOUND SOURCE SEPARATION
    Venkataramani, Shrikant
    Tzinis, Efthymios
    Sinaragdis, Paris
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 116 - 120
  • [6] LIGHTWEIGHT END-TO-END DEEP LEARNING MODEL FOR MUSIC SOURCE SEPARATION
    Wang, Yao-Ting
    Lin, Yi-Xing
    Liang, Kai-Wen
    Tai, Tzu-Chiang
    Wang, Jia-Ching
    2022 13TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2022, : 315 - 318
  • [7] Direction specific ambisonics source separation with end-to-end deep learning
    Lluis, Francesc
    Meyer-Kahlen, Nils
    Chatziioannou, Vasileios
    Hofmann, Alex
    ACTA ACUSTICA, 2023, 7
  • [8] INTELLIGENT FRONT-ENDS
    MERTENS, P
    WIRTSCHAFTSINFORMATIK, 1992, 34 (03): : 269 - 282
  • [9] Designing front-ends
    Carr, J
    ELECTRONICS WORLD, 2001, 107 (1777): : 50 - 53
  • [10] CoSSD - An end-to-end framework for multi-instance source separation and detection
    Baligar, Shrishail
    Newsam, Shawn
    2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 150 - 154