HARMONIC-PERCUSSIVE SOURCE SEPARATION WITH DEEP NEURAL NETWORKS AND PHASE RECOVERY

被引:0
作者
Drossos, Konstantinos [1 ]
Magron, Paul [1 ]
Mimilakis, Stylianos Ioannis [2 ]
Virtanen, Tuomas [1 ]
机构
[1] Tampere Univ Technol, Lab Signal Proc, Tampere, Finland
[2] Fraunhofer IDMT, Ilmenau, Germany
来源
2018 16TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC) | 2018年
基金
芬兰科学院; 欧洲研究理事会; 欧盟地平线“2020”;
关键词
harmonic/percussive source separation; deep neural networks; MaD TwinNet; phase recovery; sinusoidal model;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Harmonic/percussive source separation (HPSS) consists in separating the pitched instruments from the percussive parts in a music mixture. In this paper, we propose to apply the recently introduced Masker-Denoiser with twin networks (MaD TwinNet) system to this task. MaD TwinNet is a deep learning architecture that has reached state-of-the-art results in monaural singing voice separation. Herein, we propose to apply it to HPSS by using it to estimate the magnitude spectrogram of the percussive source. Then, we retrieve the complex-valued short-time Fourier transform of the sources by means of a phase recovery algorithm, which minimizes the reconstruction error and enforces the phase of the harmonic part to follow a sinusoidal phase model. Experiments conducted on realistic music mixtures show that this novel separation system outperforms the previous state-of-the art kernel additive model approach.
引用
收藏
页码:421 / 425
页数:5
相关论文
共 50 条
[41]   Stable recovery of entangled weights: Towards robust identification of deep neural networks from minimal samples [J].
Fiedler, Christian ;
Fornasier, Massimo ;
Klock, Timo ;
Rauchensteiner, Michael .
APPLIED AND COMPUTATIONAL HARMONIC ANALYSIS, 2023, 62 :123-172
[42]   DISCRIMINATIVE MULTIPLE SOUND SOURCE LOCALIZATION BASED ON DEEP NEURAL NETWORKS USING INDEPENDENT LOCATION MODEL [J].
Takeda, Ryu ;
Komatani, Kazunori .
2016 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2016), 2016, :603-609
[43]   A Method to Build Digital Twin of Atmospheric Turbulence Phase Screens With Comprehensible Deep Neural Networks [J].
Xiang Zhang ;
Peng Jia ;
Wang, Weihua .
ADAPTIVE OPTICS SYSTEMS VIII, 2022, 12185
[44]   Robust and Efficient Phase Estimation in Legged Robots via Signal Imaging and Deep Neural Networks [J].
Yazdipaz, Kamyab ;
Kohli, Nooshin ;
Golestaneh, Seyed Ali ;
Shahbazi, Mohammad .
IEEE ACCESS, 2025, 13 :49018-49029
[45]   Data recovery of 2D lifetime-based phosphor thermometry using deep neural networks [J].
Jung, Juyong ;
Kim, Mirae ;
Cai, Tao ;
Liu, Yingzheng ;
Kim, Kyung Chun .
MEASUREMENT SCIENCE AND TECHNOLOGY, 2023, 34 (07)
[46]   Adversarial robustness evaluation of multiple-source remote sensing image recognition based on deep neural networks [J].
Sun H. ;
Xu Y. ;
Chen J. ;
Lei L. ;
Ji K. ;
Kuang G. .
National Remote Sensing Bulletin, 2023, 27 (08) :1951-1963
[47]   Speech Separation Based on Improved Deep Neural Networks with Dual Outputs of Speech Features for Both Target and Interfering Speakers [J].
Tu, Yanhui ;
Du, Jun ;
Xu, Yong ;
Dai, Lirong ;
Lee, Chin-Hui .
2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, :250-+
[48]   Enhanced phase recovery in in-line holography with self-supervised complex-valued neural networks [J].
Dou, Jiazhen ;
An, Qiming ;
Liu, Xiaosong ;
Mai, Yujian ;
Zhong, Liyun ;
Di, Jianglei ;
Qin, Yuwen .
OPTICS AND LASERS IN ENGINEERING, 2025, 184
[49]   Power Flow Analysis Using Deep Neural Networks in Three-Phase Unbalanced Smart Distribution Grids [J].
Tiwari, Deepak ;
Zideh, Mehdi Jabbari ;
Talreja, Veeru ;
Verma, Vishal ;
Solanki, Sarika Khushalani ;
Solanki, Jignesh .
IEEE ACCESS, 2024, 12 :29959-29970
[50]   Molecular simulations and deep neural networks-based interpretable machine learning modelling of reverse adsorptive MOFs for ethane/ethylene separation [J].
Yadava, Khushboo ;
Srivastava, Shrey ;
Yadav, Ashutosh .
CANADIAN JOURNAL OF CHEMICAL ENGINEERING, 2025, 103 (03) :1083-1098