Deep Neural Network for Multi-Pitch Estimation Using Weighted Cross Entropy Loss

被引:1
|
作者
Stone, Samuel [1 ]
Spector, Evan [1 ]
机构
[1] SRC Inc, North Syracuse, NY 13212 USA
来源
2021 IEEE WESTERN NEW YORK IMAGE AND SIGNAL PROCESSING WORKSHOP (WNYISPW) | 2021年
关键词
Frequency Estimation; Machine Learning; Harmonic Analysis;
D O I
10.1109/WNYISPW53194.2021.9661285
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Multi-Pitch Estimation, the estimation of multiple overlapping or polyphonic harmonic fundamental frequencies, has a wide range of applications, including automatic music transcription, power systems, and radar signal processing. Multiple fundamental frequencies represent a challenge due to the added complexity of the overlapping signals. This paper presents a Deep Learning approach to estimating multiple fundamental frequencies. The network is trained in a supervised fashion to generate a pseudospectrum representing the fundamental frequencies. Training data is represented by a sparse binary vector the size of the pseudospectrum, indicating the location of fundamental frequencies. A weighted binary cross-entropy loss function is used to correct for class imbalance caused by the sparsity of the signal space relative to the full spectrum. We show comparable performance to existing techniques while requiring fewer operations and samples due to a simpler frequency-domain-only architecture.
引用
收藏
页数:3
相关论文
共 50 条
  • [1] Multi-pitch estimation exploiting block sparsity
    Adalbjornsson, Stefan I.
    Jakobsson, Andreas
    Christensen, Mads G.
    SIGNAL PROCESSING, 2015, 109 : 236 - 247
  • [2] The multi-pitch estimation problem: Some new solutions
    Christensen, Mads Graeesboll
    Stoica, Petre
    Jakobsson, Andreas
    Jensen, Soren Holdt
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PTS 1-3, PROCEEDINGS, 2007, : 1221 - +
  • [3] Deep Neural Network for Magnetic Core Loss Estimation using the MagNet Experimental Database
    Shen, Xiaobing
    Wouters, Hans
    Martinez, Wilmar
    2022 24TH EUROPEAN CONFERENCE ON POWER ELECTRONICS AND APPLICATIONS (EPE'22 ECCE EUROPE), 2022,
  • [4] EXPECTATION-MAXIMIZATION ALGORITHM FOR MULTI-PITCH ESTIMATION AND SEPARATION OF OVERLAPPING HARMONIC SPECTRA
    Badeau, Roland
    Emiya, Valentin
    David, Bertrand
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 3073 - 3076
  • [5] Multi-Layer Combined Frequency and Periodicity Representations for Multi-Pitch Estimation of Multi-Instrument Music
    Matsunaga, Tomoki
    Saito, Hiroaki
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 3171 - 3184
  • [6] Evolving a Multi-Classifier System for Multi-Pitch Estimation of Piano Music and Beyond: An Application of Cartesian Genetic Programming
    Miragaia, Rolando
    Fernandez, Francisco
    Reis, Gustavo
    Inacio, Tiago
    APPLIED SCIENCES-BASEL, 2021, 11 (07):
  • [7] Facial expression recognition using convolutional neural network with weighted loss function
    Luan, Jiawei
    2020 IEEE 3RD INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND SIGNAL PROCESSING (ICICSP 2020), 2020, : 252 - 257
  • [8] An Intelligent Vehicle Price Estimation Approach Using a Deep Neural Network Model
    Alnajim, Thuraya
    Alshahrani, Nouf
    Asiri, Omar
    WORLD ELECTRIC VEHICLE JOURNAL, 2024, 15 (08):
  • [9] Radio propagation prediction using deep neural network and building occupancy estimation
    Inoue, Kazuya
    Ichige, Koichi
    Nagao, Tatsuya
    Hayashi, Takahiro
    IEICE COMMUNICATIONS EXPRESS, 2020, 9 (10): : 506 - 511
  • [10] Channel Estimation with Fully Connected Deep Neural Network
    Sokullu, Radosveta
    Yildirim, Mete
    WIRELESS PERSONAL COMMUNICATIONS, 2022, 125 (03) : 2305 - 2317