A PARALLEL FUSION APPROACH TO PIANO MUSIC TRANSCRIPTION BASED ON CONVOLUTIONAL NEURAL NETWORK

被引:0
作者
Cong, Fu'ze [1 ]
Liu, Shuchang [1 ]
Guo, Li [1 ]
Wiggins, Geraint A. [2 ,3 ]
机构
[1] Beijing Univ Posts & Telecommun, Minist Educ, Key Lab Univ Wireless Commun, Beijing, Peoples R China
[2] Queen Mary Univ London, Sch Elect Engn & Comp Sci, London, England
[3] Free Univ Brussels, Dept Comp Sci, AI Lab, Brussels, Belgium
来源
2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2018年
关键词
Automatic music transcription; deep learning; convolutional neural network; note onset/offset detection;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, a supervised approach based on Convolutional Neural Networks (CNN) for polyphonic piano transcription is presented. The system consists of pitch detection model, onset/offset detection model, and note search model. The pitch detection model is a single-channel CNN predicting the probabilities of pitches contained in one frame of the audio. The onset/offset model based on dual-channel CNN is used for estimating the probabilities of each pitch's onset or offset in a frame. The note search model is rule-based; it integrates the outputs of the pitch model and onset/offset model to determine the final onset, offset and pitch of notes in audio. Two experiments with different dataset conditions are accomplished to compare with state-of-the-art approaches on the same datasets. Experimental results reveal that the proposed approach preforms better in both frame- and note-based metrics.
引用
收藏
页码:391 / 395
页数:5
相关论文
共 22 条
  • [1] ABDALLAH SA, 2004, P 5 INT S MUS INF RE
  • [2] [Anonymous], P WORKSH ADV MOD AC
  • [3] [Anonymous], 2010, PROC ICML
  • [4] [Anonymous], 2014, INT C LEARN REPR
  • [5] Bay M., 2009, P 13 INT SOC MUS INF
  • [6] Automatic music transcription: challenges and future directions
    Benetos, Emmanouil
    Dixon, Simon
    Giannoulis, Dimitrios
    Kirchhoff, Holger
    Klapuri, Anssi
    [J]. JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2013, 41 (03) : 407 - 434
  • [7] A Shift-Invariant Latent Variable Model for Automatic Music Transcription
    Benetos, Emmanouil
    Dixon, Simon
    [J]. COMPUTER MUSIC JOURNAL, 2012, 36 (04) : 81 - 94
  • [8] Bock S., 2012, P IEEE INT C AC SPEE
  • [9] CALCULATION OF A CONSTANT-Q SPECTRAL TRANSFORM
    BROWN, JC
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1991, 89 (01) : 425 - 434
  • [10] Eyben F., 2010, P 11 INT SOC MUS INF