A PARALLEL FUSION APPROACH TO PIANO MUSIC TRANSCRIPTION BASED ON CONVOLUTIONAL NEURAL NETWORK

被引：0

作者：

Cong, Fu'ze ^{[1
]}

Liu, Shuchang ^{[1
]}

Guo, Li ^{[1
]}

Wiggins, Geraint A. ^{[2
,3
]}

机构：

[1] Beijing Univ Posts & Telecommun, Minist Educ, Key Lab Univ Wireless Commun, Beijing, Peoples R China

[2] Queen Mary Univ London, Sch Elect Engn & Comp Sci, London, England

[3] Free Univ Brussels, Dept Comp Sci, AI Lab, Brussels, Belgium

来源：

2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2018年

关键词：

Automatic music transcription; deep learning; convolutional neural network; note onset/offset detection;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this paper, a supervised approach based on Convolutional Neural Networks (CNN) for polyphonic piano transcription is presented. The system consists of pitch detection model, onset/offset detection model, and note search model. The pitch detection model is a single-channel CNN predicting the probabilities of pitches contained in one frame of the audio. The onset/offset model based on dual-channel CNN is used for estimating the probabilities of each pitch's onset or offset in a frame. The note search model is rule-based; it integrates the outputs of the pitch model and onset/offset model to determine the final onset, offset and pitch of notes in audio. Two experiments with different dataset conditions are accomplished to compare with state-of-the-art approaches on the same datasets. Experimental results reveal that the proposed approach preforms better in both frame- and note-based metrics.

引用

页码：391 / 395

页数：5

共 22 条

[1] ABDALLAH SA, 2004, P 5 INT S MUS INF RE
[2] [Anonymous], P WORKSH ADV MOD AC
[3] [Anonymous], 2010, PROC ICML
[4] [Anonymous], 2014, INT C LEARN REPR
[5] Bay M., 2009, P 13 INT SOC MUS INF
[6] Automatic music transcription: challenges and future directions
Benetos, Emmanouil
Dixon, Simon
Giannoulis, Dimitrios
Kirchhoff, Holger
Klapuri, Anssi
[J]. JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2013, 41 (03) : 407 - 434
[7] A Shift-Invariant Latent Variable Model for Automatic Music Transcription
Benetos, Emmanouil
Dixon, Simon
[J]. COMPUTER MUSIC JOURNAL, 2012, 36 (04) : 81 - 94
[8] Bock S., 2012, P IEEE INT C AC SPEE
[9] CALCULATION OF A CONSTANT-Q SPECTRAL TRANSFORM
BROWN, JC
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1991, 89 (01) : 425 - 434
[10] Eyben F., 2010, P 11 INT SOC MUS INF

← 1 2 3 →