NEURAL AUDIO-TO-SCORE MUSIC TRANSCRIPTION FOR UNCONSTRAINED POLYPHONY USING COMPACT OUTPUT REPRESENTATIONS

Cited by: 7
Authors
Arroyo, Victor [1]
Valero-Mas, Jose J. [1]
Calvo-Zaragoza, Jorge [1]
Pertusa, Antonio [1]
Affiliations
[1] Univ Alicante, Univ Inst Comp Res IUII, Alicante, Spain
Source
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2022
Keywords
Audio-to-Score Transcription; Connectionist Temporal Classification; Unconstrained Polyphony
DOI
10.1109/ICASSP43922.2022.9746239
Chinese Library Classification
O42 [Acoustics]
Subject classification codes
070206; 082403
Abstract
Neural Audio-to-Score (A2S) Music Transcription systems have shown promising results with pieces containing a fixed number of voices. However, they still exhibit fundamental limitations that constrain their applicability in wider scenarios. This work aims at tackling two of them: we introduce a novel output representation which addresses shortcomings related to the sequence-based A2S recognition framework and we report a first approximation to dealing with unconstrained polyphony. This is validated on a Convolutional Recurrent Neural Network (CRNN) with Connectionist Temporal Classification (CTC) A2S scheme using synthetic audio from string quartets and piano sonatas with intricate polyphonic mixtures. Our results, which improve fixed-polyphony state-of-the-art rates, may be considered a reference for future A2S works dealing with an unconstrained number of voices.
Pages: 4603-4607
Page count: 5