Pitch-informed solo and accompaniment separation towards its use in music education applications

Cited by: 9
Authors
Cano, Estefania [1]
Schuller, Gerald [2]
Dittmar, Christian [1]
Affiliations
[1] Fraunhofer Inst Digital Media Technol IDMT, D-98693 Ilmenau, Germany
[2] Tech Univ Ilmenau, D-98693 Ilmenau, Germany
Source
EURASIP Journal on Advances in Signal Processing | 2014
Keywords
Dynamic Time Warping; Listening Test; Pitch Contour; Temporal Envelope; Music Education
DOI
10.1186/1687-6180-2014-23
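Chinese Library Classification (CLC) number
TM [Electrical Engineering]; TN [Electronics and Communication Technology]
Discipline classification code
0808; 0809
Abstract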
We present a system for the automatic separation of solo instruments and music accompaniment in polyphonic music recordings. Our approach is based on a pitch detection front-end and a tone-based spectral estimation. We assess the plausibility of using sound separation technologies to create practice material in a music education context. To better understand the sound separation quality requirements in music education, a listening test was conducted to determine the most perceptually relevant signal distortions that need to be improved. Results from the listening test show that solo and accompaniment tracks pose different quality requirements and should be optimized differently. We propose and evaluate algorithm modifications to better understand their effects on objective perceptual quality measures. Finally, we outline possible ways of optimizing our separation approach to better suit the requirements of music education applications.
Pages: 19