Pitch-informed solo and accompaniment separation towards its use in music education applications

被引:9
作者
Cano, Estefania [1 ]
Schuller, Gerald [2 ]
Dittmar, Christian [1 ]
机构
[1] Fraunhofer Inst Digital Media Technol IDMT, D-98693 Ilmenau, Germany
[2] Tech Univ Ilmenau, D-98693 Ilmenau, Germany
来源
EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING | 2014年
关键词
Dynamic Time Warping; Listening Test; Pitch Contour; Temporal Envelope; Music Education;
D O I
10.1186/1687-6180-2014-23
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We present a system for the automatic separation of solo instruments and music accompaniment in polyphonic music recordings. Our approach is based on a pitch detection front-end and a tone-based spectral estimation. We assess the plausibility of using sound separation technologies to create practice material in a music education context. To better understand the sound separation quality requirements in music education, a listening test was conducted to determine the most perceptually relevant signal distortions that need to be improved. Results from the listening test show that solo and accompaniment tracks pose different quality requirements and should be optimized differently. We propose and evaluate algorithm modifications to better understand their effects on objective perceptual quality measures. Finally, we outline possible ways of optimizing our separation approach to better suit the requirements of music education applications.
引用
收藏
页数:19
相关论文
共 43 条
  • [1] Bosch JJ, 2012, EUR SIGNAL PR CONF, P2417
  • [2] Bregman A., 1990, Auditory Scene Analysis: The Perceptual Organization of Sound, DOI DOI 10.7551/MITPRESS/1486.001.0001
  • [3] Cano E., 2013, SOLO ACCOMPANIMENT S
  • [4] Cano E, 2013, 16 INT C DIG AUD EFF, P1
  • [5] Cano E., 2011, 12 INT SOC MUS INF R
  • [6] Cano Estefania., 2009, 12 INT C DIGITAL AUD, P1
  • [7] Dittmar C., 2012, MULTIMODAL MUSIC PRO, V3, P95, DOI 10.4230/DFU.Vol3.11041.95
  • [8] Dressler K., 2011, P 42 AES INT C SEM A, P1
  • [9] Soundprism: An Online System for Score-Informed Source Separation of Music Audio
    Duan, Zhiyao
    Pardo, Bryan
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2011, 5 (06) : 1205 - 1215
  • [10] Under-Determined Reverberant Audio Source Separation Using a Full-Rank Spatial Covariance Model
    Duong, Ngoc Q. K.
    Vincent, Emmanuel
    Gribonval, Remi
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (07): : 1830 - 1840