Automatic Transcription of Guitar Chords and Fingering From Audio

Cited by: 34
Authors
Barbancho, Ana M. [1 ]
Klapuri, Anssi [2 ]
Tardon, Lorenzo J. [1 ]
Barbancho, Isabel [1 ]
Affiliations
[1] Univ Malaga, Dept Ingn Comunicac, ETS Ingn Telecomunicac, E-29071 Malaga, Spain
[2] Queen Mary Univ London, Ctr Digital Mus, London E1 4NS, England
Source
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2012 / Vol. 20 / No. 03
Keywords
Acoustic signal analysis; chord transcription; hidden Markov model (HMM); multiple fundamental frequency estimation; music signal processing; SPEECH;
DOI
10.1109/TASL.2011.2174227
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
This paper proposes a method for extracting the fingering configurations automatically from a recorded guitar performance. 330 different fingering configurations are considered, corresponding to different versions of the major, minor, major 7th, and minor 7th chords played on the guitar fretboard. The method is formulated as a hidden Markov model, where the hidden states correspond to the different fingering configurations and the observed acoustic features are obtained from a multiple fundamental frequency estimator that measures the salience of a range of candidate note pitches within individual time frames. Transitions between consecutive fingerings are constrained by a musical model trained on a database of chord sequences, and a heuristic cost function that measures the physical difficulty of moving from one configuration of finger positions to another. The method was evaluated on recordings from the acoustic, electric, and the Spanish guitar and clearly outperformed a non-guitar-specific reference chord transcription method despite the fact that the number of chords considered here is significantly larger.
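The decoding idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the three fingering states, the scores, the `weight` parameter, and the function names `viterbi` and `transition_scores` are all hypothetical, and subtracting a weighted movement cost from the log of the musical transition probability is only one assumed way of combining the two constraints. Per-frame observation scores stand in for the output of the multiple fundamental frequency estimator.

```python
import math

def transition_scores(music_prob, move_cost, weight=1.0):
    """Assumed combination: log chord-sequence probability minus a
    weighted physical cost of moving between fingerings."""
    n = len(music_prob)
    return [[math.log(music_prob[i][j]) - weight * move_cost[i][j]
             for j in range(n)] for i in range(n)]

def viterbi(obs_loglik, log_trans, log_init):
    """Return the best state path; all inputs are log-domain scores."""
    n = len(log_init)
    # delta[s]: best score of any path ending in state s at the current frame
    delta = [log_init[s] + obs_loglik[0][s] for s in range(n)]
    backptr = []
    for frame in obs_loglik[1:]:
        new_delta, ptr = [], []
        for s in range(n):
            best_prev = max(range(n), key=lambda p: delta[p] + log_trans[p][s])
            ptr.append(best_prev)
            new_delta.append(delta[best_prev] + log_trans[best_prev][s] + frame[s])
        delta = new_delta
        backptr.append(ptr)
    # Trace back from the best final state
    state = max(range(n), key=lambda s: delta[s])
    path = [state]
    for ptr in reversed(backptr):
        state = ptr[state]
        path.append(state)
    return path[::-1]

# Hypothetical states: 0 = open C major, 1 = barre C major, 2 = open G major
music_prob = [[1 / 3] * 3 for _ in range(3)]   # toy uniform chord model
move_cost = [[0, 4, 1], [4, 0, 4], [1, 4, 0]]  # toy hand-movement costs
log_init = [math.log(1 / 3)] * 3
# Toy per-frame acoustic log-likelihoods for each fingering
obs = [[-1.0, -1.2, -6.0], [-6.0, -6.0, -1.0], [-1.2, -1.0, -6.0]]

path_with_cost = viterbi(obs, transition_scores(music_prob, move_cost, 1.0), log_init)
path_audio_only = viterbi(obs, transition_scores(music_prob, move_cost, 0.0), log_init)
```

In this toy setup the last frame slightly favors the barre fingering acoustically, so the audio-only path ends in state 1, while the movement penalty steers the decoder back to the open fingering in state 0.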
Pages: 915 / 921
Page count: 7