Harmonic Representation for CNN-LSTM Automatic Chord Recognition

Citations: 0
Authors
Ito, Tsuyoshi [1]
Arai, Shuichi [1]
Affiliations
[1] Tokyo City Univ, Grad Sch Engn, Setagaya Ku, 1-28-1 Tamazutsumi, Tokyo 1588557, Japan
Source
3RD INTERNATIONAL CONFERENCE ON CYBERNETICS AND INTELLIGENT SYSTEMS (ICORIS 2021) | 2021
Keywords
Deep Learning; Signal Processing; Pattern Recognition; Music Information Retrieval; Automatic Chord Recognition;
DOI
10.1109/ICORIS52787.2021.9649565
Chinese Library Classification number
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Since the chord progression is the element that determines the harmony of a piece of music, Automatic Chord Recognition (ACR) from audio signals is a crucial task in the field of Music Information Retrieval (MIR). Recently, various deep learning models have been proposed, but there are few studies on their input features. Each note of a chord consists of a fundamental and its overtones sounding simultaneously. To model these audio signals efficiently, feature transforms such as the Constant-Q Transform (CQT) are used. However, because the fundamentals and overtones of various instruments are superposed in polyphonic music, modeling chords is considered difficult even with deep learning. We therefore focus on the structure in which fundamentals are spaced on a logarithmic frequency scale while their overtones are spaced on a linear one. In this paper, we propose a feature representation that captures the overtone structure of each fundamental. Based on this representation, a CNN-LSTM model learns chords in a data-driven manner. We evaluated performance on 383 songs with publicly available annotations and achieved the same performance as existing methods with approximately one-tenth of the number of parameters.
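The abstract's central idea, fundamentals on a logarithmic frequency grid with each fundamental's overtones at linear (integer) multiples, can be illustrated with a small NumPy sketch. This is not the authors' implementation: the function name `harmonic_stack`, its parameters, and the use of linear interpolation over a plain magnitude spectrum (instead of a CQT) are assumptions made only for illustration.

```python
import numpy as np


def harmonic_stack(spectrum, freqs, f_min=55.0, bins_per_octave=12,
                   n_octaves=4, n_harmonics=4):
    """Illustrative sketch (hypothetical helper, not from the paper).

    For each candidate fundamental on a log-spaced grid, sample the
    magnitude spectrum at its integer multiples (h * f0). The result has
    shape (n_fundamentals, n_harmonics): one overtone profile per
    fundamental.
    """
    n_fund = bins_per_octave * n_octaves
    # Fundamentals are spaced logarithmically (equal-tempered semitones).
    f0 = f_min * 2.0 ** (np.arange(n_fund) / bins_per_octave)
    # Overtones are spaced linearly: 1*f0, 2*f0, 3*f0, ...
    harmonics = np.arange(1, n_harmonics + 1)
    target = f0[:, None] * harmonics[None, :]
    # Read the spectrum at each target frequency; zero outside the range.
    values = np.interp(target.ravel(), freqs, spectrum, left=0.0, right=0.0)
    return values.reshape(target.shape)


# Synthetic single note: A2 (110 Hz) with three decaying overtones.
freqs = np.linspace(0.0, 4000.0, 4001)  # 1 Hz resolution
spectrum = np.zeros_like(freqs)
spectrum[[110, 220, 330, 440]] = [1.0, 0.5, 0.25, 0.125]

features = harmonic_stack(spectrum, freqs)
print(features.shape)
print(features[12])  # row for the 110 Hz fundamental
```

In this layout each row gathers the energy belonging to one candidate fundamental, so a downstream model such as a CNN can read the overtone pattern of a note along one axis rather than having to find it scattered across a log-frequency spectrum.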
Pages: 196-200
Number of pages: 5