An End-to-End Machine Learning System for Harmonic Analysis of Music

被引:33
|
作者
Ni, Yizhao [1 ]
McVicar, Matt [1 ]
Santos-Rodriguez, Raul [2 ]
De Bie, Tijl [1 ]
机构
[1] Univ Bristol, Dept Engn Math, Intelligent Syst Lab, Bristol BS8 1UB, Avon, England
[2] Univ Carlos III Madrid, Signal Theory & Commun Dept, E-28903 Getafe, Spain
来源
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2012年 / 20卷 / 06期
基金
英国工程与自然科学研究理事会;
关键词
Audio chord estimation; harmony progression analyzer (HPA); loudness-based chromagram; machine learning; meta-song evaluation; FEATURES; AUDIO;
D O I
10.1109/TASL.2012.2188516
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We present a new system for the harmonic analysis of popular musical audio. It is focused on chord estimation, although the proposed system additionally estimates the key sequence and bass notes. It is distinct from competing approaches in two main ways. First, it makes use of a new improved chromagram representation of audio that takes the human perception of loudness into account. Furthermore, it is the first system for joint estimation of chords, keys, and bass notes that is fully based on machine learning, requiring no expert knowledge to tune the parameters. This means that it will benefit from future increases in available annotated audio files, broadening its applicability to a wider range of genres. In all of three evaluation scenarios, including a new one that allows evaluation on audio for which no complete ground truth annotation is available, the proposed system is shown to be faster, more memory efficient, and more accurate than the state-of-the-art.
引用
收藏
页码:1771 / 1783
页数:13
相关论文
共 50 条
  • [1] END-TO-END LEARNING FOR MUSIC AUDIO
    Dieleman, Sander
    Schrauwen, Benjamin
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [2] SynthNet: Learning to Synthesize Music End-to-End
    Schimbinschi, Florin
    Walder, Christian
    Erfani, Sarah M.
    Bailey, James
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 3367 - 3374
  • [3] Analysis of the Effect of Sensors for End-to-End Machine Learning Odometry
    Rodriguez-Peral, Carlos Marquez
    Pena, Dexmont
    COMPUTER VISION - ECCV 2018 WORKSHOPS, PT VI, 2019, 11134 : 82 - 95
  • [4] PyODDS: An End-to-end Outlier Detection System with Automated Machine Learning
    Li, Yuening
    Zha, Daochen
    Venugopal, Praveen
    Zou, Na
    Hu, Xia
    WWW'20: COMPANION PROCEEDINGS OF THE WEB CONFERENCE 2020, 2020, : 153 - 157
  • [5] End-to-end Machine Learning using Kubeflow
    George, Johnu
    Saha, Amit
    PROCEEDINGS OF THE 5TH JOINT INTERNATIONAL CONFERENCE ON DATA SCIENCE & MANAGEMENT OF DATA, CODS COMAD 2022, 2022, : 336 - 338
  • [6] Machine Learning for End-to-End Congestion Control
    Zhang, Ticao
    Mao, Shiwen
    IEEE COMMUNICATIONS MAGAZINE, 2020, 58 (06) : 52 - 57
  • [7] End-to-End Machine Learning with Apache AsterixDB
    Alkowaileet, Wail
    Alsubaiee, Sattam
    Carey, Michael J.
    Li, Chen
    Ramampiaro, Heri
    Sinthong, Phanwadee
    Wang, Xikui
    PROCEEDINGS OF THE SECOND WORKSHOP ON DATA MANAGEMENT FOR END-TO-END MACHINE LEARNING, 2018,
  • [8] End-to-end Optimization of Machine Learning Prediction Queries
    Park, Kwanghyun
    Saur, Karla
    Banda, Dalitso
    Interlandi, Rathijit Sen Matteo
    Karanasos, Konstantinos
    PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA (SIGMOD '22), 2022, : 587 - 601
  • [9] Provenance Tracking for End-to-End Machine Learning Pipelines
    Grafberger, Stefan
    Groth, Paul
    Schelter, Sebastian
    COMPANION OF THE WORLD WIDE WEB CONFERENCE, WWW 2023, 2023, : 1512 - 1512
  • [10] Capturing end-to-end provenance for machine learning pipelines
    Schlegel, Marius
    Sattler, Kai-Uwe
    INFORMATION SYSTEMS, 2025, 132