A Music Cognition-Guided Framework for Multi-pitch Estimation

被引:2
|
作者
Li, Xiaoquan [1 ]
Yan, Yijun [2 ]
Soraghan, John [1 ]
Wang, Zheng [3 ]
Ren, Jinchang [2 ]
机构
[1] Univ Strathclyde, Dept Elect & Elect Engn, Glasgow, Lanark, Scotland
[2] Robert Gordon Univ, Natl Subsea Ctr, Aberdeen AB21 0BH, Scotland
[3] Tianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R China
关键词
Music cognition; Automatic music transcription; Multi-pitch estimation; Harmonic structure detection (HSD); Polyphonic music detection; TRANSCRIPTION; NETWORK;
D O I
10.1007/s12559-022-10031-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As one of the most important subtasks of automatic music transcription (AMT), multi-pitch estimation (MPE) has been studied extensively for predicting the fundamental frequencies in the frames of audio recordings during the past decade. However, how to use music perception and cognition for MPE has not yet been thoroughly investigated. Motivated by this, this demonstrates how to effectively detect the fundamental frequency and the harmonic structure of polyphonic music using a cognitive framework. Inspired by cognitive neuroscience, an integration of the constant Q transform and a state-of-the-art matrix factorization method called shift-invariant probabilistic latent component analysis (SI-PLCA) are proposed to resolve the polyphonic short-time magnitude log-spectra for multiple pitch estimation and source-specific feature extraction. The cognitions of rhythm, harmonic periodicity and instrument timbre are used to guide the analysis of characterizing contiguous notes and the relationship between fundamental frequency and harmonic frequencies for detecting the pitches from the outcomes of SI-PLCA. In the experiment, we compare the performance of proposed MPE system to a number of existing state-of-the-art approaches (seven weak learning methods and four deep learning methods) on three widely used datasets (i.e. MAPS, BACH10 and TRIOS) in terms of F-measure (F-1) values. The experimental results show that the proposed MPE method provides the best overall performance against other existing methods.
引用
收藏
页码:23 / 35
页数:13
相关论文
共 50 条
  • [41] EXPECTATION-MAXIMIZATION ALGORITHM FOR MULTI-PITCH ESTIMATION AND SEPARATION OF OVERLAPPING HARMONIC SPECTRA
    Badeau, Roland
    Emiya, Valentin
    David, Bertrand
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 3073 - 3076
  • [42] EFFICIENT IMPLEMENTATION OF PROBABILISTIC MULTI-PITCH TRACKING
    Wohlmayr, Michael
    Peharz, Robert
    Pernkopf, Franz
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5412 - 5415
  • [43] Multimodal neuromarkers in schizophrenia via cognition-guided MRI fusion
    Sui, Jing
    Qi, Shile
    van Erp, Theo G. M.
    Bustillo, Juan
    Jiang, Rongtao
    Lin, Dongdong
    Turner, Jessica A.
    Damaraju, Eswar
    Mayer, Andrew R.
    Cui, Yue
    Fu, Zening
    Du, Yuhui
    Chen, Jiayu
    Potkin, Steven G.
    Preda, Adrian
    Mathalon, Daniel H.
    Ford, Judith M.
    Voyvodic, James
    Mueller, Bryon A.
    Belger, Aysenil
    McEwen, Sarah C.
    O'Leary, Daniel S.
    McMahon, Agnes
    Jiang, Tianzi
    Calhoun, Vince D.
    NATURE COMMUNICATIONS, 2018, 9
  • [44] Multimodal neuromarkers in schizophrenia via cognition-guided MRI fusion
    Jing Sui
    Shile Qi
    Theo G. M. van Erp
    Juan Bustillo
    Rongtao Jiang
    Dongdong Lin
    Jessica A. Turner
    Eswar Damaraju
    Andrew R. Mayer
    Yue Cui
    Zening Fu
    Yuhui Du
    Jiayu Chen
    Steven G. Potkin
    Adrian Preda
    Daniel H. Mathalon
    Judith M. Ford
    James Voyvodic
    Bryon A. Mueller
    Aysenil Belger
    Sarah C. McEwen
    Daniel S. O’Leary
    Agnes McMahon
    Tianzi Jiang
    Vince D. Calhoun
    Nature Communications, 9
  • [45] SPECTRAL MULTI-SCALE ANALYSIS FOR MULTI-PITCH TRACKING
    Ben Messaoud, Mohamed Anouar
    Bouzid, Aicha
    Ellouze, Noureddine
    2009 IEEE 13TH DIGITAL SIGNAL PROCESSING WORKSHOP & 5TH IEEE PROCESSING EDUCATION WORKSHOP, VOLS 1 AND 2, PROCEEDINGS, 2009, : 26 - 31
  • [46] Rhythm and pitch in music cognition
    Krumhansl, CL
    PSYCHOLOGICAL BULLETIN, 2000, 126 (01) : 159 - 179
  • [47] Cochannel Speech Separation Using Multi-pitch Estimation and Model Based Voiced Sequential Grouping
    Li, Ming
    Cao, Chuan
    Wang, Di
    Lu, Ping
    Fu, Qiang
    Yan, Yonghong
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 151 - 154
  • [48] The Harmonic Shift Algorithm for Efficient Multi-Pitch Detection
    Grinewitschus, Lukas
    Jung, Peter
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 548 - 561
  • [49] RETRACTED: Extraction of Music Main Melody and Multi-Pitch Estimation Method Based on Support Vector Machine in Big Data Environment (Retracted Article)
    Liang, Shaoru
    Shu, Ran
    JOURNAL OF ENVIRONMENTAL AND PUBLIC HEALTH, 2022, 2022
  • [50] A novel cognition-guided neuro-feedback treatment for methamphetamine addiction
    Bu, Junjie
    Cheng, Yan
    Gou, Huixing
    Li, Jian
    Zhang, Hao
    Zhang, Xiaochu
    INTERNATIONAL JOURNAL OF PSYCHOLOGY, 2023, 58 : 22 - 22