FREQUENCY-ANCHORED DEEP NETWORKS FOR POLYPHONIC MELODY EXTRACTION

Authors
Sharma, Aman Kumar [1 ]
Saxena, Kavya Ranjan [2 ]
Arora, Vipul [2 ]
Affiliations
[1] Cisco Syst, MIG Routing, Bangalore, Karnataka, India
[2] Indian Inst Technol, Dept Elect Engn, Kanpur, Uttar Pradesh, India
Keywords
Melody extraction; music information retrieval; pitch shifting; constant-Q transform (CQT); deep neural network; MUSIC AUDIO; IDENTIFICATION; RETRIEVAL
DOI
10.1109/NCC52529.2021.9530037
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Extracting the predominant melodic line from polyphonic audio, in which more than one source plays simultaneously, is a challenging task in music information retrieval. The proposed method aims to estimate fine-grained F0s rather than coarse notes while still using deep classifiers. Frequency-anchored input features extracted from the constant-Q transform make the signatures of the melody independent of F0. The scheme also addresses the data imbalance problem across classes, since it uses only two or three output classes rather than a large number of notes. Experimental evaluation shows that the proposed method outperforms a state-of-the-art deep learning-based melody estimation method.
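
The F0-independence claim rests on a general property of the log-frequency axis used by the constant-Q transform: the k-th harmonic of any F0 sits a fixed number of bins above the F0 bin. The following is a minimal sketch of that property only, not the authors' feature pipeline. The resolution `BINS_PER_OCTAVE`, the lowest frequency `F_MIN`, and the helper names are assumptions chosen for illustration.

```python
import numpy as np

BINS_PER_OCTAVE = 60   # assumed CQT resolution, not taken from the paper
F_MIN = 55.0           # assumed lowest CQT centre frequency in Hz


def cqt_bin(freq_hz):
    """Fractional bin index of a frequency on a log2 (constant-Q) axis."""
    return BINS_PER_OCTAVE * np.log2(np.asarray(freq_hz, dtype=float) / F_MIN)


def harmonic_offsets(f0_hz, n_harmonics=5):
    """Bin positions of the first harmonics, measured relative to the F0 bin."""
    harmonics = f0_hz * np.arange(1, n_harmonics + 1)
    return cqt_bin(harmonics) - cqt_bin(f0_hz)


# Two different F0s give the same harmonic pattern once anchored at F0.
offsets_a = harmonic_offsets(220.0)
offsets_b = harmonic_offsets(311.1)
same_pattern = np.allclose(offsets_a, offsets_b)
```

Because the offsets depend only on the harmonic number, a feature window anchored at a candidate frequency looks the same wherever that candidate lies, which is consistent with a classifier needing only a few output classes instead of one class per note.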
Pages: 452-456
Number of pages: 5