FREQUENCY-ANCHORED DEEP NETWORKS FOR POLYPHONIC MELODY EXTRACTION

被引:0
|
作者
Sharma, Aman Kumar [1 ]
Saxena, Kavya Ranjan [2 ]
Arora, Vipul [2 ]
机构
[1] Cisco Syst, MIG Routing, Bangalore, Karnataka, India
[2] Indian Inst Technol, Dept Elect Engn, Kanpur, Uttar Pradesh, India
关键词
Melody extraction; music information retrieval; pitch shifting; constant Q-transform(CQT); Deep neural network; MUSIC AUDIO; IDENTIFICATION; RETRIEVAL;
D O I
10.1109/NCC52529.2021.9530037
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Extraction of the predominant melodic line from polyphonic audio containing more than one source playing simultaneously is a challenging task in the field of music information retrieval. The proposed method aims at providing finer FOs, and not coarse notes while using deep classifiers. Frequency-anchored input features extracted from constant Q-transform allow the signatures of melody to be independent of F0. The proposed scheme also takes care of the data imbalance problem across classes, as it uses only two or three output classes as opposed to a large number of notes. Experimental evaluation shows the proposed method outperforms a state-of-the-art deep learning-based melody estimation method.
引用
收藏
页码:452 / 456
页数:5
相关论文
共 50 条
  • [41] Discriminative Feature Extraction with Deep Neural Networks
    Stuhlsatz, Andre
    Lippel, Jens
    Zielke, Thomas
    2010 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS IJCNN 2010, 2010,
  • [42] Deep Convolutional Neural Networks for Predominant Instrument Recognition in Polyphonic Music Using Discrete Wavelet Transform
    Dash, Sukanta Kumar
    Solanki, S. S.
    Chakraborty, Soubhik
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2024, 43 (7) : 4239 - 4271
  • [43] Deep Neural Networks for Web Page Information Extraction
    Gogar, Tomas
    Hubacek, Ondrej
    Sedivy, Jan
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, AIAI 2016, 2016, 475 : 154 - 163
  • [44] Automatic Document Metadata Extraction Based on Deep Networks
    Liu, Runtao
    Gao, Liangcai
    An, Dong
    Jiang, Zhuoren
    Tang, Zhi
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2017, 2018, 10619 : 305 - 317
  • [45] Context extraction module for deep convolutional neural networks
    Singh, Pravendra
    Mazumder, Pratik
    Namboodiri, Vinay P.
    PATTERN RECOGNITION, 2022, 122
  • [46] Ontology Concept Extraction Algorithm for Deep Neural Networks
    Ponomarev, Andrew
    Agafonov, Anton
    2022 32ND CONFERENCE OF OPEN INNOVATIONS ASSOCIATION (FRUCT), 2022, : 221 - 226
  • [47] DeepRED - Rule Extraction from Deep Neural Networks
    Zilke, Jan Ruben
    Mencia, Eneldo Loza
    Janssen, Frederik
    DISCOVERY SCIENCE, (DS 2016), 2016, 9956 : 457 - 473
  • [48] Topology Reduction in Deep Convolutional Feature Extraction Networks
    Wiatowski, Thomas
    Grohs, Philipp
    Bolcskei, Helmut
    WAVELETS AND SPARSITY XVII, 2017, 10394
  • [49] Blind Source Separation in Polyphonic Music Recordings Using Deep Neural Networks Trained via Policy Gradients
    Schulze, Soeren
    Leuschner, Johannes
    King, Emily J.
    SIGNALS, 2021, 2 (04): : 637 - 661
  • [50] Modeling Temporal Tonal Relations in Polyphonic Music through Deep Networks with a Novel Image-Based Representation
    Chuan, Ching-Hua
    Herremans, Dorien
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 2159 - 2166