FREQUENCY-ANCHORED DEEP NETWORKS FOR POLYPHONIC MELODY EXTRACTION

被引:0
|
作者
Sharma, Aman Kumar [1 ]
Saxena, Kavya Ranjan [2 ]
Arora, Vipul [2 ]
机构
[1] Cisco Syst, MIG Routing, Bangalore, Karnataka, India
[2] Indian Inst Technol, Dept Elect Engn, Kanpur, Uttar Pradesh, India
关键词
Melody extraction; music information retrieval; pitch shifting; constant Q-transform(CQT); Deep neural network; MUSIC AUDIO; IDENTIFICATION; RETRIEVAL;
D O I
10.1109/NCC52529.2021.9530037
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Extraction of the predominant melodic line from polyphonic audio containing more than one source playing simultaneously is a challenging task in the field of music information retrieval. The proposed method aims at providing finer FOs, and not coarse notes while using deep classifiers. Frequency-anchored input features extracted from constant Q-transform allow the signatures of melody to be independent of F0. The proposed scheme also takes care of the data imbalance problem across classes, as it uses only two or three output classes as opposed to a large number of notes. Experimental evaluation shows the proposed method outperforms a state-of-the-art deep learning-based melody estimation method.
引用
收藏
页码:452 / 456
页数:5
相关论文
共 50 条
  • [31] Predominant Melody Extraction from Vocal Polyphonic Music Signal by Time-Domain Adaptive Filtering-Based Method
    M. Gurunath Reddy
    K. Sreenivasa Rao
    Circuits, Systems, and Signal Processing, 2018, 37 : 2911 - 2933
  • [32] Predominant Melody Extraction from Vocal Polyphonic Music Signal by Time-Domain Adaptive Filtering-Based Method
    Reddy, M. Gurunath
    Rao, K. Sreenivasa
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2018, 37 (07) : 2911 - 2933
  • [33] Polyphonic Sound Event Detection Using Multi Label Deep Neural Networks
    Cakir, Emre
    Heittola, Toni
    Huttunen, Heikki
    Virtanen, Tuomas
    2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015,
  • [34] Between Homomorphic Signal Processing and Deep Neural Networks: Constructing Deep Algorithms for Polyphonic Music Transcription
    Su, Li
    2017 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC 2017), 2017, : 884 - 891
  • [35] Feature Extraction Using Time-Frequency Analysis for Monophonic-Polyphonic Wheeze Discrimination
    Ulukaya, Sezer
    Sen, Ipek
    Kahya, Yasemin P.
    2015 37TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2015, : 5412 - 5415
  • [36] DEEP CONVOLUTIONAL AND RECURRENT NETWORKS FOR POLYPHONIC INSTRUMENT CLASSIFICATION FROM MONOPHONIC RAW AUDIO WAVEFORMS
    Avramidis, Kleanthis
    Kratimenos, Agelos
    Garoufis, Christos
    Zlatintsi, Athanasia
    Maragos, Petros
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 3010 - 3014
  • [37] Extraction of the Sivers function with deep neural networks
    Fernando, I. P.
    Keller, D.
    PHYSICAL REVIEW D, 2023, 108 (05)
  • [38] Deep neural networks for climate relation extraction
    Zheng, J.
    Wang, J.
    Chen, S.
    Li, J.
    Chen, Y.
    Li, B.
    GLOBAL NEST JOURNAL, 2021, 23 (04): : 544 - 549
  • [39] Deep neural networks for Arabic information extraction
    Saadi, Abdelhalim
    Belhadef, Hacene
    SMART AND SUSTAINABLE BUILT ENVIRONMENT, 2020, 9 (04) : 467 - 482
  • [40] Deep Neural Networks for Automated Metadata Extraction
    El Omari, Abdellah
    Antari, Jilali
    Elkina, Hamza
    DIGITAL TECHNOLOGIES AND APPLICATIONS, ICDTA 2024, VOL 4, 2024, 1101 : 64 - 74