Accuracy comparisons of fingerprint based song recognition approaches using very high granularity

被引:1
|
作者
Serrano, Salvatore [1 ]
Scarpa, Marco [1 ]
机构
[1] Univ Messina, Dept Engn, Cda Dio Villaggio S Agata, I-98166 Messina, ME, Italy
关键词
Song recognition; Audio fingerprint; Power spectral density; Hamming distance; Binary fingerprints; AUDIO; RETRIEVAL;
D O I
10.1007/s11042-023-14787-2
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Music and song recognition is an activity of wide interest for researchers and companies due to the intrinsic challenges and the possible economical profits it can give. Despite basic algorithms about song recognition are simple in principle, it is quite difficult to obtain an efficient and robust approach able to generate an effective algorithm for identifying short piece of audio on the fly. In this paper, we compare the results obtained using a new algorithm we recently proposed against several baseline approaches in terms of accuracy when very short pieces of audio are processed. Experimental results, performed using both a subset of the MTG-Jamendo dataset and a proprietary audio corpus containing 7000 songs, show our approach outperform the others in particular for excerpts of audio shorter than 3s.
引用
收藏
页码:31591 / 31606
页数:16
相关论文
共 4 条
  • [1] Accuracy comparisons of fingerprint based song recognition approaches using very high granularity
    Salvatore Serrano
    Marco Scarpa
    Multimedia Tools and Applications, 2023, 82 : 31591 - 31606
  • [2] Multimodal speech recognition: increasing accuracy using high speed video data
    Ivanko, Denis
    Karpov, Alexey
    Fedotov, Dmitrii
    Kipyatkova, Irina
    Ryumin, Dmitry
    Ivanko, Dmitriy
    Minker, Wolfgang
    Zelezny, Milos
    JOURNAL ON MULTIMODAL USER INTERFACES, 2018, 12 (04) : 319 - 328
  • [3] Supervised Classification of Very High Resolution Optical Images Using Wavelet-Based Textural Features
    Regniers, Olivier
    Bombrun, Lionel
    Lafon, Virginie
    Germain, Christian
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2016, 54 (06): : 3722 - 3735
  • [4] Machine learning based approaches for clinical and non-clinical depression recognition and depression relapse prediction using audiovisual and EEG modalities: A comprehensive review
    Yasin, Sana
    Othmani, Alice
    Raza, Imran
    Hussain, Syed Asad
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 159