Audio Fingerprint Extraction Based on Locally Linear Embedding for Audio Retrieval System

Cited by: 7
Authors
Jia, Maoshen [1]
Li, Tianhao [1]
Wang, Jing [2]
Affiliations
[1] Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Computat Intelligence & Intellige, Beijing 100124, Peoples R China
[2] Beijing Inst Technol, Sch Informat & Elect, Beijing 100081, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
audio fingerprint; sub-regions; dimensionality reduction; audio retrieval;
DOI
10.3390/electronics9091483
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
With the rapid growth of audio data, demand has increased for audio retrieval that finds the required information quickly and accurately. Audio fingerprint retrieval is a popular choice because of its excellent performance. However, existing audio fingerprint retrieval methods generate a large amount of fingerprint data, which takes up more storage space and slows retrieval. To address this problem, this paper presents a novel audio fingerprinting method based on locally linear embedding (LLE) that produces smaller fingerprints and retrieves more efficiently. The proposed fingerprint extraction divides the bands around each peak in the frequency domain into four groups of sub-regions and computes the energy of every sub-region. LLE is then performed on each group separately, and the audio fingerprint is encoded by comparing adjacent energies. To handle the distortion caused by linear speed changes, the retrieval stage adopts a matching strategy based on dynamic time warping (DTW), which can compare two audio segments of different lengths. To evaluate the retrieval performance of the proposed method, experiments are carried out with dimensionality reduction applied to a single group and to multiple groups. Both configurations achieve high recall and precision, and offer better retrieval efficiency with less data than some state-of-the-art methods.
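Two of the steps named in the abstract, encoding by comparing adjacent energies and DTW matching of segments with different lengths, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not give the exact bit rule or the DTW local cost, so the function names `encode_bits` and `dtw_distance`, the "greater than right-hand neighbour" rule, and the absolute-difference cost are all assumptions made for illustration. The LLE step is omitted.

```python
from typing import List, Sequence


def encode_bits(energies: Sequence[float]) -> List[int]:
    """Hypothetical bit rule: 1 where an energy exceeds its right-hand
    neighbour, else 0. The paper's actual rule may differ."""
    return [1 if a > b else 0 for a, b in zip(energies, energies[1:])]


def dtw_distance(x: Sequence[float], y: Sequence[float]) -> float:
    """Textbook dynamic time warping, O(len(x) * len(y)) time.

    The local cost is the absolute difference (an assumption); the
    sequences may have different lengths.
    """
    n, m = len(x), len(y)
    inf = float("inf")
    # cost[i][j] = minimal cumulative cost of aligning x[:i] with y[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(x[i - 1] - y[j - 1])
            # Extend the cheapest of: step in x only, step in y only, step in both
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m]
```

For instance, `dtw_distance([0, 1, 2], [0, 0, 1, 1, 2, 2])` aligns a sequence with a copy played at half speed, which is the kind of linear speed change the abstract says DTW is meant to absorb.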
Pages: 1 / 15
Page count: 15
Related papers
50 in total
  • [1] NEURAL AUDIO FINGERPRINT FOR HIGH-SPECIFIC AUDIO RETRIEVAL BASED ON CONTRASTIVE LEARNING
    Chang, Sungkyun
    Lee, Donmoon
    Park, Jeongsoo
    Lim, Hyungui
    Lee, Kyogu
    Ko, Karam
    Han, Yoonchang
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 3025 - 3029
  • [2] Improved Algorithms of Music Information Retrieval based on Audio Fingerprint
    Jie, Tang
    Gang, Liu
    Jun, Guo
    IITAW: 2009 THIRD INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATIONS WORKSHOPS, 2009, : 367 - 371
  • [3] A Music Identification System Based On Audio Fingerprint
    Fan, Yong
    Feng, Shuang
    2016 4TH INTL CONF ON APPLIED COMPUTING AND INFORMATION TECHNOLOGY/3RD INTL CONF ON COMPUTATIONAL SCIENCE/INTELLIGENCE AND APPLIED INFORMATICS/1ST INTL CONF ON BIG DATA, CLOUD COMPUTING, DATA SCIENCE & ENGINEERING (ACIT-CSII-BCD), 2016, : 363 - 367
  • [4] Audio fingerprint extraction for content identification
    Shiu, Y
    Yeh, CH
    Kuo, CCJ
    INTERNET MULTIMEDIA MANAGEMENT SYSTEMS IV, 2003, 5242 : 55 - 64
  • [5] A robust audio fingerprint extraction algorithm
    Lebosse, Jerome
    Brun, Luc
    Pailles, Jean Claude
    PROCEEDINGS OF THE FOURTH IASTED INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, PATTERN RECOGNITION, AND APPLICATIONS, 2007, : 269 - +
  • [6] Speaker identification based text to audio alignment for an audio retrieval system
    Roy, D
    Malamud, C
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1099 - 1102
  • [7] Audio Fingerprint Extraction Based on Time-Frequency Domain
    Liu, Zhengzheng
    Li, Cong
    Cao, Sanxing
    2016 2ND IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2016, : 1975 - 1979
  • [8] An Efficient Audio Fingerprint Search Algorithm for Music Retrieval
    Lee, Sunhyung
    Yook, Dongsuk
    Chang, Sukmoon
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2013, 59 (03) : 652 - 656
  • [9] Wavelet-based audio embedding and audio/video compression
    Mendenhall, MJ
    Claypoole, RL
    APPLICATIONS OF DIGITAL IMAGE PROCESSING XXIV, 2001, 4472 : 413 - 423
  • [10] Audio Fingerprint Retrieval Method Based on Feature Dimension Reduction and Feature Combination
    Zhang, Qiu-yu
    Xu, Fu-jiu
    Bai, Jian
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2021, 15 (02) : 522 - 539