Audio Fingerprint Extraction Based on Locally Linear Embedding for Audio Retrieval System

被引:7
|
作者
Jia, Maoshen [1 ]
Li, Tianhao [1 ]
Wang, Jing [2 ]
机构
[1] Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Computat Intelligence & Intellige, Beijing 100124, Peoples R China
[2] Beijing Inst Technol, Sch Informat & Elect, Beijing 100081, Peoples R China
基金
中国国家自然科学基金;
关键词
audio fingerprint; sub-regions; dimensionality reduction; audio retrieval;
D O I
10.3390/electronics9091483
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the appearance of a large amount of audio data, people have a higher demand for audio retrieval, which can quickly and accurately find the required information. Audio fingerprint retrieval is a popular choice because of its excellent performance. However, there is a problem about the large amount of audio fingerprint data in the existing audio fingerprint retrieval method which takes up more storage space and affects the retrieval speed. Aiming at the problem, this paper presents a novel audio fingerprinting method based on locally linear embedding (LLE) that has smaller fingerprints and the retrieval is more efficient. The proposed audio fingerprint extraction divides the bands around each peak in the frequency domain into four groups of sub-regions and the energy of every sub-region is computed. Then the LLE is performed for each group, respectively, and the audio fingerprint is encoded by comparing adjacent energies. To solve the distortion of linear speed changes, a matching strategy based on dynamic time warping (DTW) is adopted in the retrieval part which can compare two audio segments with different lengths. To evaluate the retrieval performance of the proposed method, the experiments are carried out under different conditions of single and multiple groups' dimensionality reduction. Both of them can achieve a high recall and precision rate and has a better retrieval efficiency with less data compared with some state-of-the-art methods.
引用
收藏
页码:1 / 15
页数:15
相关论文
共 50 条
  • [11] A POWER MASK BASED AUDIO FINGERPRINT
    Coover, Bob
    Han, Jinyu
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [12] A TV Commercial Retrieval System based on Audio Features
    Borras, Jose E.
    Igual, Jorge
    Fernandez-Llatas, Carlos
    Traver, Vicente
    PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND MULTIMEDIA APPLICATIONS (SIGMAP 2013), 2013, : 65 - 70
  • [13] Fingerprint Extraction of Audio Signal using Wavelet Transform
    Kamaladas, M. Davidson
    Dialin, M. Maxina
    INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, IMAGE PROCESSING AND PATTERN RECOGNITION (ICSIPR 2013), 2013, : 308 - 312
  • [14] Anime Audio Retrieval Based on Audio Separation and Feature Recognition
    Li, De
    Xu, Wenying
    Jin, Xun
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2024, 2024
  • [15] Audio Watermarking Based on Vector Quantization Index Modulation Using Audio Fingerprint
    Nakaya, Shogo
    Wada, Shigeo
    ELECTRONICS AND COMMUNICATIONS IN JAPAN, 2015, 98 (03) : 14 - 23
  • [16] Audio fingerprint retrieval method using anti-fingerprint and frequency domain segmentation
    Chen, Shuli
    Zhang, Xueshuai
    Zhang, Pengyuan
    Liu, Jian
    Shengxue Xuebao/Acta Acustica, 2022, 47 (04): : 531 - 540
  • [17] Audio fingerprint retrieval algorithm using anti-fingerprint and frequency domain segmentation
    CHEN Shuli
    ZHANG Xueshuai
    ZHANG Pengyuan
    LIU Jian
    Chinese Journal of Acoustics, 2023, 42 (01) : 82 - 97
  • [18] Audio fingerprint matching based on a power weight
    Seo, Jin Soo
    Kim, Junghyun
    Kim, Hyemi
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2019, 38 (06): : 716 - 723
  • [19] Hierarchical system for content-based audio classification and retrieval
    Zhang, T
    Kuo, CCJ
    MULTIMEDIA STORAGE AND ARCHIVING SYSTEMS III, 1998, 3527 : 398 - 409
  • [20] Audio Retrieval Based on Wavelet Transform
    Chen, Deqin
    Zhang, Wenhui
    Zhang, Zhibo
    Huang, Wei
    Ao, Jia
    2017 16TH IEEE/ACIS INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE (ICIS 2017), 2017, : 531 - 534