Audio Fingerprint Extraction Based on Locally Linear Embedding for Audio Retrieval System

被引:7
|
作者
Jia, Maoshen [1 ]
Li, Tianhao [1 ]
Wang, Jing [2 ]
机构
[1] Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Computat Intelligence & Intellige, Beijing 100124, Peoples R China
[2] Beijing Inst Technol, Sch Informat & Elect, Beijing 100081, Peoples R China
基金
中国国家自然科学基金;
关键词
audio fingerprint; sub-regions; dimensionality reduction; audio retrieval;
D O I
10.3390/electronics9091483
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the appearance of a large amount of audio data, people have a higher demand for audio retrieval, which can quickly and accurately find the required information. Audio fingerprint retrieval is a popular choice because of its excellent performance. However, there is a problem about the large amount of audio fingerprint data in the existing audio fingerprint retrieval method which takes up more storage space and affects the retrieval speed. Aiming at the problem, this paper presents a novel audio fingerprinting method based on locally linear embedding (LLE) that has smaller fingerprints and the retrieval is more efficient. The proposed audio fingerprint extraction divides the bands around each peak in the frequency domain into four groups of sub-regions and the energy of every sub-region is computed. Then the LLE is performed for each group, respectively, and the audio fingerprint is encoded by comparing adjacent energies. To solve the distortion of linear speed changes, a matching strategy based on dynamic time warping (DTW) is adopted in the retrieval part which can compare two audio segments with different lengths. To evaluate the retrieval performance of the proposed method, the experiments are carried out under different conditions of single and multiple groups' dimensionality reduction. Both of them can achieve a high recall and precision rate and has a better retrieval efficiency with less data compared with some state-of-the-art methods.
引用
收藏
页码:1 / 15
页数:15
相关论文
共 50 条
  • [41] Content-based retrieval of music and audio
    Foote, JT
    MULTIMEDIA STORAGE AND ARCHIVING SYSTEMS II, 1997, 3229 : 138 - 147
  • [42] Content-based classification and retrieval of audio
    Zhang, T
    Kuo, CCJ
    ADVANCED SIGNAL PROCESSING ALGORITHMS, ARCHITECTURES, AND IMPLEMENTATIONS VIII, 1998, 3461 : 432 - 443
  • [43] Audio-based Near-Duplicate Video Retrieval with Audio Similarity Learning
    Avgoustinakis, Pavlos
    Kordopatis-Zilos, Giorgos
    Papadopoulos, Symeon
    Symeonidis, Andreas L.
    Kompatsiaris, Ioannis
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 5828 - 5835
  • [44] Text-Based Audio Retrieval by Learning From Similarities Between Audio Captions
    Xie, Huang
    Khorrami, Khazar
    Rasanen, Okko
    Virtanen, Tuomas
    IEEE SIGNAL PROCESSING LETTERS, 2025, 32 : 221 - 225
  • [45] Locally linear embedding-based seismic attribute extraction and applications
    Liu Xing-Fang
    Zheng Xiao-Dong
    Xu Guang-Cheng
    Wang Ling
    Yang Hao
    APPLIED GEOPHYSICS, 2010, 7 (04) : 365 - 375
  • [46] Locally linear embedding-based seismic attribute extraction and applications
    Xing-Fang Liu
    Xiao-Dong Zheng
    Guang-Cheng Xu
    Ling Wang
    Hao Yang
    Applied Geophysics, 2010, 7 : 365 - 375
  • [47] Motion Key-Frames Extraction Based on Locally Linear Embedding
    Dong, Xulong
    Zhou, Dongsheng
    Zhang, Qiang
    MECHANICAL, ELECTRONIC AND ENGINEERING TECHNOLOGIES (ICMEET 2014), 2014, 538 : 476 - 480
  • [48] Linear Estimation Based Primary-Ambient Extraction for Stereo Audio Signals
    He, Jianjun
    Tan, Ee-Leng
    Gan, Woon-Seng
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (02) : 505 - 517
  • [49] New audio embedding technique based on neural network
    Wang, Huiqin
    Mao, Li
    Xiu, Keshan
    ICICIC 2006: FIRST INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING, INFORMATION AND CONTROL, VOL 3, PROCEEDINGS, 2006, : 459 - +
  • [50] An audio recommendation system based on audio signature description scheme in MPEG-7 Audio
    Huang, YC
    Jeng, SK
    2004 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXP (ICME), VOLS 1-3, 2004, : 639 - 642