Audio Fingerprint Extraction Based on Locally Linear Embedding for Audio Retrieval System

Cited by: 7
Authors
Jia, Maoshen [1]
Li, Tianhao [1]
Wang, Jing [2]
Affiliations
[1] Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Computat Intelligence & Intellige, Beijing 100124, Peoples R China
[2] Beijing Inst Technol, Sch Informat & Elect, Beijing 100081, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
audio fingerprint; sub-regions; dimensionality reduction; audio retrieval;
DOI
10.3390/electronics9091483
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
With the rapid growth of audio data, there is increasing demand for audio retrieval that can find the required information quickly and accurately. Audio fingerprint retrieval is a popular choice because of its excellent performance. However, existing audio fingerprint retrieval methods generate a large amount of fingerprint data, which takes up more storage space and slows retrieval. To address this problem, this paper presents a novel audio fingerprinting method based on locally linear embedding (LLE) that produces smaller fingerprints and retrieves more efficiently. The proposed fingerprint extraction divides the bands around each peak in the frequency domain into four groups of sub-regions and computes the energy of every sub-region. LLE is then performed on each group separately, and the audio fingerprint is encoded by comparing adjacent energies. To handle the distortion caused by linear speed changes, the retrieval stage adopts a matching strategy based on dynamic time warping (DTW), which can compare two audio segments of different lengths. To evaluate the retrieval performance of the proposed method, experiments are carried out with dimensionality reduction applied to a single group and to multiple groups. Both settings achieve high recall and precision, and offer better retrieval efficiency with less data than some state-of-the-art methods.
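The abstract states that DTW lets two segments of different lengths be compared, which is what makes the matching tolerant of linear speed changes. The following is a minimal sketch of textbook DTW on plain numeric sequences, not the paper's actual matching strategy; the function name `dtw_distance`, the absolute-difference local cost, and the toy sequences are all assumptions made for illustration.

```python
def dtw_distance(a, b):
    """Textbook dynamic time warping distance between two 1-D sequences.

    Assumption: the local cost is the absolute difference between elements.
    The paper's own cost function and constraints are not given in the abstract.
    """
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] holds the minimal accumulated cost of aligning a[:i] with b[:j].
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(a[i - 1] - b[j - 1])
            # Extend the cheapest of the three allowed predecessor alignments.
            cost[i][j] = local + min(
                cost[i - 1][j],      # a advances, b stays
                cost[i][j - 1],      # b advances, a stays
                cost[i - 1][j - 1],  # both advance
            )
    return cost[n][m]


# Hypothetical toy data: the second sequence is the first played at half speed.
query = [1, 2, 3, 4]
stretched = [1, 1, 2, 2, 3, 3, 4, 4]
distance_same_content = dtw_distance(query, stretched)
distance_different = dtw_distance([1, 2, 3], [1, 2, 4])
```

In this toy case the time-stretched copy aligns with zero accumulated cost even though it is twice as long, whereas a sequence with a differing value does not.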
Pages: 1-15
Number of pages: 15
Related papers
50 in total
  • [21] Movie Audio Retrieval Based on HCT
    Ding, Xiang-rong
    Yang, Ji-chen
    MECHATRONICS AND INDUSTRIAL INFORMATICS, PTS 1-4, 2013, 321-324 : 1129 - +
  • [22] Audio Retrieval Based on Perceptual Similarity
    Zhang, Teng
    Wu, Ji
    Wang, Dingding
    Li, Tao
    2014 INTERNATIONAL CONFERENCE ON COLLABORATIVE COMPUTING: NETWORKING, APPLICATIONS AND WORKSHARING (COLLABORATECOM), 2014, : 342 - 348
  • [23] An audio representation for content based retrieval
    Melih, K
    Gonzalez, R
    Ogunbona, P
    IEEE TENCON'97 - IEEE REGIONAL 10 ANNUAL CONFERENCE, PROCEEDINGS, VOLS 1 AND 2: SPEECH AND IMAGE TECHNOLOGIES FOR COMPUTING AND TELECOMMUNICATIONS, 1997, : 207 - 210
  • [24] Audio Retrieval Based on Manifold Ranking
    Qin, Jing
    Liu, Xinyue
    Lin, Hongfei
    2014 SIXTH INTERNATIONAL SYMPOSIUM ON PARALLEL ARCHITECTURES, ALGORITHMS AND PROGRAMMING (PAAP), 2014, : 187 - 190
  • [25] PEAK-BASED PHILIPS FINGERPRINT ROBUST TO PITCH-SHIFT FOR MASSIVE AUDIO RETRIEVAL
    Chu, Renjie
    Niu, Baoning
    Yao, Shanshan
    Liu, Jianquan
    2019 IEEE FIFTH INTERNATIONAL CONFERENCE ON MULTIMEDIA BIG DATA (BIGMM 2019), 2019, : 314 - 320
  • [26] Daubechies Wavelets Based Robust Audio Fingerprinting for Content-Based Audio Retrieval
    Sun, Wei
    Lu, Zhe-Ming
    Yu, Fa-Xin
    Shen, Rong-Jun
    INTERNATIONAL JOURNAL OF DIGITAL CRIME AND FORENSICS, 2012, 4 (02) : 49 - 69
  • [27] Feature extraction of hyperspectral image based on locally linear embedding
    Dong, Chao
    Zhao, Huijie
    Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics, 2010, 36 (08): : 957 - 960
  • [28] Fault feature extraction based on improved locally linear embedding
    Hu, Feng
    Su, Xun
    Liu, Wei
    Wu, Yu-Chuan
    Fan, Liang-Zhi
    Zhendong yu Chongji/Journal of Vibration and Shock, 2015, 34 (15): : 201 - 204
  • [29] CROSS MODAL AUDIO SEARCH AND RETRIEVAL WITH JOINT EMBEDDINGS BASED ON TEXT AND AUDIO
    Elizalde, Benjamin
    Zarar, Shuayb
    Raj, Bhiksha
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 4095 - 4099
  • [30] A Flexible and Scalable Audio Information Retrieval System for Mixed-Type Audio Signals
    Dogan, Ebru
    Sert, Mustafa
    Yazici, Adnan
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2011, 26 (10) : 952 - 970