Audio Fingerprint Extraction Based on Locally Linear Embedding for Audio Retrieval System

被引：7

作者：

Jia, Maoshen ^{[1
]}

Li, Tianhao ^{[1
]}

Wang, Jing ^{[2
]}

机构：

[1] Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Computat Intelligence & Intellige, Beijing 100124, Peoples R China

[2] Beijing Inst Technol, Sch Informat & Elect, Beijing 100081, Peoples R China

来源：

ELECTRONICS | 2020年 / 9卷 / 09期

基金：

中国国家自然科学基金;

关键词：

audio fingerprint; sub-regions; dimensionality reduction; audio retrieval;

D O I：

10.3390/electronics9091483

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

With the appearance of a large amount of audio data, people have a higher demand for audio retrieval, which can quickly and accurately find the required information. Audio fingerprint retrieval is a popular choice because of its excellent performance. However, there is a problem about the large amount of audio fingerprint data in the existing audio fingerprint retrieval method which takes up more storage space and affects the retrieval speed. Aiming at the problem, this paper presents a novel audio fingerprinting method based on locally linear embedding (LLE) that has smaller fingerprints and the retrieval is more efficient. The proposed audio fingerprint extraction divides the bands around each peak in the frequency domain into four groups of sub-regions and the energy of every sub-region is computed. Then the LLE is performed for each group, respectively, and the audio fingerprint is encoded by comparing adjacent energies. To solve the distortion of linear speed changes, a matching strategy based on dynamic time warping (DTW) is adopted in the retrieval part which can compare two audio segments with different lengths. To evaluate the retrieval performance of the proposed method, the experiments are carried out under different conditions of single and multiple groups' dimensionality reduction. Both of them can achieve a high recall and precision rate and has a better retrieval efficiency with less data compared with some state-of-the-art methods.

引用

页码：1 / 15

页数：15

共 50 条

[1] NEURAL AUDIO FINGERPRINT FOR HIGH-SPECIFIC AUDIO RETRIEVAL BASED ON CONTRASTIVE LEARNING
Chang, Sungkyun
Lee, Donmoon
Park, Jeongsoo
Lim, Hyungui
Lee, Kyogu
Ko, Karam
Han, Yoonchang
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 3025 - 3029
[2] Improved Algorithms of Music Information Retrieval based on Audio Fingerprint
Jie, Tang
Gang, Liu
Jun, Guo
IITAW: 2009 THIRD INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATIONS WORKSHOPS, 2009, : 367 - 371
[3] A Music Identification System Based On Audio Fingerprint
Fan, Yong
Feng, Shuang
2016 4TH INTL CONF ON APPLIED COMPUTING AND INFORMATION TECHNOLOGY/3RD INTL CONF ON COMPUTATIONAL SCIENCE/INTELLIGENCE AND APPLIED INFORMATICS/1ST INTL CONF ON BIG DATA, CLOUD COMPUTING, DATA SCIENCE & ENGINEERING (ACIT-CSII-BCD), 2016, : 363 - 367
[4] Audio fingerprint extraction for content identification
Shiu, Y
Yeh, CH
Kuo, CCJ
INTERNET MULTIMEDIA MANAGEMENT SYSTEMS IV, 2003, 5242 : 55 - 64
[5] A robust audio fingerprint extraction algorithm
Lebosse, Jerome
Brun, Luc
Pailles, Jean Claude
PROCEEDINGS OF THE FOURTH IASTED INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, PATTERN RECOGNITION, AND APPLICATIONS, 2007, : 269 - +
[6] Speaker identification based text to audio alignment for an audio retrieval system
Roy, D
Malamud, C
1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1099 - 1102
[7] Audio Fingerprint Extraction Based on Time-Frequency Domain
Liu, Zhengzheng
Li, Cong
Cao, Sanxing
2016 2ND IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2016, : 1975 - 1979
[8] An Efficient Audio Fingerprint Search Algorithm for Music Retrieval
Lee, Sunhyung
Yook, Dongsuk
Chang, Sukmoon
IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2013, 59 (03) : 652 - 656
[9] Wavelet-based audio embedding and audio/video compression
Mendenhall, MJ
Claypoole, RL
APPLICATIONS OF DIGITAL IMAGE PROCESSING XXIV, 2001, 4472 : 413 - 423
[10] Audio Fingerprint Retrieval Method Based on Feature Dimension Reduction and Feature Combination
Zhang, Qiu-yu
Xu, Fu-jiu
Bai, Jian
KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2021, 15 (02): : 522 - 539

← 1 2 3 4 5 →