Audio Fingerprint Extraction Based on Locally Linear Embedding for Audio Retrieval System

被引：7

作者：

Jia, Maoshen ^{[1
]}

Li, Tianhao ^{[1
]}

Wang, Jing ^{[2
]}

机构：

[1] Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Computat Intelligence & Intellige, Beijing 100124, Peoples R China

[2] Beijing Inst Technol, Sch Informat & Elect, Beijing 100081, Peoples R China

来源：

ELECTRONICS | 2020年 / 9卷 / 09期

基金：

中国国家自然科学基金;

关键词：

audio fingerprint; sub-regions; dimensionality reduction; audio retrieval;

D O I：

10.3390/electronics9091483

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

With the appearance of a large amount of audio data, people have a higher demand for audio retrieval, which can quickly and accurately find the required information. Audio fingerprint retrieval is a popular choice because of its excellent performance. However, there is a problem about the large amount of audio fingerprint data in the existing audio fingerprint retrieval method which takes up more storage space and affects the retrieval speed. Aiming at the problem, this paper presents a novel audio fingerprinting method based on locally linear embedding (LLE) that has smaller fingerprints and the retrieval is more efficient. The proposed audio fingerprint extraction divides the bands around each peak in the frequency domain into four groups of sub-regions and the energy of every sub-region is computed. Then the LLE is performed for each group, respectively, and the audio fingerprint is encoded by comparing adjacent energies. To solve the distortion of linear speed changes, a matching strategy based on dynamic time warping (DTW) is adopted in the retrieval part which can compare two audio segments with different lengths. To evaluate the retrieval performance of the proposed method, the experiments are carried out under different conditions of single and multiple groups' dimensionality reduction. Both of them can achieve a high recall and precision rate and has a better retrieval efficiency with less data compared with some state-of-the-art methods.

引用

页码：1 / 15

页数：15

共 50 条

[11] A POWER MASK BASED AUDIO FINGERPRINT
Coover, Bob
Han, Jinyu
2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
[12] A TV Commercial Retrieval System based on Audio Features
Borras, Jose E.
Igual, Jorge
Fernandez-Llatas, Carlos
Traver, Vicente
PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND MULTIMEDIA APPLICATIONS (SIGMAP 2013), 2013, : 65 - 70
[13] Fingerprint Extraction of Audio Signal using Wavelet Transform
Kamaladas, M. Davidson
Dialin, M. Maxina
INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, IMAGE PROCESSING AND PATTERN RECOGNITION (ICSIPR 2013), 2013, : 308 - 312
[14] Anime Audio Retrieval Based on Audio Separation and Feature Recognition
Li, De
Xu, Wenying
Jin, Xun
INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2024, 2024
[15] Audio Watermarking Based on Vector Quantization Index Modulation Using Audio Fingerprint
Nakaya, Shogo
Wada, Shigeo
ELECTRONICS AND COMMUNICATIONS IN JAPAN, 2015, 98 (03) : 14 - 23
[16] Audio fingerprint retrieval method using anti-fingerprint and frequency domain segmentation
Chen, Shuli
Zhang, Xueshuai
Zhang, Pengyuan
Liu, Jian
Shengxue Xuebao/Acta Acustica, 2022, 47 (04): : 531 - 540
[17] Audio fingerprint retrieval algorithm using anti-fingerprint and frequency domain segmentation
CHEN Shuli
ZHANG Xueshuai
ZHANG Pengyuan
LIU Jian
Chinese Journal of Acoustics, 2023, 42 (01) : 82 - 97
[18] Audio fingerprint matching based on a power weight
Seo, Jin Soo
Kim, Junghyun
Kim, Hyemi
JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2019, 38 (06): : 716 - 723
[19] Hierarchical system for content-based audio classification and retrieval
Zhang, T
Kuo, CCJ
MULTIMEDIA STORAGE AND ARCHIVING SYSTEMS III, 1998, 3527 : 398 - 409
[20] Audio Retrieval Based on Wavelet Transform
Chen, Deqin
Zhang, Wenhui
Zhang, Zhibo
Huang, Wei
Ao, Jia
2017 16TH IEEE/ACIS INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE (ICIS 2017), 2017, : 531 - 534

← 1 2 3 4 5 →