Audio fingerprint extraction for content identification

被引：1

作者：

Shiu, Y ^{[1
]}

Yeh, CH ^{[1
]}

Kuo, CCJ ^{[1
]}

机构：

[1] Univ So Calif, Integrated Media Syst Ctr, Los Angeles, CA 90089 USA

来源：

INTERNET MULTIMEDIA MANAGEMENT SYSTEMS IV | 2003年 / 5242卷

关键词：

audio fingerprint; audio identification; zero-crossing rate; audio database management; audio processing;

D O I：

10.1117/12.511271

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this work, we present an audio content identification system that identifies some unknown audio material by comparing its fingerprint with those extracted off-line and saved in the music database. We will describe in detail the procedure to extract audio fingerprints and demonstrate that they are robust to noise and content-preserving manipulations. The main feature in the proposed system is the zero-crossing rate extracted with the octave-band filter bank. The zero-crossing rate can be used to describe the dominant frequency in each subband with a very low computational cost. The size of audio finger-print is small and can be efficiently stored along with the compressed files in the database. It is also robust to many modifications such as tempo change and time-alignment distortion. Besides, the octave-band filter bank is used to enhance the robustness to distortion, especially those localized on some frequency regions.

引用

页码：55 / 64

页数：10

共 50 条

[31] An Identification System Using Fingerprint For Live Broadcasting Content On TV
Yoon, Young-Suk
Park, Jihyun
Kim, Junghyun
Yoo, Wonyoung
18TH IEEE INTERNATIONAL SYMPOSIUM ON CONSUMER ELECTRONICS (ISCE 2014), 2014,
[32] Peak-Based Philips Fingerprint Robust to Pitch-Shift for Audio Identification
Chu, Renjie
Niu, Baoning
Yao, Shanshan
Liu, Jianquan
IEEE MULTIMEDIA, 2021, 28 (01) : 74 - 82
[33] Energy Classification-assisted Fingerprint System For Content-based Audio Copy Detection
Zhang, Yongchao
Xu, Mingxing
Pratt, Emlyn
2012 9TH INTERNATIONAL CONFERENCE ON COMMUNICATIONS (COMM), 2012, : 35 - 38
[34] Audio Event Identification in Sports Media Content: the Case of Basketball
Filippidis, Panagiotis-Marios
Vryzas, Nikolaos
Kotsakis, Rigas
Thoidis, Lordanis
Dimoulas, Charalampos
Bratsas, Charalampos
146TH AES CONVENTION, 2019,
[35] Automatic audio classification and speaker identification for video content analysis
Liu, Shu-Chang
Bi, Jing
Jia, Zhi-Qiang
Chen, Rui
Chen, Jie
Zhou, Min-Min
SNPD 2007: EIGHTH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING, AND PARALLEL/DISTRIBUTED COMPUTING, VOL 2, PROCEEDINGS, 2007, : 91 - +
[36] Audio Fingerprint Application for the Media Industry
Kusuma, Andrew Putra
Wanniarachchi, Vajisha U.
Fernando, Owen Noel Newton
Keong, Ng Wee
PROCEEDINGS OF THE 2018 ACM INTERNATIONAL JOINT CONFERENCE ON PERVASIVE AND UBIQUITOUS COMPUTING AND PROCEEDINGS OF THE 2018 ACM INTERNATIONAL SYMPOSIUM ON WEARABLE COMPUTERS (UBICOMP/ISWC'18 ADJUNCT), 2018, : 114 - 117
[37] A POWER MASK BASED AUDIO FINGERPRINT
Coover, Bob
Han, Jinyu
2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
[38] Video content extraction and representation using a joint audio and video processing
Vienna Univ of Technology, Vienna, Austria
ICASSP IEEE Int Conf Acoust Speech Signal Process Proc, (3033-3036):
[39] Video content extraction and representation using a joint audio and video processing
Saraceno, C
ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 3033 - 3036
[40] An evaluation of feature extraction for query-by-content audio information retrieval
Yu, Yi
Downie, J. Stephen
Joe, Kazuki
ISM WORKSHOPS 2007: NINTH IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA - WORKSHOPS, PROCEEDINGS, 2007, : 297 - +

← 1 2 3 4 5 →