Audio fingerprint extraction for content identification

Cited by: 1
Authors
Shiu, Y [1]
Yeh, CH [1]
Kuo, CCJ [1]
Affiliations
[1] Univ So Calif, Integrated Media Syst Ctr, Los Angeles, CA 90089 USA
Source
Keywords
audio fingerprint; audio identification; zero-crossing rate; audio database management; audio processing;
DOI
10.1117/12.511271
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this work, we present an audio content identification system that identifies unknown audio material by comparing its fingerprint with fingerprints extracted off-line and stored in a music database. We describe in detail the procedure for extracting audio fingerprints and demonstrate that they are robust to noise and content-preserving manipulations. The main feature in the proposed system is the zero-crossing rate extracted with an octave-band filter bank. The zero-crossing rate describes the dominant frequency in each subband at very low computational cost. The audio fingerprint is small and can be stored efficiently alongside the compressed files in the database. It is also robust to many modifications, such as tempo change and time-alignment distortion. In addition, the octave-band filter bank enhances robustness to distortion, especially distortion localized in particular frequency regions.
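The link between zero-crossing rate and dominant frequency mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the 8 kHz sample rate, the 440 Hz test tone, and the 125 Hz lowest band edge are all assumptions chosen for illustration, and the band-pass filtering step of the octave-band filter bank is omitted (only the band edges are computed).

```python
import math


def zero_crossing_rate(samples):
    """Fraction of adjacent sample pairs whose signs differ."""
    if len(samples) < 2:
        return 0.0
    crossings = sum(
        1 for a, b in zip(samples, samples[1:]) if (a >= 0) != (b >= 0)
    )
    return crossings / (len(samples) - 1)


def dominant_frequency(samples, sample_rate):
    """Estimate the dominant frequency in Hz from the zero-crossing rate.

    A pure tone crosses zero twice per cycle, so f is roughly ZCR * fs / 2.
    """
    return zero_crossing_rate(samples) * sample_rate / 2.0


def octave_band_edges(f_low, num_bands):
    """Return (low, high) edges in Hz; each band spans one octave."""
    return [(f_low * 2 ** k, f_low * 2 ** (k + 1)) for k in range(num_bands)]


# Hypothetical test signal: one second of a 440 Hz tone sampled at 8 kHz.
sample_rate = 8000
tone = [
    math.sin(2 * math.pi * 440 * n / sample_rate + 0.1)
    for n in range(sample_rate)
]
estimate = dominant_frequency(tone, sample_rate)
bands = octave_band_edges(125, 5)
```

In a full system, each octave band would first be isolated with a band-pass filter, and the zero-crossing rate would then be computed per band and per frame. The estimate only needs sign comparisons and a counter, which is consistent with the low computational cost the abstract describes.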
Pages: 55-64
Number of pages: 10
Related papers
50 entries in total
  • [21] A Noise Robust Audio Fingerprint Extraction Technique for Mobile Devices Using Gradient Histograms
    Park, Taejin
    Beack, SeungKwon
    Lee, Taejin
    2015 IEEE 5TH INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - BERLIN (ICCE-BERLIN), 2015, : 287 - 290
  • [22] Pairwise Boosted Audio Fingerprint
    Jang, Dalwon
    Yoo, Chang D.
    Lee, Sunil
    Kim, Sungwoong
    Kalker, Ton
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2009, 4 (04) : 995 - 1004
  • [23] Content-based identification of audio titles on the Internet
    Neuschmied, H
    Mayer, H
    Batlle, E
    FIRST INTERNATIONAL CONFERENCE ON WEB DELIVERING OF MUSIC, PROCEEDINGS, 2001, : 96 - 100
  • [24] Content extraction based on MPEG-7 audio representation
    Wieczorkowska, AA
    INTELLIGENT INFORMATION SYSTEMS 2002, PROCEEDINGS, 2002, 17 : 141 - 144
  • [25] Audio content extraction from MPEG-encoded sequences
    Pfeiffer, S
    Robert-Ribes, J
    Kim, D
    PROCEEDINGS OF THE FIFTH JOINT CONFERENCE ON INFORMATION SCIENCES, VOLS 1 AND 2, 2000, : A513 - A516
  • [26] MUSIC FINGERPRINT EXTRACTION FOR CLASSICAL MUSIC COVER SONG IDENTIFICATION
    Kim, Samuel
    Unal, Erdem
    Narayanan, Shrikanth
    2008 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-4, 2008, : 1261 - 1264
  • [27] IDFE: Fingerprint Deep Extraction Method for IoT Device Identification
    Tang, Yuezhong
    Lu, Shida
    Qian, Lifeng
    Wei, Xueyin
    Gu, Rongbin
    Huang, Jun
    Li, Jing
    Computer Engineering and Applications, 2024, 60 (17) : 117 - 128
  • [28] A Lightweight Radio Frequency Fingerprint Extraction Scheme for Device Identification
    Song, Lili
    Gao, Zhenzhen
    Huang, Jian
    Han, Boliang
    2023 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC, 2023,
  • [29] Overview of radio frequency fingerprint extraction in specific emitter identification
    Sun L.
    Huang Z.
    Wang X.
    Wang F.
    Li B.
    Journal of Radars, 2020, 9 (06) : 1014 - 1031
  • [30] Direct gray-scale minutiae extraction in fingerprint identification
    Quek, C
    Wahab, A
    PARALLEL AND DISTRIBUTED COMPUTING SYSTEMS, 2000, : 646 - 651