Audio fingerprint extraction for content identification

Cited by: 1
Authors
Shiu, Y [1]
Yeh, CH [1]
Kuo, CCJ [1]
Affiliations
[1] Univ So Calif, Integrated Media Syst Ctr, Los Angeles, CA 90089 USA
Source
Keywords
audio fingerprint; audio identification; zero-crossing rate; audio database management; audio processing
DOI
10.1117/12.511271
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this work, we present an audio content identification system that identifies unknown audio material by comparing its fingerprint with fingerprints extracted off-line and stored in a music database. We describe in detail the procedure for extracting audio fingerprints and demonstrate that they are robust to noise and content-preserving manipulations. The main feature in the proposed system is the zero-crossing rate extracted with an octave-band filter bank. The zero-crossing rate describes the dominant frequency in each subband at very low computational cost. The audio fingerprint is small and can be stored efficiently alongside the compressed files in the database. It is also robust to many modifications, such as tempo change and time-alignment distortion. In addition, the octave-band filter bank enhances robustness to distortion, especially distortion localized in particular frequency regions.
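The idea of reading a dominant frequency per subband from the zero-crossing rate can be illustrated with a minimal sketch. The abstract does not specify the filter design, band centers, or frame length, so everything concrete here is an assumption made for illustration: the second-order band-pass sections with Q = sqrt(2) (one octave wide), the four band centers, the 8 kHz sampling rate, and the helper names `zero_crossing_rate`, `octave_bandpass`, and `band_dominant_frequencies`. The sketch relies only on the fact that a sinusoid of frequency f crosses zero 2f times per second, so half the zero-crossing rate approximates the dominant frequency of a band.

```python
import math


def zero_crossing_rate(x, fs):
    """Zero crossings per second, counted as sign changes between neighbouring samples."""
    crossings = sum(1 for a, b in zip(x, x[1:]) if (a >= 0) != (b >= 0))
    return crossings * fs / (len(x) - 1)


def octave_bandpass(x, fs, fc):
    """One-octave-wide band-pass around fc (assumed design: biquad with Q = sqrt(2))."""
    q = math.sqrt(2.0)  # Q = fc / bandwidth for a band spanning fc/sqrt(2) .. fc*sqrt(2)
    w0 = 2.0 * math.pi * fc / fs
    alpha = math.sin(w0) / (2.0 * q)
    a0 = 1.0 + alpha
    b0, b2 = alpha / a0, -alpha / a0  # the b1 coefficient is zero for this band-pass
    a1, a2 = -2.0 * math.cos(w0) / a0, (1.0 - alpha) / a0
    y = []
    x1 = x2 = y1 = y2 = 0.0
    for xn in x:
        yn = b0 * xn + b2 * x2 - a1 * y1 - a2 * y2
        x2, x1 = x1, xn
        y2, y1 = y1, yn
        y.append(yn)
    return y


def band_dominant_frequencies(x, fs, centers):
    """Estimate the dominant frequency of each band as half its zero-crossing rate."""
    return [zero_crossing_rate(octave_bandpass(x, fs, fc), fs) / 2.0 for fc in centers]


# Hypothetical test signal: one second of two tones, 300 Hz and 1500 Hz.
fs = 8000
signal = [
    math.sin(2 * math.pi * 300 * n / fs) + math.sin(2 * math.pi * 1500 * n / fs)
    for n in range(fs)
]
centers = [250, 500, 1000, 2000]  # assumed octave-spaced band centers in Hz
estimates = band_dominant_frequencies(signal, fs, centers)
for fc, est in zip(centers, estimates):
    print(f"band centered at {fc} Hz: dominant frequency estimate {est:.1f} Hz")
```

Each band costs a handful of multiplications per sample for the filter plus one sign comparison per sample for the count, which is consistent with the low computational cost the abstract claims. A real fingerprint would compute such values frame by frame and quantize them; those steps are not described in the abstract and are left out here.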
Pages: 55-64
Page count: 10