Two-Stage Method for Specific Audio Retrieval based on MP3 Compression Domain

被引:0
作者
Tsai, Tsung-Han [1 ]
Chang, Wei-Chin [1 ]
机构
[1] Natl Cent Univ, Dept Elect Engn, Chungli 32054, Taiwan
来源
ISCAS: 2009 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-5 | 2009年
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, the content-based retrieval of one-singer audio example based on MP3 (MPEG 1 layer III) digital music archive is considered. In our proposed method, the Sub-Band Coefficients (SBC) in a MP3 frame is used for feature extraction. Both Quantization-Tree indexing (QT) approach and the Mel-Frequency Subband Coefficients (MFSCs) approach are proposed for indexing on MP3 objects. Finally, a Melody-Line contour comparison method is used to measure the similarity between MP3 objects. Evaluations on a content-based MP3 retrieval system are performed. Experimental results show that our proposed approach can perform good performance and high accuracy. At least 95.9% of accuracy can be achieved at the top-1 retrieval result.
引用
收藏
页码:713 / 716
页数:4
相关论文
共 50 条
[31]   A Shape-Based Two-Stage Product Image Retrieval Method [J].
Gong Shangfu ;
Du Juan .
INTELLIGENT SYSTEM AND APPLIED MATERIAL, PTS 1 AND 2, 2012, 466-467 :1050-1054
[32]   Frequency Shift Method for MP3 Audio Data by Modifying Inputs of IMDCT [J].
Jung, Seung Pyo ;
Lee, Dong Hoon ;
Kim, Tae Hoon ;
Park, Ju Sung .
JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2016, 64 (1-2) :13-22
[33]   Frequency shift method for MP3 audio data by modifying inputs of IMDCT [J].
Jung, Seung Pyo ;
Lee, Dong Hoon ;
Kim, Tae Hoon ;
Park, Ju Sung .
AES: Journal of the Audio Engineering Society, 2016, 64 (1-2) :13-22
[34]   Data Audio Compression Lossless FLAG Format to Lossy Audio MP3 format with Huffman Shift Coding Algorithm [J].
Firmansah, Luthfi ;
Setiawan, Erwin Budi .
2016 4TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY (ICOICT), 2016,
[35]   Content-based retrieval of MP3 songs based on query by singing [J].
Lie, WN ;
Su, CK .
2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL V, PROCEEDINGS: DESIGN AND IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS INDUSTRY TECHNOLOGY TRACKS MACHINE LEARNING FOR SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING SIGNAL PROCESSING FOR EDUCATION, 2004, :929-932
[36]   A fast two-stage content-based image retrieval approach in the DCT domain [J].
Tsai, Tienwei ;
Huang, Yo-Ping ;
Chiang, Te-Wei .
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2008, 22 (04) :765-781
[37]   Research of MP3 Audio Digital Watermark Algorithm Based on Hash Values [J].
Wei, Xianmin .
MATERIALS SCIENCE AND ENGINEERING, PTS 1-2, 2011, 179-180 :830-835
[38]   A new DCT audio watermarking scheme based on preliminary MP3 study [J].
Maha Charfeddine ;
Maher El’arbi ;
Chokri Ben Amar .
Multimedia Tools and Applications, 2014, 70 :1521-1557
[39]   A new DCT audio watermarking scheme based on preliminary MP3 study [J].
Charfeddine, Maha ;
El'arbi, Maher ;
Ben Amar, Chokri .
MULTIMEDIA TOOLS AND APPLICATIONS, 2014, 70 (03) :1521-1557
[40]   Detection of double MP3 compression Based on Difference of Calibration Histogram [J].
Yanzhen Ren ;
Mengdi Fan ;
Dengpan Ye ;
Jing Yang ;
Lina Wang .
Multimedia Tools and Applications, 2016, 75 :13855-13870