M-MUSICS: an intelligent mobile music retrieval system

Cited by: 2
Authors
Rho, Seungmin [2]
Hwang, Eenjun [2]
Park, Jong Hyuk [1]
Affiliations
[1] Seoul Natl Univ Sci & Technol, Dept Comp Sci & Engn, Seoul, South Korea
[2] Korea Univ, Sch Elect Engn, Seoul, South Korea
Keywords
Content-based audio retrieval; Mobile platform; Relevance feedback; Signal processing; Image retrieval; Similarity
DOI
10.1007/s00530-010-0212-y
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Accurate voice humming transcription and efficient indexing and retrieval schemes are essential to a large-scale humming-based audio retrieval system. Although much research has been done to develop such schemes, their performance in terms of precision, recall, and F-measure is still unsatisfactory across similarity metrics. In this paper, we propose a new voice query transcription scheme that combines note onset detection using dynamic threshold methods, fundamental frequency (F0) acquisition for each frame, and frequency realignment using K-means. For indexing, we use a popularity-adaptive index structure called the frequently accessed index (FAI), which is built from frequently queried tunes. In addition, we propose a semi-supervised relevance feedback and query reformulation scheme based on a genetic algorithm to improve retrieval efficiency. We also extend this work to mobile multimedia environments and develop a mobile audio retrieval system. Experiments show that our system performs satisfactorily in wireless mobile multimedia environments.
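The abstract evaluates retrieval quality by precision, recall, and F-measure. As a reminder of how these three figures relate, here is a minimal sketch for a single query. It uses the standard set-based definitions and the balanced F-measure (F1); the abstract does not say which F-measure weighting the authors use, so that choice is an assumption, and the function name and tune identifiers are hypothetical.

```python
def precision_recall_f1(retrieved, relevant):
    """Return (precision, recall, F-measure) for one query.

    retrieved: identifiers returned by the system
    relevant:  identifiers judged relevant to the query
    """
    retrieved_set = set(retrieved)
    relevant_set = set(relevant)
    hits = len(retrieved_set & relevant_set)

    # Guard against empty sets so the ratios are always defined.
    precision = hits / len(retrieved_set) if retrieved_set else 0.0
    recall = hits / len(relevant_set) if relevant_set else 0.0

    if precision + recall == 0:
        return precision, recall, 0.0

    # Balanced F-measure: harmonic mean of precision and recall.
    f_measure = 2 * precision * recall / (precision + recall)
    return precision, recall, f_measure


# Hypothetical tune identifiers, for illustration only.
retrieved = ["tune_a", "tune_b", "tune_c", "tune_d"]
relevant = ["tune_a", "tune_c", "tune_e"]
p, r, f = precision_recall_f1(retrieved, relevant)
print(f"precision={p:.2f} recall={r:.2f} F={f:.2f}")
```

In this made-up query, two of the four retrieved tunes are relevant and two of the three relevant tunes are found, so the F-measure sits between the two ratios and closer to the lower one.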
Pages: 313-326
Number of pages: 14