A comparative evaluation of search techniques for query-by-humming using the MUSART testbed

被引:35
作者
Dannenberg, Roger B. [1 ]
Birmingham, William P.
Pardo, Bryan
Hu, Ning
Meek, Colin
Tzanetakis, George
机构
[1] Carnegie Mellon Univ, Dept Comp Sci, Pittsburgh, PA 15213 USA
[2] Grove City Coll, Grove City, PA 16127 USA
[3] Northwestern Univ, Dept Elect Engn & Comp Sci, Evanston, IL 60208 USA
[4] Google Inc, New York Off, New York, NY 10018 USA
[5] Microsoft Corp, Redmond, WA 98052 USA
[6] Univ Victoria, Dept Comp Sci, Victoria, BC V8W 3P6, Canada
来源
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY | 2007年 / 58卷 / 05期
关键词
D O I
10.1002/asi.20532
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Query-by-humming systems offer content-based searching for melodies and require no special musical training or knowledge. Many such systems have been built, but there has not been much useful evaluation and comparison in the literature due to the lack of shared databases and queries. The MUSART project testbed allows various search algorithms to be compared using a shared framework that automatically runs experiments and summarizes results. Using this testbed, the authors compared algorithms based on string alignment, melodic contour matching, a hidden Markov model, n-grams, and CubyHum. Retrieval performance is very sensitive to distance functions and the representation of pitch and rhythm, which raises questions about some previously published conclusions. Some algorithms are particularly sensitive to the quality of queries. Our queries, which are taken from human subjects in a realistic setting, are quite difficult, especially for n-gram models. Finally, simulations on query-by-humming performance as a function of database size indicate that retrieval performance falls only slowly as the database size increases.
引用
收藏
页码:687 / 701
页数:15
相关论文
共 32 条
  • [11] Goto M, 2000, INT CONF ACOUST SPEE, P757
  • [12] Hewlett W. B., 1998, MELODIC SIMILARITY C, V11
  • [13] HOOS H, 2001, INT S MUS INF RETR B
  • [14] HSU JL, 2001, 2 ANN INT S MUS INF
  • [15] HSU JL, 2002, C INF KNOWL MAN VA
  • [16] HU N, 2002, P 2 ACM IEEE CS JOIN
  • [17] JIN H, 2002, ISMIR 2002 C P PAR F
  • [18] MAZZONI D, 2001, 2 ANN INT S MUS INF
  • [19] McNab R. J., 1996, Proceedings of the 1st ACM International Conference on Digital Libraries, P11, DOI 10.1145/226931.226934
  • [20] MEEK C, 2001, 2 ANN INT S MUS INF