A comparative evaluation of search techniques for query-by-humming using the MUSART testbed

被引:35
作者
Dannenberg, Roger B. [1 ]
Birmingham, William P.
Pardo, Bryan
Hu, Ning
Meek, Colin
Tzanetakis, George
机构
[1] Carnegie Mellon Univ, Dept Comp Sci, Pittsburgh, PA 15213 USA
[2] Grove City Coll, Grove City, PA 16127 USA
[3] Northwestern Univ, Dept Elect Engn & Comp Sci, Evanston, IL 60208 USA
[4] Google Inc, New York Off, New York, NY 10018 USA
[5] Microsoft Corp, Redmond, WA 98052 USA
[6] Univ Victoria, Dept Comp Sci, Victoria, BC V8W 3P6, Canada
来源
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY | 2007年 / 58卷 / 05期
关键词
D O I
10.1002/asi.20532
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Query-by-humming systems offer content-based searching for melodies and require no special musical training or knowledge. Many such systems have been built, but there has not been much useful evaluation and comparison in the literature due to the lack of shared databases and queries. The MUSART project testbed allows various search algorithms to be compared using a shared framework that automatically runs experiments and summarizes results. Using this testbed, the authors compared algorithms based on string alignment, melodic contour matching, a hidden Markov model, n-grams, and CubyHum. Retrieval performance is very sensitive to distance functions and the representation of pitch and rhythm, which raises questions about some previously published conclusions. Some algorithms are particularly sensitive to the quality of queries. Our queries, which are taken from human subjects in a realistic setting, are quite difficult, especially for n-gram models. Finally, simulations on query-by-humming performance as a function of database size indicate that retrieval performance falls only slowly as the database size increases.
引用
收藏
页码:687 / 701
页数:15
相关论文
共 32 条
  • [1] BAINBRIDGE D, 1999, INT C DIG LIB BERK C
  • [2] BAINBRIDGE D, 2002, DIG LIB PEOPL KNOWL
  • [3] CLAUSEN M, 2000, 1 INT S MUS INF RETR
  • [4] DANNENBERG RB, 2003, ISMIR 2003 4 INT C M
  • [5] DANNENBERG RB, 2004, ISMIR 2004 5 INT C M
  • [6] Robust polyphonic music retrieval with N-grams
    Doraisamy, S
    Rüeger, S
    [J]. JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2003, 21 (01) : 53 - 70
  • [7] DORAISAMY S, 2002, ISMIR 2002 3 INT C M
  • [8] DOWNIE JS, 2000, P 23 ANN INT ACM SIG
  • [9] Durbin R., 1998, BIOL SEQUENCE ANAL
  • [10] DUREY AS, 2001, 2 ANN INT S MUS INF