Slicing it all ways: Mathematical models for tonal induction, approximation, and segmentation using the spiral array

被引:4
作者
Chew, Elaine [1 ]
机构
[1] Univ So Calif, Viterbi Sch Engn, Epstein Dept Ind & Syst Engn, Los Angeles, CA 90089 USA
关键词
computational music cognition; automated tonal analysis; real-time algorithms;
D O I
10.1287/ijoc.1050.0134
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This paper presents the spiral array model and its associated algorithms for tonal induction, approximation, and segmentation. The spiral array is a geometric model for tonality that clusters perceptually similar tonal entities. The model summarizes music information as interior points inside an array of spirals. Distances in the spiral array space are used to quantify tonal similarity The paper traces the evolution, and presents general forms, of the existing algorithms. for key finding, pitch spelling, and segmentation, and proposes a new O(n) algorithm, Argus, for tonal segmentation. The proposed algorithm computes a value that quantifies the discrepancy between the local contexts in the future and past at each point in time. Discrepancy values exceeding control thresholds are shown to mark the segmentation boundaries of the test set that concur with expert analyses. A number of window sizes and threshold settings are investigated. The algorithm is demonstrated using Edward MacDowell's To A Wild Rose and tested on Franz Schubert's Allegretto from Moment Musical D780 No. 6 and Thema from Impromptu D935 No. 4. The algorithm accurately locates tonal boundaries in all three case studies.
引用
收藏
页码:305 / 320
页数:16
相关论文
共 23 条
[1]   Pitch spelling: A computational model [J].
Cambouropoulos, E .
MUSIC PERCEPTION, 2003, 20 (04) :411-429
[2]  
Chew E., 2002, Music and Artificial Intelligence. Second International Conference, ICMAI 2002. Proceedings (Lecture Notes in Artificial Intelligence Vol.2445), P18
[3]   Real-time pitch spelling using the spiral array [J].
Chew, E ;
Chen, YC .
COMPUTER MUSIC JOURNAL, 2005, 29 (02) :61-76
[4]  
CHEW E, 2001, 23 ANN M COGN SCI SO, P206
[5]  
Chew E., 2000, THESIS MIT CAMBRIDGE
[6]  
CHEW E, 2003, P 4 INT C MUS INF RE
[7]  
CHEW E, 2003, ACM MULT C 2003 BERK, P448
[8]   Tonal description of polyphonic audio for music content processing [J].
Gomez, Emilia .
INFORMS JOURNAL ON COMPUTING, 2006, 18 (03) :294-304
[9]  
Krumhansl C. L., 1990, COGNITIVE FDN MUSICA
[10]  
Longuet-Higgins H.C., 1962, MUSIC REV, V23, P271