An Information-Geometric Approach to Real-Time Audio Segmentation

Cited by: 16
Authors
Dessein, Arnaud [1]
Cont, Arshia [1]
Affiliation
[1] UPMC, MuTant Project Team INRIA, UMR STMS 9912, IRCAM, CNRS, F-75004 Paris, France
Keywords
Audio segmentation; change detection; information geometry; real-time system;
DOI
10.1109/LSP.2013.2247039
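Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology];
Discipline classification codes
0808 ; 0809 ;
Abstract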
We present a generic approach to real-time audio segmentation in the framework of information geometry for exponential families. The proposed system detects changes by monitoring the information rate of the signals as they arrive in time. We also address shortcomings of traditional cumulative sum approaches to change detection, which assume known parameters before change. This is done by considering exact generalized likelihood ratio test statistics, with a complete estimation of the unknown parameters in the respective hypotheses. We derive an efficient sequential scheme to compute these statistics through convex duality. We finally provide results for speech segmentation in speakers, and polyphonic music segmentation in note slices.
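The generalized likelihood ratio idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a univariate Gaussian as the exponential family, processes one fixed window in batch rather than a live stream, and the names `gaussian_conjugate`, `glr_statistics`, `detect_change`, `min_len`, and `var_floor` are hypothetical. It relies on the standard fact that the maximized log-likelihood of n samples from an exponential family equals n F*(η̂) plus a data-only term, where F* is the convex conjugate of the log-normalizer and η̂ is the mean of the sufficient statistics, so the statistic for a candidate change after i samples is i F*(η̂_before) + (n − i) F*(η̂_after) − n F*(η̂_all).

```python
import numpy as np


def gaussian_conjugate(eta1, eta2, var_floor=1e-12):
    """Conjugate F*(eta) of the Gaussian log-normalizer, up to a constant.

    eta1 and eta2 are the sample means of x and x**2. The dropped additive
    constant cancels in the GLR statistic. var_floor is an assumed guard
    against taking the log of a zero variance.
    """
    var = np.maximum(eta2 - eta1 ** 2, var_floor)
    return -0.5 * np.log(var)


def glr_statistics(x, min_len=10):
    """GLR statistic for every candidate change point in the window x.

    Parameters before and after the change are both estimated by maximum
    likelihood. Cumulative sums of the sufficient statistics (x, x**2)
    make each candidate cost O(1).
    """
    x = np.asarray(x, dtype=float)
    n = x.size
    s1 = np.cumsum(x)
    s2 = np.cumsum(x ** 2)
    # i = number of samples assumed to precede the change
    i = np.arange(min_len, n - min_len + 1)
    before = i * gaussian_conjugate(s1[i - 1] / i, s2[i - 1] / i)
    after = (n - i) * gaussian_conjugate(
        (s1[-1] - s1[i - 1]) / (n - i), (s2[-1] - s2[i - 1]) / (n - i)
    )
    whole = n * gaussian_conjugate(s1[-1] / n, s2[-1] / n)
    return i, before + after - whole


def detect_change(x, threshold, min_len=10):
    """Return the most likely change index, or None if below threshold."""
    i, stats = glr_statistics(x, min_len)
    if stats.size == 0:
        return None
    k = int(np.argmax(stats))
    return int(i[k]) if stats[k] > threshold else None


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    signal = np.concatenate([rng.normal(0.0, 1.0, 200), rng.normal(5.0, 1.0, 200)])
    print(detect_change(signal, threshold=20.0))
```

In a streaming setting one would presumably rerun or incrementally update this test as each new frame arrives and restart the window after a detection; the threshold value used here is an arbitrary choice for the illustration.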
Pages: 331-334
Page count: 4