MODEL BASED MULTIPLE AUDIO SEQUENCE ALIGNMENT

Cited by: 0
Authors
Basaran, Dogac [1]
Cemgil, A. Taylan [2]
Anarim, Emin [1]
Affiliations
[1] Bogazici Univ, Dept Elect & Elect Engn, Istanbul, Turkey
[2] Dept Comp Engn, Istanbul, Turkey
Source
2011 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA) | 2011
Keywords
Audio alignment; Audio matching; Maximum likelihood; Probabilistic model
DOI
Not available
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
We formulate the alignment of multiple, partially overlapping audio sequences in a probabilistic framework. We define and compare four generative models for time-varying features extracted from audio clips that are recorded independently and asynchronously. The framework handles missing data and multiple clips, even when no single clip covers the entire material. We define a proper scoring function for each model, and matching is achieved with a sequential alignment algorithm. Simulation results on real data suggest that the approach can handle difficult, ambiguous scenarios or partial matchings.
Pages: 13-16
Page count: 4
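
Illustrative sketch (not taken from the paper): the abstract describes scoring candidate alignments under a generative model and picking the best match. The record does not give the four models or the sequential algorithm, so the Python below is only a minimal stand-in under stated assumptions: one-dimensional features, an i.i.d. Gaussian difference model with a fixed `sigma`, and an exhaustive search over offsets for a single pair of clips. The names `overlap_log_likelihood`, `best_offset`, `min_overlap`, and `sigma` are hypothetical.

```python
import math
import random


def overlap_log_likelihood(a, b, offset, sigma=1.0):
    """Mean per-frame Gaussian log-likelihood of clip b placed at `offset`
    (in frames) relative to clip a. Assumed model, not the paper's.
    Returns (score, number_of_overlapping_frames)."""
    start = max(0, offset)
    end = min(len(a), offset + len(b))
    n = end - start
    if n <= 0:
        return float("-inf"), 0
    sq = sum((a[t] - b[t - offset]) ** 2 for t in range(start, end))
    ll = -0.5 * sq / sigma ** 2 - n * math.log(sigma * math.sqrt(2 * math.pi))
    # Normalise by overlap length so short overlaps are not favoured.
    return ll / n, n


def best_offset(a, b, min_overlap=8, sigma=1.0):
    """Exhaustively score every offset with at least `min_overlap` shared frames."""
    best_score, best = float("-inf"), None
    for offset in range(-len(b) + min_overlap, len(a) - min_overlap + 1):
        score, n = overlap_log_likelihood(a, b, offset, sigma)
        if n >= min_overlap and score > best_score:
            best_score, best = score, offset
    return best


# Synthetic demo: two noisy clips of one source, neither covering all of it.
rng = random.Random(0)
source = [rng.gauss(0.0, 1.0) for _ in range(200)]
clip_a = [x + rng.gauss(0.0, 0.1) for x in source[0:120]]
clip_b = [x + rng.gauss(0.0, 0.1) for x in source[80:200]]

estimated = best_offset(clip_a, clip_b)
print("estimated offset of clip_b relative to clip_a:", estimated)
```

Normalising the score by the overlap length is one simple way to compare offsets with different amounts of shared material; the "proper scoring functions" mentioned in the abstract may treat this differently.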