Online audio background determination for complex audio environments

被引:12
作者
Moncrieff, Simon [1 ]
Venkatesh, Svetha [1 ]
West, Geoff [1 ]
机构
[1] Curtin Univ Technol, Dept Comp, Perth, WA 6845, Australia
关键词
algorithms; management; audio analysis; surveillance and monitoring; online background modelling;
D O I
10.1145/1230812.1230814
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We present a method for foreground/background separation of audio using a background modelling technique. The technique models the background in an online, unsupervised, and adaptive fashion, and is designed for application to long term surveillance and monitoring problems. The background is determined using a statistical method to model the states of the audio over time. In addition, three methods are used to increase the accuracy of background modelling in complex audio environments. Such environments can cause the failure of the statistical model to accurately capture the background states. An entropy-based approach is used to unify background representations fragmented over multiple states of the statistical model. The approach successfully unifies such background states, resulting in a more robust background model. We adaptively adjust the number of states considered background according to background complexity, resulting in the more accurate classification of background models. Finally, we use an auxiliary model cache to retain potential background states in the system. This prevents the deletion of such states due to a rapid influx of observed states that can occur for highly dynamic sections of the audio signal. The separation algorithm was successfully applied to a number of audio environments representing monitoring applications.
引用
收藏
页数:30
相关论文
共 26 条
  • [21] SoundButton:: Design of a low power wearable audio classification system
    Stäger, M
    Lukowicz, P
    Perera, N
    von Büren, T
    Tröster, G
    Starner, T
    [J]. SEVENTH IEEE INTERNATIONAL SYMPOSIUM ON WEARABLE COMPUTERS, PROCEEDINGS, 2003, : 12 - 17
  • [22] Stauffer C., 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149), P246, DOI 10.1109/CVPR.1999.784637
  • [23] Vacher M, 2004, Proceedings of the Second IASTED International Conference on Biomedical Engineering, P395
  • [24] Witten IH, Frank E: Data Mining: Practical Machine Learning Tools and Techniques 2nd editionSan Francisco: Morgan Kaufmann Publishers; 2005:560. ISBN 0-12-088407-0, £34.99
    Francisco Azuaje
    [J]. BioMedical Engineering OnLine, 5 (1)
  • [25] Pfinder: Real-time tracking of the human body
    Wren, CR
    Azarbayejani, A
    Darrell, T
    Pentland, AP
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1997, 19 (07) : 780 - 785
  • [26] Hierarchical classification of audio data for archiving and retrieving
    Zhang, T
    Kuo, CCJ
    [J]. ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 3001 - 3004