Surveillance Audio Attention Model Based on Spatial Audio Cues

被引:0
|
作者
Hang, Bo [1 ,2 ]
Hu, RuiMin [2 ]
Yang, YuHong [2 ]
Ma, Ye [2 ]
Chang, Jun [3 ]
机构
[1] Xiangfan Univ, Xiangfan 441053, Peoples R China
[2] Wuhan Univ, Natl Engn Res Ctr Multimedia Software, Wuhan 430072, Peoples R China
[3] Wuhan Univ, Comp Sch, Wuhan 430072, Peoples R China
关键词
Audio attention; spatial audio; environment adaptive;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
For stereo audio surveillance in complex environment, we proposed a bottom-up audio attention model based on spatial audio cues analysis. and an environment adaptive normalization method The traditional audio attention models are based on mono audio characters, such as energy, energy peak. or pitch Their performance is limited by neglecting the spatial in The spatial cues in audio stream provide additional information for attention analysis And the dynamic updated background sound can help to reduce the environment effect The preliminary experiment showed that the proposed model is an effective way to analyzing at events. which is caused by rapid moving sound source, in stereo audio stream
引用
收藏
页码:908 / +
页数:2
相关论文
共 50 条
  • [21] Bootstrapping Audio-Visual Video Segmentation by Strengthening Audio Cues
    Chen, Tianxiang
    Tan, Zhentao
    Gong, Tao
    Chu, Qi
    Wu, Yue
    Liu, Bin
    Yu, Nenghai
    Lu, Le
    Ye, Jieping
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (03) : 2398 - 2409
  • [22] Spatial Audio in 360° Videos: Does it influence Visual Attention?
    Hirway, Amit
    Qiao, Yuansong
    Murray, Niall
    PROCEEDINGS OF THE 13TH ACM MULTIMEDIA SYSTEMS CONFERENCE, MMSYS 2022, 2022, : 39 - 51
  • [23] Investigation into spatial audio quality of experience in the presence of accompanying video cues with spatial mismatch
    Kim, Chungeun
    Kondoz, Ahmet
    Shi, Xiyu
    2013 IEEE GLOBECOM WORKSHOPS (GC WKSHPS), 2013, : 1192 - 1197
  • [24] AUDIO SET CLASSIFICATION WITH ATTENTION MODEL: A PROBABILISTIC PERSPECTIVE
    Kong, Qiuqiang
    Xu, Yong
    Wang, Wenwu
    Plumbley, Mark D.
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 316 - 320
  • [25] Audio-Visual Salieny Network with Audio Attention Module
    Cheng, Shuaiyang
    Gao, Xing
    Song, Liang
    Xiahou, Jianbing
    PROCEEDINGS OF 2021 2ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND INFORMATION SYSTEMS (ICAIIS '21), 2021,
  • [26] Music Audio Sentiment Classification Based on CNN-BiLSTM and Attention Model
    Chen Zhen
    Liu Changhui
    2021 4TH INTERNATIONAL CONFERENCE ON ROBOTICS, CONTROL AND AUTOMATION ENGINEERING (RCAE 2021), 2021, : 156 - 160
  • [27] Seeing more: Visualizing audio cues
    Bergstrom, Tony
    Karahalios, Karrie
    HUMAN-COMPUTER INTERACTION - INTERACT 2007, PT 2, PROCEEDINGS, 2007, 4663 : 29 - +
  • [28] Surveillance system for audio broadcast
    Schimmel, Jiri
    Prinosil, Jiri
    2006 7TH NORDIC SIGNAL PROCESSING SYMPOSIUM, 2006, : 298 - +
  • [29] Audio analysis for surveillance applications
    Radhakrishnan, R
    Divakaran, A
    Smaragdis, P
    2005 WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2005, : 158 - 161
  • [30] Spatial Audio & Jazz
    Rowden, Jonathan
    DOWN BEAT, 2022, 89 (02): : 62 - 63