Unsupervised Bayesian Surprise Detection in Spatial Audio with Convolutional Variational Autoencoder and LSTM Model

被引:0
|
作者
Khah, Arman Nik [1 ]
Htun, Chitsein [1 ]
Prakash, Ravi [1 ]
机构
[1] Univ Texas Dallas, Richardson, TX 75083 USA
来源
PROCEEDINGS OF THE 2024 ACM INTERNATIONAL CONFERENCE ON INTERACTIVE MEDIA EXPERIENCES WORKSHOPS, IMXW 2024 | 2024年
关键词
360 degrees video; spatial audio; visual attention; Bayesian surprise; unsupervised learning; VAE-LSTM; AMBISONICS;
D O I
10.1145/3672406.3672422
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Understanding user visual attention (VA) is crucial for Field-of-View (FoV) prediction and resultant bandwidth optimization for 360 degrees video streaming. The influence of spatial audio on VA has been largely overlooked. Traditional methods, using saliency, characterize important stimuli as statistical outliers [4] but fail to capture the Temporal Evolution of Attention (TEA), where initially salient stimuli become routine and less attention-grabbing due to continual exposure [2, 20]. This paper introduces a novel unsupervised deep learning approach using a Convolutional Variational Autoencoder and Long Short-Term Memory (CVAE-LSTM) model to detect Bayesian surprise [2] in spatial audio streams, considering factors such as time, context, and user expectations. Our findings highlight the importance of temporal context in determining the surprisal value of audio events and the selective nature of sensory processing and attention in complex environments.
引用
收藏
页码:116 / 121
页数:6
相关论文
共 32 条
  • [21] A novel unsupervised anomaly detection method for rotating machinery based on memory augmented temporal convolutional autoencoder
    Li, Wanxiang
    Shang, Zhiwu
    Zhang, Jie
    Gao, Maosheng
    Qian, Shiqi
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 123
  • [22] Abnormal fastener detection model based on deep convolutional autoencoder with structural similarity
    Li Q.-Y.
    Wang J.-Z.
    Zhu Y.-Z.
    Huang Q.-L.
    Peng W.-J.
    Wang S.-C.
    Dai P.
    Jiaotong Yunshu Gongcheng Xuebao/Journal of Traffic and Transportation Engineering, 2022, 22 (04): : 186 - 195
  • [23] Dragon_Pi: IoT Side-Channel Power Data Intrusion Detection Dataset and Unsupervised Convolutional Autoencoder for Intrusion Detection
    Lightbody, Dominic
    Ngo, Duc-Minh
    Temko, Andriy
    Murphy, Colin C.
    Popovici, Emanuel
    FUTURE INTERNET, 2024, 16 (03)
  • [24] A Novel Model for Ship Trajectory Anomaly Detection Based on Gaussian Mixture Variational Autoencoder
    Xie, Lei
    Guo, Tao
    Chang, Jiliang
    Wan, Chengpeng
    Hu, Xinyuan
    Yang, Yang
    Ou, Changkui
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (11) : 13826 - 13835
  • [25] DVAEGMM: Dual Variational Autoencoder With Gaussian Mixture Model for Anomaly Detection on Attributed Networks
    Khan, Wasim
    Haroon, Mohammad
    Khan, Ahmad Neyaz
    Hasan, Mohammad Kamrul
    Khan, Asif
    Mokhtar, Umi Asma
    Islam, Shayla
    IEEE ACCESS, 2022, 10 : 91160 - 91176
  • [26] AE5-SSIM: A Novel Unsupervised Tinfoils Defect Detection Model with Deep Autoencoder
    Zhang, Fanghui
    Zhang, Linna
    Zhang, Damin
    Huang, Yansen
    Kan, Shichao
    Cen, Yigang
    2022 16TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP2022), VOL 1, 2022, : 212 - 217
  • [27] A Probabilistic Model Based on Bipartite Convolutional Neural Network for Unsupervised Change Detection
    Liu, Jia
    Zhang, Wenhua
    Liu, Fang
    Xiao, Liang
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [28] Automatic Fabric Defect Detection with a Multi-Scale Convolutional Denoising Autoencoder Network Model
    Mei, Shuang
    Wang, Yudan
    Wen, Guojun
    SENSORS, 2018, 18 (04)
  • [29] VGGM: Variational Graph Gaussian Mixture Model for Unsupervised Change Point Detection in Dynamic Networks
    Zhang, Xinxun
    Jiao, Pengfei
    Gao, Mengzhou
    Li, Tianpeng
    Wu, Yiming
    Wu, Huaming
    Zhao, Zhidong
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 : 4272 - 4284
  • [30] Hybrid model of convolutional auto-encoder and ellipse characteristic for unsupervised high impedance fault detection
    Yang, Junjie
    Delinchant, Benoit
    Niyato, Dusit
    Hadjsaid, Nouredine
    ELECTRIC POWER SYSTEMS RESEARCH, 2025, 238