Perceptual thresholds of audio-visual spatial coherence for a variety of audio-visual objects

被引:0
|
作者
Stenzel, Hanne [1 ]
Jackson, Philip J. B. [1 ]
机构
[1] Univ Surrey, Ctr Vis Speech & Signal Proc, Guildford GU2 7XH, Surrey, England
来源
2018 AES INTERNATIONAL CONFERENCE ON AUDIO FOR VIRTUAL AND AUGMENTED REALITY | 2018年
基金
英国工程与自然科学研究理事会;
关键词
AUDIBLE MOVEMENT ANGLE; SOUND LOCALIZATION; FREQUENCY; NOISE;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Audio-visual spatial perception relies on the integration of both auditory and visual spatial information. Depending on auditory and visual features of the stimulus, and the relevance of each sound to the listener, offsets between both signals are more or less acceptable. The current paper investigates to which extent each of these factors influences how critical the perception of spatial coherence is by estimating the psychometric function for seventeen audio-visual stimuli. The results show that the maximum accepted offset angle does not depend on semantic categories but is linked to audio feature classes with harmonic sounds leading to greater acceptable offsets. A regression shows that the perceptual spectral centroid is negatively correlated with the offset angle and the slope of the psychometric spatial-coherence function. This finding, however, is not conclusive and further research is necessary to define all parameters that influence bimodal localization of realistic stimuli.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Perceptual encoding of musical notes: Is it visual, auditory or audio-visual?
    Yenicira, Gdem Gulcay
    Alici, Tevfik
    Calgan, Beril
    Uzar, Aylin Cakici
    INTERNATIONAL JOURNAL OF PSYCHOLOGY, 2023, 58 : 244 - 244
  • [22] AUDIO-VISUAL FOR THE PATIENT
    STUTTLE, FL
    JOURNAL OF BONE AND JOINT SURGERY-AMERICAN VOLUME, 1959, 41 (07): : 1362 - 1362
  • [23] The Audio-Visual Reader
    不详
    JOURNAL OF EDUCATIONAL RESEARCH, 1955, 48 (07): : 552 - 553
  • [24] An audio-visual speech recognition system for testing new audio-visual databases
    Pao, Tsang-Long
    Liao, Wen-Yuan
    VISAPP 2006: PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS, VOL 2, 2006, : 192 - +
  • [25] Audio-visual event detection based on mining of semantic audio-visual labels
    Goh, KS
    Miyahara, K
    Radhakrishan, R
    Xiong, ZY
    Divakaran, A
    STORAGE AND RETRIEVAL METHODS AND APPLICATIONS FOR MULTIMEDIA 2004, 2004, 5307 : 292 - 299
  • [26] LEARNING CONTEXTUALLY FUSED AUDIO-VISUAL REPRESENTATIONS FOR AUDIO-VISUAL SPEECH RECOGNITION
    Zhang, Zi-Qiang
    Zhang, Jie
    Zhang, Jian-Shu
    Wu, Ming-Hui
    Fang, Xin
    Dai, Li-Rong
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 1346 - 1350
  • [27] Audio-Visual Causality and Stimulus Reliability Affect Audio-Visual Synchrony Perception
    Li, Shao
    Ding, Qi
    Yuan, Yichen
    Yue, Zhenzhu
    FRONTIERS IN PSYCHOLOGY, 2021, 12
  • [28] AV2AV: Direct Audio-Visual Speech to Audio-Visual Speech Translation with Unified Audio-Visual Speech Representation
    Choi, Jeongsoo
    Park, Se Jin
    Kim, Minsu
    Ro, Yong Man
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 27315 - 27327
  • [29] Perceptual Quality of Audio-Visual Content with Common Video and Audio Degradations
    Becerra Martinez, Helard
    Hines, Andrew
    Farias, Mylene C. Q.
    APPLIED SCIENCES-BASEL, 2021, 11 (13):
  • [30] Feedback Modulates Audio-Visual Spatial Recalibration
    Kramer, Alexander
    Roeder, Brigitte
    Bruns, Patrick
    FRONTIERS IN INTEGRATIVE NEUROSCIENCE, 2020, 13