Perceptual thresholds of audio-visual spatial coherence for a variety of audio-visual objects

Authors
Stenzel, Hanne [1]
Jackson, Philip J. B. [1]
Affiliations
[1] Univ Surrey, Ctr Vis Speech & Signal Proc, Guildford GU2 7XH, Surrey, England
Source
2018 AES INTERNATIONAL CONFERENCE ON AUDIO FOR VIRTUAL AND AUGMENTED REALITY | 2018
Funding
Engineering and Physical Sciences Research Council (EPSRC), UK;
Keywords
AUDIBLE MOVEMENT ANGLE; SOUND LOCALIZATION; FREQUENCY; NOISE;
DOI
Not available
Chinese Library Classification number
O42 [Acoustics];
Subject classification codes
070206 ; 082403 ;
Abstract
Audio-visual spatial perception relies on the integration of auditory and visual spatial information. Depending on the auditory and visual features of the stimulus, and the relevance of each sound to the listener, offsets between the two signals are more or less acceptable. The current paper investigates to what extent each of these factors influences how critical the perception of spatial coherence is by estimating the psychometric function for seventeen audio-visual stimuli. The results show that the maximum accepted offset angle does not depend on semantic categories but is linked to audio feature classes, with harmonic sounds leading to greater acceptable offsets. A regression shows that the perceptual spectral centroid is negatively correlated with the offset angle and the slope of the psychometric spatial-coherence function. This finding, however, is not conclusive, and further research is necessary to define all parameters that influence bimodal localization of realistic stimuli.
Pages: 10