Perceptual thresholds of audio-visual spatial coherence for a variety of audio-visual objects

被引:0
|
作者
Stenzel, Hanne [1 ]
Jackson, Philip J. B. [1 ]
机构
[1] Univ Surrey, Ctr Vis Speech & Signal Proc, Guildford GU2 7XH, Surrey, England
来源
2018 AES INTERNATIONAL CONFERENCE ON AUDIO FOR VIRTUAL AND AUGMENTED REALITY | 2018年
基金
英国工程与自然科学研究理事会;
关键词
AUDIBLE MOVEMENT ANGLE; SOUND LOCALIZATION; FREQUENCY; NOISE;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Audio-visual spatial perception relies on the integration of both auditory and visual spatial information. Depending on auditory and visual features of the stimulus, and the relevance of each sound to the listener, offsets between both signals are more or less acceptable. The current paper investigates to which extent each of these factors influences how critical the perception of spatial coherence is by estimating the psychometric function for seventeen audio-visual stimuli. The results show that the maximum accepted offset angle does not depend on semantic categories but is linked to audio feature classes with harmonic sounds leading to greater acceptable offsets. A regression shows that the perceptual spectral centroid is negatively correlated with the offset angle and the slope of the psychometric spatial-coherence function. This finding, however, is not conclusive and further research is necessary to define all parameters that influence bimodal localization of realistic stimuli.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Audio-Visual Objects
    Kubovy M.
    Schutz M.
    Review of Philosophy and Psychology, 2010, 1 (1) : 41 - 61
  • [2] Transfer of Audio-Visual Temporal Training to Temporal and Spatial Audio-Visual Tasks
    Suerig, Ralf
    Bottari, Davide
    Roeder, Brigitte
    MULTISENSORY RESEARCH, 2018, 31 (06) : 556 - 578
  • [3] Audio-visual spatial alignment improves integration in the presence of a competing audio-visual stimulus
    Fleming, Justin T.
    Noyce, Abigail L.
    Shinn-Cunningham, Barbara G.
    NEUROPSYCHOLOGIA, 2020, 146
  • [4] An audio-visual distance for audio-visual speech vector quantization
    Girin, L
    Foucher, E
    Feng, G
    1998 IEEE SECOND WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 1998, : 523 - 528
  • [5] Catching audio-visual mice:: The extrapolation of audio-visual speed
    Hofbauer, MM
    Wuerger, SM
    Meyer, GF
    Röhrbein, F
    Schill, K
    Zetzsche, C
    PERCEPTION, 2003, 32 : 96 - 96
  • [6] UNSUPERVISED EXTRACTION OF AUDIO-VISUAL OBJECTS
    Casanovas, Anna Llagostera
    Vandergheynst, Pierre
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 2284 - 2287
  • [7] Separation of audio-visual speech sources: A new approach exploiting the audio-visual coherence of speech stimuli
    Sodoyer, D
    Schwartz, JL
    Girin, L
    Klinkisch, J
    Jutten, C
    EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2002, 2002 (11) : 1165 - 1173
  • [8] An audio-visual speech recognition with a new mandarin audio-visual database
    Liao, Wen-Yuan
    Pao, Tsang-Long
    Chen, Yu-Te
    Chang, Tsun-Wei
    INT CONF ON CYBERNETICS AND INFORMATION TECHNOLOGIES, SYSTEMS AND APPLICATIONS/INT CONF ON COMPUTING, COMMUNICATIONS AND CONTROL TECHNOLOGIES, VOL 1, 2007, : 19 - +
  • [9] Separation of audio-visual speech sources: A new approach exploiting the audio-visual coherence of speech stimuli
    Sodoyer, D. (sodoyer@icp.inpg.fr), 1600, Hindawi Publishing Corporation (2002):
  • [10] Separation of Audio-Visual Speech Sources: A New Approach Exploiting the Audio-Visual Coherence of Speech Stimuli
    David Sodoyer
    Jean-Luc Schwartz
    Laurent Girin
    Jacob Klinkisch
    Christian Jutten
    EURASIP Journal on Advances in Signal Processing, 2002