Are acoustics enough? Semantic effects on auditory salience in natural scenes

Cited by: 1
Authors
Kothinti, Sandeep Reddy [1]
Elhilali, Mounya [1]
Affiliations
[1] Johns Hopkins Univ, Ctr Language & Speech Proc, Dept Elect & Comp Engn, Baltimore, MD 21218 USA
Source
FRONTIERS IN PSYCHOLOGY | 2023 / Volume 14
Keywords
auditory salience; auditory attention; audio event detection; bottom-up attention; auditory perception; VISUAL-ATTENTION; BEHAVIORAL-EXPERIMENTS; DISTRACTION; CAPTURE; MECHANISMS; ALLOCATION; OBJECTS; ONSETS; SHIFTS; BRAIN;
DOI
10.3389/fpsyg.2023.1276237
Chinese Library Classification number
B84 [Psychology];
Discipline classification code
04; 0402;
Abstract
Auditory salience is a fundamental property of a sound that allows it to grab a listener's attention regardless of their attentional state or behavioral goals. While previous research has shed light on acoustic factors influencing auditory salience, the semantic dimensions of this phenomenon have remained relatively unexplored, owing both to the complexity of measuring salience in audition and to the limited focus on complex natural scenes. In this study, we examine the relationship between acoustic, contextual, and semantic attributes and their impact on the auditory salience of natural audio scenes using a dichotic listening paradigm. The experiments present acoustic scenes in forward and backward directions; the backward presentation diminishes semantic effects, providing a counterpoint to the effects observed in forward scenes. The behavioral data collected from a crowd-sourced platform reveal a striking convergence in temporal salience maps for certain sound events, while marked disparities emerge for others. Our main hypothesis posits that differences in the perceptual salience of events are predominantly driven by semantic and contextual cues, which is particularly evident in cases displaying substantial disparities between forward and backward presentations. Conversely, events exhibiting a high degree of alignment can largely be attributed to low-level acoustic attributes. To evaluate this hypothesis, we employ analytical techniques that combine rich low-level mappings from acoustic profiles with high-level embeddings extracted from a deep neural network. This integrated approach captures both acoustic and semantic attributes of acoustic scenes along with their temporal trajectories. The results demonstrate that perceptual salience is a careful interplay between low-level and high-level attributes that shapes which moments stand out in a natural soundscape. Furthermore, our findings underscore the important role of longer-term context as a critical component of auditory salience, enabling us to discern and adapt to temporal regularities within an acoustic scene. The experimental and model-based validation of semantic factors of salience paves the way for a complete understanding of auditory salience. Ultimately, the empirical and computational analyses have implications for developing large-scale models for auditory salience and audio analytics.
Pages: 18