Visual-auditory perception of prosodic focus in Japanese by native and non-native speakers

被引:0
作者
Zhang, Yixin [1 ]
Chen, Xi [1 ]
Chen, Si [1 ,2 ,3 ,4 ]
Meng, Yuzhe [1 ]
Lee, Albert [5 ]
机构
[1] Hong Kong Polytech Univ, Dept Chinese & Bilingual Studies, Hong Kong, Peoples R China
[2] Hong Kong Polytech Univ, Res Ctr Language Cognit & Neurosci, Dept Chinese & Bilingual Studies, Hong Kong, Peoples R China
[3] Hong Kong Polytech Univ, Res Inst Smart Ageing, Hong Kong, Peoples R China
[4] Hong Kong Polytech Univ, HK PolyU PekingU Res Ctr Chinese Linguist, Hong Kong, Peoples R China
[5] Educ Univ Hong Kong, Dept Linguist & Modern Language Studies, Hong Kong, Peoples R China
来源
FRONTIERS IN HUMAN NEUROSCIENCE | 2023年 / 17卷
关键词
multi-sensory perception; visual-auditory integration; prosodic focus; focus perception; Japanese; Cantonese; CULTURAL-DIFFERENCES; FACIAL EXPRESSIONS; PROMINENCE; LANGUAGE; RATINGS; ACCENT;
D O I
10.3389/fnhum.2023.1237395
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
IntroductionSpeech communication is multi-sensory in nature. Seeing a speaker's head and face movements may significantly influence the listeners' speech processing, especially when the auditory information is not clear enough. However, research on the visual-auditory integration speech processing has left prosodic perception less well investigated than segmental perception. Furthermore, while native Japanese speakers tend to use less visual cues in segmental perception than in other western languages, to what extent the visual cues are used in Japanese focus perception by the native and non-native listeners remains unknown. To fill in these gaps, we test focus perception in Japanese among native Japanese speakers and Cantonese speakers who learn Japanese, using auditory-only and auditory-visual sentences as stimuli.MethodologyThirty native Tokyo Japanese speakers and thirty Cantonese-speaking Japanese learners who had passed the Japanese-Language Proficiency Test with level N2 or N3 were asked to judge the naturalness of 28 question-answer pairs made up of broad focus eliciting questions and three-word answers carrying broad focus, or contrastive or non-contrastive narrow focus on the middle object words. Question-answer pairs were presented in two sensory modalities, auditory-only and visual-auditory modalities in two separate experimental sessions.ResultsBoth the Japanese and Cantonese groups showed weak integration of visual cues in the judgement of naturalness. Visual-auditory modality only significantly influenced Japanese participants' perception when the questions and answers were mismatched, but when the answers carried non-contrastive narrow focus, the visual cues impeded rather than facilitated their judgement. Also, the influences of specific visual cues like the displacement of eyebrows or head movements of both Japanese and Cantonese participants' responses were only significant when the questions and answers were mismatched. While Japanese participants consistently relied on the left eyebrow for focus perception, the Cantonese participants referred to head movements more often.DiscussionThe lack of visual-auditory integration in Japanese speaking population found in segmental perception also exist in prosodic perception of focus. Not much foreign language effects has been found among the Cantonese-speaking learners either, suggesting a limited use of facial expressions in focus marking by native and non-native Japanese speakers. Overall, the present findings indicate that the integration of visual cues in perception of focus may be specific to languages rather than universal, adding to our understanding of multisensory speech perception.
引用
收藏
页数:16
相关论文
共 73 条
  • [1] Adobe Premiere Pro Help, Adjusting volume levels
  • [2] [Anonymous], 2016, E-Prime 3.0
  • [3] Information structure: linguistic, cognitive, and processing approaches
    Arnold, Jennifer E.
    Kaiser, Elsi
    Kahn, Jason M.
    Kim, Lucy K.
    [J]. WILEY INTERDISCIPLINARY REVIEWS-COGNITIVE SCIENCE, 2013, 4 (04) : 403 - 413
  • [4] Asiaee M., 2016, Lang. Relat. Res, V7, P258
  • [5] Random effects structure for confirmatory hypothesis testing: Keep it maximal
    Barr, Dale J.
    Levy, Roger
    Scheepers, Christoph
    Tily, Harry J.
    [J]. JOURNAL OF MEMORY AND LANGUAGE, 2013, 68 (03) : 255 - 278
  • [6] Bauer R. S., 1997, Modern Cantonese Phonology, DOI DOI 10.1515/9783110823707
  • [7] The ventriloquist effect does not depend on the direction of deliberate visual attention
    Bertelson, P
    Vroomen, J
    de Gelder, B
    Driver, J
    [J]. PERCEPTION & PSYCHOPHYSICS, 2000, 62 (02): : 321 - 332
  • [8] Beskow J, 2006, P 9 INT C SPOK LANG
  • [9] Matsumoto and Ekman's Japanese and Caucasian Facial Expressions of Emotion (JACFEE): Reliability data and cross-national differences
    Biehl, M
    Matsumoto, D
    Ekman, P
    Hearn, V
    Heider, K
    Kudoh, T
    Ton, V
    [J]. JOURNAL OF NONVERBAL BEHAVIOR, 1997, 21 (01) : 3 - 21
  • [10] STUDIES IN COLLOQUIAL JAPANESE IV PHONEMICS
    Bloch, Bernard
    [J]. LANGUAGE, 1950, 26 (01) : 86 - 125