Rigid vs Non-Rigid Face and Head Motion in Phone and Tone Perception

Cited by: 0
Authors
Burnham, Denis [1]
Reynolds, Jessica [1]
Vignali, Guillaume [1]
Bollwerk, Sandra [1]
Jones, Caroline [2]
Affiliations
[1] Univ Western Sydney, MARCS Auditory Labs, Penrith, NSW, Australia
[2] Univ New South Wales, Sch Educ, Sydney, NSW, Australia
Source
INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4 | 2007
Keywords
speech perception; auditory-visual speech perception; lexical tone; Cantonese
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
There is recent evidence that the visual concomitants not only of the articulation of phones (consonants and vowels) but also of tones (fundamental frequency variations that signal lexical meaning in tone languages) facilitate speech perception. Analysis of speech production data from a Cantonese speaker suggests that the source of this perceptual information for tones involves rigid motion of the head rather than non-rigid face motion. A perceptual discrimination study was conducted using OPTOTRAK output in which rigid or non-rigid motion of the head could be presented independently, using two conditions: one in which the words to be discriminated differed only in tone, and another in which they differed only in phone. The results suggest that non-rigid motion is the critical determinant for successful discrimination of phones, whereas both non-rigid and rigid motion are required for the discrimination of tones.
Pages: 1405 / +
Page count: 2