The effects of N-gram probabilistic measures on the recognition and production of four-word sequences

Cited by: 57
Authors
Tremblay, Antoine [1]
Tucker, Benjamin V. [2]
Affiliations
[1] Univ Alberta, IWK Hlth Ctr, Interdisciplinary Res Goldbloom Res Pavil, 5850-5980 Univ Ave POB 9700, Halifax, NS B3K 6R8, Canada
[2] Univ Alberta, Dept Linguist, Edmonton, AB T6G 2E7, Canada
Keywords
multi-word sequences; N-grams; speech processing; speech production; mixed-effects regression; frequency of occurrence; logit; log probability of occurrence; mutual information
DOI
10.1075/ml.6.2.04tre
Chinese Library Classification (CLC) number
H0 [Linguistics]
Subject classification codes
030303; 0501; 050102
Abstract
The present study investigates the processing and production of four-word sequences such as "I don't really know", "at the age of", and "I think it's the". Specifically, we investigate the influence of families of probabilistic measures such as unigram, bigram, trigram, and quadgram frequency of occurrence, logarithmic (log) probability of occurrence, and mutual information. Log probability of occurrence emerged as the predominant predictor family in the onset latency analysis, suggesting that recognition is mainly underpinned by competition between a target N-gram and its family members. In contrast, the amount of experience one has with an N-gram (frequency of occurrence) surfaced as the most prominent predictor in production. Further, probabilistic measures tied to trigrams surfaced as the best predictors in the onset latency analysis, while the measures tied to unigrams were most predictive of production durations. Finally, the interactions between probabilistic measures tied to unigrams, bigrams, trigrams, and quadgrams suggest that N-grams of different lengths are processed in parallel in both recognition and production.
Pages: 302-324
Page count: 23