Limitations of Autoregressive Models and Their Alternatives

Cited by: 0
Authors
Lin, Chu-Cheng [1]
Jaech, Aaron [2]
Li, Xin [1]
Gormley, Matthew R. [3]
Eisner, Jason [1]
Affiliations
[1] Johns Hopkins Univ, Dept Comp Sci, Baltimore, MD 21218 USA
[2] Facebook AI, New York, NY USA
[3] Carnegie Mellon Univ, Machine Learning Dept, Pittsburgh, PA USA
Source
2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021) | 2021
Funding
National Science Foundation (United States);
Keywords
STATISTICAL-MODELS;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Standard autoregressive language models perform only polynomial-time computation to compute the probability of the next symbol. While this is attractive, it means they cannot model distributions whose next-symbol probability is hard to compute. Indeed, they cannot even model them well enough to solve associated easy decision problems for which an engineer might want to consult a language model. These limitations apply no matter how much computation and data are used to train the model, unless the model is given access to oracle parameters that grow superpolynomially in sequence length. Thus, simply training larger autoregressive language models is not a panacea for NLP. Alternatives include energy-based models (which give up efficient sampling) and latent-variable autoregressive models (which give up efficient scoring of a given string). Both are powerful enough to escape the above limitations.
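As a rough illustration of the three model families the abstract contrasts, the factorizations below may help. The notation is assumed here and does not come from this record: x = x_1 ... x_n is a string, z is a latent variable, E is an energy function, and Z is its normalizing constant.

```latex
% Autoregressive: a product of local conditionals, each assumed
% computable in polynomial time.
p(\mathbf{x}) = \prod_{t=1}^{n} p(x_t \mid x_1, \dots, x_{t-1})

% Energy-based: the unnormalized score exp(-E(x)) is cheap to compute,
% but efficient sampling is given up.
p(\mathbf{x}) = \frac{\exp(-E(\mathbf{x}))}{Z},
\qquad Z = \sum_{\mathbf{x}'} \exp(-E(\mathbf{x}'))

% Latent-variable autoregressive: the joint over (x, z) factors
% autoregressively, but scoring a given x requires summing over z.
p(\mathbf{x}) = \sum_{\mathbf{z}} p(\mathbf{x}, \mathbf{z})
```

Under these assumptions, the tradeoff named in the abstract is visible in the sums: the energy-based form hides an intractable sum in Z, and the latent-variable form hides one in the marginalization over z.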
Pages: 5147-5172
Page count: 26