Quantifying the Contextualization of Word Representations with Semantic Class Probing

被引:0
|
作者
Zhao, Mengjie [1 ]
Dufter, Philipp [1 ]
Yaghoobzadeh, Yadollah [2 ]
Schutze, Hinrich [1 ]
机构
[1] Ludwig Maximilians Univ Munchen, CIS, Munich, Germany
[2] Microsoft Turing, Montreal, PQ, Canada
基金
欧洲研究理事会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Pretrained language models achieve state-ofthe-art results on many NLP tasks, but there are still many open questions about how and why they work so well. We investigate the contextualization of words in BERT. We quantify the amount of contextualization, i.e., how well words are interpreted in context, by studying the extent to which semantic classes of a word can be inferred from its contextualized embedding. Quantifying contextualization helps in understanding and utilizing pretrained language models. We show that the top layer representations support highly accurate inference of semantic classes; that the strongest contextualization effects occur in the lower layers; that local context is mostly sufficient for contextualizing words; and that top layer representations are more task-specific after finetuning while lower layer representations are more transferable. Finetuning uncovers task-related features, but pretrained knowledge about contextualization is still well preserved.
引用
收藏
页码:1219 / 1234
页数:16
相关论文
共 50 条
  • [21] Multilingual Semantic Textual Similarity using Multilingual Word Representations
    Ahmed, Mahtab
    Dixit, Chahna
    Mercer, Robert E.
    Khan, Atif
    Samee, Muhammad Rifayat
    Urra, Felipe
    2020 IEEE 14TH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC 2020), 2020, : 194 - 198
  • [22] Utilizing Latent Semantic Word Representations for Automated Essay Scoring
    Jin, Cancan
    He, Ben
    IEEE 12TH INT CONF UBIQUITOUS INTELLIGENCE & COMP/IEEE 12TH INT CONF ADV & TRUSTED COMP/IEEE 15TH INT CONF SCALABLE COMP & COMMUN/IEEE INT CONF CLOUD & BIG DATA COMP/IEEE INT CONF INTERNET PEOPLE AND ASSOCIATED SYMPOSIA/WORKSHOPS, 2015, : 1101 - 1108
  • [23] Word senses and semantic representations - Can we have both?
    Pala, K
    TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2000, 1902 : 109 - 114
  • [24] The Role of Protected Class Word Lists in Bias Identification of Contextualized Word Representations
    Sedoc, Joao
    Ungar, Lyle
    GENDER BIAS IN NATURAL LANGUAGE PROCESSING (GEBNLP 2019), 2019, : 55 - 61
  • [25] Quantifying the Semantic Representations of Adolescents' Memories of Positive and Negative Life Events
    Garcia, Danilo
    Sikstrom, Sverker
    JOURNAL OF HAPPINESS STUDIES, 2013, 14 (04) : 1309 - 1323
  • [26] Quantifying the Semantic Representations of Adolescents’ Memories of Positive and Negative Life Events
    Danilo Garcia
    Sverker Sikström
    Journal of Happiness Studies, 2013, 14 : 1309 - 1323
  • [27] Study on quantifying the semantic correlative relation of Chinese word-pair
    Zhong, Maosheng
    Liu, Hui
    Liu, Lei
    Journal of Information and Computational Science, 2009, 6 (02): : 765 - 774
  • [28] CONTEXTUALIZATION IN THE STUDY OF COGNITION AND SOCIAL REPRESENTATIONS
    JODELET, D
    INTERNATIONAL JOURNAL OF PSYCHOLOGY, 1992, 27 (3-4) : 273 - 273
  • [29] Statistical and Semantic Approaches for Tweet Contextualization
    Zingla, Meriem Amina
    Chiraz, Latiri
    Slimani, Yahya
    Berrut, Catherine
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS 19TH ANNUAL CONFERENCE, KES-2015, 2015, 60 : 498 - 507
  • [30] Homogenization of word relationships in schizophrenia: Topological analysis of cortical semantic representations
    Hayashi, Ryusuke
    Kaji, Shizuo
    Matsumoto, Yukiko
    Nishida, Satoshi
    Nishimoto, Shinji
    Takahashi, Hidehiko
    PSYCHIATRY AND CLINICAL NEUROSCIENCES, 2024, 78 (11) : 687 - 695