Word Representation Learning in Multimodal Pre-Trained Transformers: An Intrinsic Evaluation

Cited by: 4

Authors:

Pezzelle, Sandro [1]

Takmaz, Ece [1]

Fernandez, Raquel [1]

Affiliations:

[1] Univ Amsterdam, Inst Log Language & Computat, Amsterdam, Netherlands

Source:

TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS | 2021 / Volume 9

Funding:

European Research Council;

Keywords:

DISTRIBUTIONAL SEMANTICS; MODELS;

DOI:

10.1162/tacl_a_00443

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This study carries out a systematic intrinsic evaluation of the semantic representations learned by state-of-the-art pre-trained multimodal Transformers. These representations are claimed to be task-agnostic and shown to help on many downstream language-and-vision tasks. However, the extent to which they align with human semantic intuitions remains unclear. We experiment with various models and obtain static word representations from the contextualized ones they learn. We then evaluate them against the semantic judgments provided by human speakers. In line with previous evidence, we observe a generalized advantage of multimodal representations over language-only ones on concrete word pairs, but not on abstract ones. On the one hand, this confirms the effectiveness of these models in aligning language and vision, which results in better semantic representations for concepts that are grounded in images. On the other hand, models are shown to follow different representation learning patterns, which sheds some light on how and when they perform multimodal integration.
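The evaluation described in the abstract (static word vectors derived from contextualized ones, then compared with human judgments) can be illustrated with a minimal sketch. This is not the authors' pipeline: it assumes that a static vector is the plain average of a word's contextualized vectors, that similarity is cosine, and that agreement with human ratings is measured by Spearman correlation. All function names, vectors, and ratings below are hypothetical toy values.

```python
import numpy as np

def static_vectors(contextual):
    """Average each word's contextualized vectors into one static vector.
    Assumption: plain mean pooling over contexts (the paper may differ)."""
    return {w: np.mean(np.asarray(vs, dtype=float), axis=0)
            for w, vs in contextual.items()}

def cosine(u, v):
    """Cosine similarity between two 1-D vectors."""
    return float(np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v)))

def ranks(x):
    """1-based ranks; tied values share their mean rank."""
    x = np.asarray(x, dtype=float)
    order = np.argsort(x, kind="mergesort")
    r = np.empty(len(x), dtype=float)
    r[order] = np.arange(1, len(x) + 1)
    for val in np.unique(x):
        mask = x == val
        r[mask] = r[mask].mean()
    return r

def spearman(a, b):
    """Spearman correlation computed as Pearson correlation of ranks."""
    return float(np.corrcoef(ranks(a), ranks(b))[0, 1])

def evaluate(contextual, rated_pairs):
    """Correlate model similarities with human ratings for word pairs."""
    vecs = static_vectors(contextual)
    model = [cosine(vecs[a], vecs[b]) for a, b, _ in rated_pairs]
    human = [rating for _, _, rating in rated_pairs]
    return spearman(model, human)

# Hypothetical toy data: two contextualized vectors per word.
toy_contextual = {
    "dog":  [[1.0, 0.0, 0.0], [0.8, 0.2, 0.0]],
    "cat":  [[0.9, 0.1, 0.0], [0.7, 0.3, 0.0]],
    "car":  [[0.0, 1.0, 0.0], [0.1, 0.9, 0.0]],
    "idea": [[0.0, 0.0, 1.0], [0.0, 0.1, 0.9]],
}
# Hypothetical human similarity ratings (made up for illustration).
toy_pairs = [("dog", "cat", 8.0), ("dog", "car", 2.0), ("car", "idea", 1.0)]

print(evaluate(toy_contextual, toy_pairs))
```

In practice the contextualized vectors would come from a model's hidden states and the ratings from a benchmark of human similarity judgments; the sketch only shows the shape of the comparison.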


Pages: 1563-1579

Page count: 17
