Word Embedding with Neural Probabilistic Prior

被引:0
作者
Ren, Shaogang [1 ]
Li, Dingcheng [1 ]
Li, Ping [1 ]
机构
[1] Baidu Res, Cognit Comp Lab, 10900 NE 8th St, Bellevue, WA 98004 USA
来源
PROCEEDINGS OF THE 2024 SIAM INTERNATIONAL CONFERENCE ON DATA MINING, SDM | 2024年
关键词
NONLINEAR ICA;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To improve word representation learning, we propose a probabilistic prior which can be seamlessly integrated with word embedding models. Different from previous methods, word embedding is taken as a probabilistic generative model, and it enables us to impose a prior regularizing word representation learning. The proposed prior not only enhances the representation of embedding vectors but also improves the model's robustness and stability. The structure of the proposed prior is simple and effective, and it can be easily implemented and flexibly plugged in most existing word embedding models. Extensive experiments show the proposed method improves word representation on various tasks.
引用
收藏
页码:896 / 904
页数:9
相关论文
共 43 条
  • [1] Almuhareb Abdulrahman, 2006, Attributes in lexical acquisition
  • [2] Jointly learning word embeddings using a corpus and a knowledge base
    Alsuhaibani, Mohammed
    Bollegala, Danushka
    Maehara, Takanori
    Kawarabayashi, Ken-ichi
    [J]. PLOS ONE, 2018, 13 (03):
  • [3] [Anonymous], 2011, Proc. of 5th IEEE ANTS
  • [4] Distributional Memory: A General Framework for Corpus-Based Semantics
    Baroni, Marco
    Lenci, Alessandro
    [J]. COMPUTATIONAL LINGUISTICS, 2010, 36 (04) : 673 - 721
  • [5] Brazinskas A., 2017, P COLING, P1775
  • [6] Learning User and Product Distributed Representations Using a Sequence Model for Sentiment Analysis
    Chen, Tao
    Xu, Ruifeng
    He, Yulan
    Xia, Yunqing
    Wang, Xuan
    [J]. IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE, 2016, 11 (03) : 35 - 45
  • [7] Clark C, 2018, PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL), VOL 1, P845
  • [8] Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
  • [9] DEFINING CURVATURE OF A STATISTICAL PROBLEM (WITH APPLICATIONS TO 2ND ORDER EFFICIENCY)
    EFRON, B
    [J]. ANNALS OF STATISTICS, 1975, 3 (06) : 1189 - 1217
  • [10] Placing search in context: The concept revisited
    Finkelstein, L
    Gabrilovich, E
    Matias, Y
    Rivlin, E
    Solan, Z
    Wolfman, G
    Ruppin, E
    [J]. ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2002, 20 (01) : 116 - 131