Considerations about learning Word2Vec

被引:41
作者
Di Gennaro, Giovanni [1 ]
Buonanno, Amedeo [2 ]
Palmieri, Francesco A. N. [1 ]
机构
[1] Univ Campania Luigi Vanvitelli, Dipartimento Ingn, Via Roma 29, I-81031 Aversa, CE, Italy
[2] ENEA, Dept Energy Technol & Renewable Energy Sources, Res Ctr Portici, PE Fermi 1, Portici, NA, Italy
关键词
Word embedding; Natural language processing; Neural networks; MEMORY;
D O I
10.1007/s11227-021-03743-2
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Despite the large diffusion and use of embedding generated through Word2Vec, there are still many open questions about the reasons for its results and about its real capabilities. In particular, to our knowledge, no author seems to have analysed in detail how learning may be affected by the various choices of hyperparameters. In this work, we try to shed some light on various issues focusing on a typical dataset. It is shown that the learning rate prevents the exact mapping of the co-occurrence matrix, that Word2Vec is unable to learn syntactic relationships, and that it does not suffer from the problem of overfitting. Furthermore, through the creation of an ad-hoc network, it is also shown how it is possible to improve Word2Vec directly on the analogies, obtaining very high accuracy without damaging the pre-existing embedding. This analogy-enhanced Word2Vec may be convenient in various NLP scenarios, but it is used here as an optimal starting point to evaluate the limits of Word2Vec.
引用
收藏
页码:12320 / 12335
页数:16
相关论文
共 33 条
[1]   SynoExtractor: A Novel Pipeline for Arabic Synonym Extraction Using Word2Vec Word Embeddings [J].
Al-Matham, Rawan N. ;
Al-Khalifa, Hend S. .
COMPLEXITY, 2021, 2021
[2]  
Almeida F., 2019, ARXIV190109069
[3]  
ALTSZYLER E, 2017, CONSCIOUS COGNIT
[4]  
[Anonymous], 2016, ARXIV161001520
[5]  
[Anonymous], 2014, CORPUS METHODS SEMAN
[6]  
[Anonymous], 2015, Transactions of the Association for Computational Linguistics, DOI DOI 10.1186/1472-6947-15-S2-S2.ARXIV:1103.0398
[7]   Deep Neural Architecture for Multi-Modal Retrieval based on Joint Embedding Space for Text and Images [J].
Balaneshin-kordan, Saeid ;
Kotov, Alexander .
WSDM'18: PROCEEDINGS OF THE ELEVENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2018, :28-36
[8]  
Baroni M, 2014, PROCEEDINGS OF THE 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, P238
[9]   Distributional Memory: A General Framework for Corpus-Based Semantics [J].
Baroni, Marco ;
Lenci, Alessandro .
COMPUTATIONAL LINGUISTICS, 2010, 36 (04) :673-721
[10]  
Bengio Yoshua, 2012, Neural Networks: Tricks of the Trade. Second Edition: LNCS 7700, P437, DOI 10.1007/978-3-642-35289-8_26