Unsupervised Approaches for Computing Word Similarity in Portuguese

被引:0
作者
Oliveira, Hugo Goncalo [1 ]
机构
[1] Univ Coimbra, Dept Informat Engn, CISUC, Coimbra, Portugal
来源
PROGRESS IN ARTIFICIAL INTELLIGENCE (EPIA 2017) | 2017年 / 10423卷
关键词
Semantic similarity; Word similarity; Lexical knowledge bases; Lexical semantics; Word embeddings; Distributional semantics; MODELS;
D O I
10.1007/978-3-319-65340-2_67
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents several approaches for computing word similarity in Portuguese and is motivated by the recent availability of state-of-the-art distributional models of Portuguese words, which add to several lexical knowledge bases (LKBs) for this language, available for a longer time. The previous resources were exploited to answer word similarity tests, also recently available for Portuguese. We conclude that there are several valid approaches for this task, but not one that outperforms all the others in every single test. For instance, distributional models seem to capture relatedness better, but LKBs are better suited for computing genuine similarity.
引用
收藏
页码:828 / 840
页数:13
相关论文
共 29 条
[1]  
[Anonymous], 2013, P 17 C COMPUTATIONAL, DOI DOI 10.1007/BF02579642
[2]  
[Anonymous], 2013, P WORKSH TRACK INT C
[3]  
[Anonymous], 2012, P 24 INT C COMP LING
[4]  
[Anonymous], 2016, T ASSOC COMPUT LING, DOI DOI 10.1162/TACL_A_00051
[5]  
[Anonymous], 2008, COMPANION P 14 BRAZI
[6]   Lemon and Tea Are Not Similar: Measuring Word-to-Word Similarity by Combining Different Methods [J].
Banjade, Rajendra ;
Maharjan, Nabin ;
Niraula, Nobal B. ;
Rus, Vasile ;
Gautam, Dipesh .
COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING (CICLING 2015), PT I, 2015, 9041 :335-346
[7]  
Barreiro A., 2010, P 2008 INT NOOJ C NO
[8]  
Barreiro A, 2008, LECT NOTES ARTIF INT, V5190, P202, DOI 10.1007/978-3-540-85980-2_21
[9]  
Budanitsky A, 2006, COMPUT LINGUIST, V32, P13, DOI 10.1162/coli.2006.32.1.13
[10]   Placing search in context: The concept revisited [J].
Finkelstein, L ;
Gabrilovich, E ;
Matias, Y ;
Rivlin, E ;
Solan, Z ;
Wolfman, G ;
Ruppin, E .
ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2002, 20 (01) :116-131