Comparing the Performance of Neural and Statistical Sentence Embeddings on Summarization and Word Sense Disambiguation

被引:0
作者
Juvekar, Gaurav [1 ]
Lolage, Abhishek [1 ]
Sahasrabudhe, Dhruva [1 ]
Haribhakta, Yashodhara [1 ]
机构
[1] Coll Engn, Dept Comp Engn & Informat Technol, Pune, Maharashtra, India
来源
2018 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI) | 2018年
关键词
sentence embedding; vector; similarity; text summarization; word sense disambiguation;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We analyzed the performance of two sentence embeddings: SIF (Smoothed Inverse Frequency) created using weighted GloVe word embeddings, and sent2vec, trained using a neural network. Using these sentence embeddings without modification, we compared and contrasted their performance on extractive text summarization and word sense disambiguation using existing methods tailored for sentence embeddings. We find that our results are better than the simplest baselines and approach competitive baselines for both these tasks, proving that sentence embeddings are to some extent successful in capturing the structure of language.
引用
收藏
页码:1787 / 1792
页数:6
相关论文
共 21 条
[1]  
Allahyari Mehdi, 2017, ABS170702268 CORR
[2]  
[Anonymous], 2010, Proc. of the 23rd International Conference on Computational Linguistics (Coling 2010)
[3]  
[Anonymous], 2013, ADV NEURAL INFORM PR, DOI DOI 10.48550/ARXIV.1310.4546
[4]  
Arora S., 2017, INT C LEARN REPR
[5]  
Banerjee S., 2002, Computational Linguistics and Intelligent Text Processing. Third International Conference, CICLing 2002. Proceedings (Lecture Notes in Computer Science Vol.2276), P136
[6]  
Banerjee S., 2003, IJCAI
[7]  
Bartunov Sergey, 2015, ABS150207257 CORR
[8]  
Cheng J, 2016, ABS160307252 CORR
[9]  
Ganesan K., 2015, ROUGE 2 0 UPDATED IM
[10]  
Kiros Ryan, 2015, ABS150606726 CORR