A Novel Sentence Vector Generation Method Based on Autoencoder and Bi-directional LSTM

被引:0
作者
Fukuda, Kiyohito [1 ]
Mori, Naoki [1 ]
Matsumoto, Keinosuke [1 ]
机构
[1] Osaka Prefecture Univ, Naka Ku, 1-1 Gakuen Cho, Sakai, Osaka 5998531, Japan
来源
DISTRIBUTED COMPUTING AND ARTIFICIAL INTELLIGENCE | 2019年 / 800卷
关键词
Sentence vector; Bi-directional long short-term memory; Autoencoder;
D O I
10.1007/978-3-319-94649-8_16
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, dramatic performance improvement in computing has enabled a breakthrough in machine learning technologies. Against this background, generating distributed representation of discrete symbols such as natural languages and images has attracted considerable interest. In the field of natural language processing, word2vec, a method to generate distributed representations of words is well known and its effectiveness well reported. However, an effective method to generate the distributed representation of sentences and documents has not yet been reported. In this study, we propose a method of generating the distributed representation of sentences by using an autoencoder based on bi-directional long short-term memory (BiLSTM). To obtain the information and findings that necessary to generate effective representations, the computational experiments are carried out.
引用
收藏
页码:128 / 135
页数:8
相关论文
共 13 条
  • [1] Learning to forget: Continual prediction with LSTM
    Gers, FA
    Schmidhuber, J
    Cummins, F
    [J]. NEURAL COMPUTATION, 2000, 12 (10) : 2451 - 2471
  • [2] Graves A, 2012, STUD COMPUT INTELL, V385, P1, DOI [10.1162/neco.1997.9.1.1, 10.1007/978-3-642-24797-2]
  • [3] Reducing the dimensionality of data with neural networks
    Hinton, G. E.
    Salakhutdinov, R. R.
    [J]. SCIENCE, 2006, 313 (5786) : 504 - 507
  • [4] Kiros R., 2015, Advances in Neural Information Processing Systems, V28
  • [5] Le Q., 2014, ICML, P1188
  • [6] Logeswaran Lajanugen, 2018, INT C LEARNING REPRE
  • [7] Mikolov T., 2013, ADV NEURAL INFORM PR, V26, P3111
  • [8] Mikolov Tomas, 2013, P 1 INT C LEARN REPR
  • [9] Pennington J., 2014, P 2014 C EMP METH NA, P1532
  • [10] Ponti E.M., 2017, ABS170506369 CORR