Exploiting Siamese Neural Networks on Short Text Similarity Tasks for Multiple Domains and Languages

Cited by: 5
Authors
Andrioli de Souza, Joao Vitor [1]
Oliveira, Lucas Emanuel Silva E. [1]
Gumiel, Yohan Bonescki [1]
Carvalho, Deborah Ribeiro [1]
Cabral Moro, Claudia Maria [1]
Affiliations
[1] Pontifical Catholic Univ Parana PUCPR, Grad Program Hlth Technol PPGTS, Curitiba, Parana, Brazil
Source
COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROPOR 2020 | 2020 / Vol. 12037
Keywords
Semantic Textual Similarity; Siamese neural networks; Shared tasks
DOI
10.1007/978-3-030-41505-1_34
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Semantic textual similarity algorithms are essential to several natural language processing tasks, such as document clustering and text summarization. Many shared tasks on this subject have been held over the last few years, but they generally focused on a single domain and/or language. The Siamese Neural Network (SNN) is well known for its ability to compute similarity while requiring less training data. We propose an SNN architecture that incorporates language-independent features, aiming to perform short text similarity calculation in multiple languages and domains. We explored three corpora from shared tasks: ASSIN 1 and ASSIN 2 (Portuguese journalistic texts) and N2C2 (English clinical texts). We adapted the SNN proposed by Mueller and Thyagarajan (2016) in two ways: (i) the activation functions were changed from the sigmoid function to ReLU; and (ii) we extended the architecture to accept three new lexical features and an embedding layer to infer the values of the pre-trained word embeddings. The evaluation used the Pearson correlation (PC) and the Mean Squared Error (MSE) between the models' predicted values and the corpora's gold standard. Our approach achieved better results than the baseline in both languages and domains.
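The two evaluation metrics named in the abstract can be illustrated with a minimal sketch. This is not the authors' evaluation code: the function names and the toy score lists below are hypothetical and chosen only to show how Pearson correlation and Mean Squared Error compare predicted similarity scores against gold-standard scores.

```python
import math


def pearson_correlation(predicted, gold):
    """Pearson correlation between two equal-length lists of scores."""
    n = len(predicted)
    if n != len(gold) or n < 2:
        raise ValueError("need two lists of equal length with at least 2 items")
    mean_p = sum(predicted) / n
    mean_g = sum(gold) / n
    # Covariance and variances (unnormalized; the 1/n factors cancel out).
    cov = sum((p - mean_p) * (g - mean_g) for p, g in zip(predicted, gold))
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    var_g = sum((g - mean_g) ** 2 for g in gold)
    if var_p == 0 or var_g == 0:
        raise ValueError("correlation is undefined for constant scores")
    return cov / math.sqrt(var_p * var_g)


def mean_squared_error(predicted, gold):
    """Average of squared differences between predictions and gold scores."""
    if len(predicted) != len(gold) or not predicted:
        raise ValueError("need two non-empty lists of equal length")
    return sum((p - g) ** 2 for p, g in zip(predicted, gold)) / len(predicted)


# Hypothetical toy scores (illustrative only, not data from the paper).
predicted_scores = [1.0, 2.0, 3.0, 4.0]
gold_scores = [1.5, 2.5, 3.5, 4.5]

pc = pearson_correlation(predicted_scores, gold_scores)
mse = mean_squared_error(predicted_scores, gold_scores)
print(f"PC = {pc:.3f}, MSE = {mse:.3f}")
```

In this toy case the predictions are offset from the gold scores by a constant, so the correlation is perfect while the MSE is nonzero. The two metrics capture different things, which is one reason for reporting both.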
Pages: 357-367
Page count: 11
Related papers
19 items in total
  • [11] Bromley J., 1993, International Journal of Pattern Recognition and Artificial Intelligence, V7, P669, DOI 10.1142/S0218001493000339
  • [12] Cer D., 2017, ARXIV170800055, DOI 10.18653/v1/S17-2001
  • [13] Chopra S, Hadsell R, LeCun Y. Learning a similarity metric discriminatively, with application to face verification [J]. 2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005: 539-546
  • [14] Fonseca ER, 2016, LINGUAMATICA, V8, P3
  • [15] Hartmann NS, 2016, LINGUAMATICA, V8, P59
  • [16] Mueller J, 2016, AAAI CONF ARTIF INTE, P2786
  • [17] Neculoiu P., 2016, P 1 WORKSH REPR LEAR, P148, DOI 10.18653/v1/W16-1617
  • [18] Ranasinghe T., 2019, RANLP 2019
  • [19] Oliveira LESE, 2019, STUD HEALTH TECHNOL, V264, P123, DOI 10.3233/SHTI190196