Exploiting Siamese Neural Networks on Short Text Similarity Tasks for Multiple Domains and Languages

Cited by: 5

Authors:

Andrioli de Souza, Joao Vitor [1]

Oliveira, Lucas Emanuel Silva E. [1]

Gumiel, Yohan Bonescki [1]

Carvalho, Deborah Ribeiro [1]

Cabral Moro, Claudia Maria [1]

Affiliations:

[1] Pontifical Catholic Univ Parana PUCPR, Grad Program Hlth Technol PPGTS, Curitiba, Parana, Brazil

Source:

COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROPOR 2020 | 2020 / Vol. 12037

Keywords:

Semantic Textual Similarity; Siamese neural networks; Shared tasks;

DOI:

10.1007/978-3-030-41505-1_34

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Semantic textual similarity algorithms are essential to several natural language processing tasks, such as document clustering and text summarization. Many shared tasks on this subject have been held over the last few years, but they generally focused on a single domain and/or language. The Siamese Neural Network (SNN) is well known for its ability to compute similarity while requiring less training data. We propose an SNN architecture that incorporates language-independent features, aiming to perform short text similarity calculation in multiple languages and domains. We explored three different corpora from shared tasks: ASSIN 1 and ASSIN 2 (Portuguese journalistic texts) and N2C2 (English clinical texts). We adapted the SNN proposed by Mueller and Thyagarajan (2016) in two ways: (i) the activation functions were changed from the sigmoid function to ReLU; and (ii) the architecture was extended to accept three new lexical features and an embedding layer to infer the values of the pre-trained word embeddings. The evaluation was performed using the Pearson correlation (PC) and the Mean Squared Error (MSE) between the models' predicted values and the corpora's gold standard. Our approach achieved better results than the baseline in both languages and domains.
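The two evaluation measures named in the abstract, Pearson correlation and Mean Squared Error between predicted and gold-standard similarity scores, can be sketched in plain Python as follows. This is only an illustrative sketch, not the authors' evaluation code: the function names and the example scores are hypothetical and are not taken from ASSIN or N2C2.

```python
import math


def pearson_correlation(predicted, gold):
    """Pearson correlation between two equal-length score lists."""
    n = len(predicted)
    if n != len(gold) or n < 2:
        raise ValueError("need two lists of equal length with at least 2 items")
    mean_p = sum(predicted) / n
    mean_g = sum(gold) / n
    # Sum of co-deviations and sums of squared deviations from the means.
    cov = sum((p - mean_p) * (g - mean_g) for p, g in zip(predicted, gold))
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    var_g = sum((g - mean_g) ** 2 for g in gold)
    if var_p == 0 or var_g == 0:
        raise ValueError("correlation is undefined for constant scores")
    return cov / math.sqrt(var_p * var_g)


def mean_squared_error(predicted, gold):
    """Average squared difference between predicted and gold scores."""
    n = len(predicted)
    if n != len(gold) or n == 0:
        raise ValueError("need two non-empty lists of equal length")
    return sum((p - g) ** 2 for p, g in zip(predicted, gold)) / n


# Hypothetical similarity scores, made up for illustration only.
gold_scores = [1.0, 2.5, 4.0, 5.0]
predicted_scores = [1.5, 2.0, 4.5, 4.5]

print("PC :", pearson_correlation(predicted_scores, gold_scores))
print("MSE:", mean_squared_error(predicted_scores, gold_scores))
```

A higher Pearson correlation and a lower MSE both indicate predictions closer to the gold standard; the two are reported together because correlation alone ignores systematic offsets in the predicted scale.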


Pages: 357-367

Number of pages: 11

19 references in total

[1] Agirre E., 2016, P 10 INT WORKSH SEM, P497, DOI 10.18653/v1/S16-1081
[2] Agirre E., 2015, P 9 INT WORKSHOP SEM, P252, DOI 10.18653/v1/S15-2045
[3] Agirre E., 2014, P 8 INT WORKSH SEM E, P81, DOI 10.3115/v1/S14-2010
[4] Agirre E., 2013, P 2 JOINT C LEX COMP, V1, P32
[5] Agirre E., 2012, Proceedings of the First Joint Conference on Lexical and Computational Semantics-Volume 1: Proceedings of the Main Conference and the Shared Task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation, SemEval'12, V1, P385, DOI 10.5555/2387636.2387697
[6] Alves A., 2018, OPENACCESS SER INFOR, V62, P1, DOI 10.4230/OASIcs.SLATE.2018.12
[7] [Anonymous], 2017, P 11 BRAZ S INF HUM
[8] Barbosa L, 2016, LINGUAMATICA, V8, P15
[9] Barrow J., 2017, P 11 INT WORKSH SEM, P180
[10] Bentivogli, Luisa; Bernardi, Raffaella; Marelli, Marco; Menini, Stefano; Baroni, Marco; Zamparelli, Roberto. SICK through the SemEval glasses. Lesson learned from the evaluation of compositional distributional semantic models on full sentences through semantic relatedness and textual entailment [J]. LANGUAGE RESOURCES AND EVALUATION, 2016, 50 (01): 95-124
