Analysis of the Impact of Machine Translation Evaluation Metrics for Semantic Textual Similarity

被引：1

作者：

Magnolini, Simone ^{[1
,2
]}

Ngoc Phuoc An Vo ^{[3
]}

Popescu, Octavian ^{[4
]}

机构：

[1] Univ Brescia, Brescia, Italy

[2] FBK, Trento, Italy

[3] Xerox Res Ctr Europe, Meylan, France

[4] IBM TJ Watson Res, Yorktown Hts, NY USA

来源：

AI*IA 2016: ADVANCES IN ARTIFICIAL INTELLIGENCE | 2016年 / 10037卷

关键词：

Semantic textual similarity; Machine translation evaluation metrics; Paraphrase recognition;

D O I：

10.1007/978-3-319-49130-1_33

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We present a work to evaluate the hypothesis that automatic evaluation metrics developed forMachine Translation (MT) systems have significant impact on predicting semantic similarity scores in Semantic Textual Similarity (STS) task, in light of their usage for paraphrase identification. We show that different metrics may have different behaviors and significance along the semantic scale [0-5] of the STS task. In addition, we compare several classification algorithms using a combination of different MT metrics to build an STS system; consequently, we show that although this approach obtains remarkable result in paraphrase identification task, it is insufficient to achieve the same result in STS. We show that this problem is due to an excessive adaptation of some algorithms to dataset domain and at the end a way to mitigate or avoid this issue.

引用

页码：450 / 463

页数：14

共 50 条

[31] Semantic Textual Similarity of Portuguese-Language Texts: An Approach Based on the Semantic Inferentialism Model [J].

Pinheiro, Vladia ;

Furtado, Vasco ;

Albuquerque, Adriano .

COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, 2014, 8775 :183-188

[32] Predicting Semantic Textual Similarity of Arabic Question Pairs using Deep Learning [J].

Einea, Omar ;

Elnagar, Ashraf .

2019 IEEE/ACS 16TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA 2019), 2019,

[33] A resource-light method for cross-lingual semantic textual similarity [J].

Glavas, Goran ;

Franco-Salvador, Marc ;

Ponzetto, Simone P. ;

Rosso, Paolo .

KNOWLEDGE-BASED SYSTEMS, 2018, 143 :1-9

[34] Interpretable semantic textual similarity of sentences using alignment of chunks with classification and regression [J].

Majumder, Goutam ;

Pakray, Partha ;

Das, Ranjita ;

Pinto, David .

APPLIED INTELLIGENCE, 2021, 51 (10) :7322-7349

[35] Evaluating text representations for unsupervised legal semantic textual similarity in Brazilian Portuguese [J].

Daniel da Silva Junior ;

Daniel de Oliveira ;

Aline Paes .

Discover Data, 3 (1)

[36] Interpretable semantic textual similarity of sentences using alignment of chunks with classification and regression [J].

Goutam Majumder ;

Partha Pakray ;

Ranjita Das ;

David Pinto .

Applied Intelligence, 2021, 51 :7322-7349

[37] Contrastive Meta-Learner for Automatic Text Labeling and Semantic Textual Similarity [J].

Cooper, Ryan ;

Kliesner, Kenneth W. ;

Zenker, Stephen .

IEEE ACCESS, 2024, 12 :166792-166799

[38] Semantic Textual Similarity Measures for Case-Based Retrieval of Argument Graphs [J].

Lenz, Mirko ;

Ollinger, Stefan ;

Sahitaj, Premtim ;

Bergmann, Ralph .

CASE-BASED REASONING RESEARCH AND DEVELOPMENT, ICCBR 2019, 2019, 11680 :219-234

[39] Evaluating Question generation models using QA systems and Semantic Textual Similarity [J].

Shaheer, Safwan ;

Hossain, Ishmam ;

Sarna, Sudipta Nandi ;

Mehedi, Md Humaion Kabir ;

Rasel, Annajiat Alim .

2023 IEEE 13TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE, CCWC, 2023, :431-435

[40] A New Annotation Method With Five Labels For Constructing Semantic Textual Similarity Corpus [J].

Li, Nan ;

Xiao, Youan .

2019 11TH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS (IHMSC 2019), VOL 2, 2019, :260-263

← 1 2 3 4 5 →