Joint semantic similarity assessment with raw corpus and structured ontology for semantic-oriented service discovery

Cited by: 0

Authors:

Wei Lu

Yuanyuan Cai

Xiaoping Che

Yuxun Lu

Affiliation:

[1] Beijing Jiaotong University, School of Software Engineering

Source:

Personal and Ubiquitous Computing | 2016 / Vol. 20

Keywords:

Joint semantic similarity assessment; Feature fusion; Low-dimensional vector space; WordNet; Service discovery

DOI: Not available


Abstract:

Semantic-oriented service matching is one of the challenges in automatic Web service discovery. Service users may search for Web services using keywords and receive matching services according to their functional profiles. A number of approaches to computing the semantic similarity between words have been developed to enhance the precision of matchmaking; they can be classified into ontology-based and corpus-based approaches. Ontology-based approaches commonly use the differentiated concept information provided by a large ontology to measure lexical similarity with word sense disambiguation. Nevertheless, most ontologies are domain-specific and limited in lexical coverage, which restricts their applicability. Corpus-based approaches, on the other hand, rely on the distributional statistics of context to represent each word as a vector and measure the distance between word vectors. However, polysemy may lead to low computational accuracy. In this paper, in order to augment the semantic information content of word vectors, we propose a multiple semantic fusion (MSF) model that generates a sense-specific vector for each word. In this model, various semantic properties of the general-purpose ontology WordNet are integrated, through vector combination strategies, to fine-tune the distributed word representations learned from a corpus. The retrofitted word vectors are used as semantic vectors for estimating semantic similarity. The MSF model-based similarity measure is validated against other similarity measures on multiple benchmark datasets. Experimental results of word similarity evaluation indicate that our computational method obtains a higher correlation coefficient with human judgment in most cases. Moreover, the proposed similarity measure is shown to improve on the performance of Web service matchmaking based on a single semantic resource. Accordingly, our findings provide a new method and perspective for understanding and representing lexical semantics.
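
The general idea in the abstract (adjusting a corpus-learned word vector with ontology information to obtain a sense-specific vector, then comparing vectors) can be illustrated with a minimal sketch. This is not the authors' MSF model: the weighted-average fusion rule, the `alpha` parameter, the function names, the `word#sense` keys, and the two-dimensional toy vectors are all assumptions made for illustration. A real system would presumably take its vectors from a trained embedding model and its sense neighbors from WordNet.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def fuse(word_vec, neighbor_vecs, alpha=0.5):
    """Hypothetical fusion rule: blend the corpus vector with the mean of the
    vectors of ontology-related words. alpha weights the corpus vector."""
    if not neighbor_vecs:
        return list(word_vec)
    dim = len(word_vec)
    mean = [sum(v[i] for v in neighbor_vecs) / len(neighbor_vecs) for i in range(dim)]
    return [alpha * w + (1 - alpha) * m for w, m in zip(word_vec, mean)]


# Toy corpus vectors, invented for this example (not real embeddings).
corpus_vectors = {
    "bank": [1.0, 1.0],
    "shore": [0.0, 1.0],
    "money": [1.0, 0.0],
}

# Toy sense inventory standing in for ontology relations (hypothetical keys).
sense_neighbors = {
    "bank#river": ["shore"],
    "bank#finance": ["money"],
}


def sense_vector(sense, alpha=0.5):
    """Build a sense-specific vector by fusing the word's corpus vector
    with the vectors of the words related to that sense."""
    word = sense.split("#")[0]
    neighbors = [corpus_vectors[n] for n in sense_neighbors.get(sense, [])]
    return fuse(corpus_vectors[word], neighbors, alpha)


river_bank = sense_vector("bank#river")
finance_bank = sense_vector("bank#finance")

# The river sense should sit closer to "shore" than the finance sense does.
print(cosine(river_bank, corpus_vectors["shore"]))
print(cosine(finance_bank, corpus_vectors["shore"]))
```

In this toy setup the single corpus vector for "bank" is equally similar to "shore" and "money", while the two fused vectors separate the senses. That is the kind of polysemy effect the abstract describes, shown here only under the assumptions above.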

Citation

Pages: 311-323

Page count: 12
