Incorporating LDA With Word Embedding for Web Service Clustering

被引:15
作者
Zhao, Yi [1 ]
Wang, Chong [1 ]
Wang, Jian [1 ]
He, Keqing [1 ]
机构
[1] Wuhan Univ, Sch Comp Sci, Wuhan, Hubei, Peoples R China
基金
中国国家自然科学基金;
关键词
Latent Dirichlet Allocation; Web Service Clustering; Word Embedding; Word2vec; DISCOVERY; MX;
D O I
10.4018/IJWSR.2018100102
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the rapid growth of web services on the internet, web service discovery has become a hot topic in services computing. Faced with the heterogeneous and unstructured service descriptions, many service clustering approaches have been proposed to promote web service discovery, and many other approaches leveraged auxiliary features to enhance the classical LDA model to achieve better clustering performance. However, these extended LDA approaches still have limitations in processing data sparsity and noise words. This article proposes a novel web service clustering approach by incorporating LDA with word embedding, which leverages relevant words obtained based on word embedding to improve the performance of web service clustering. Especially, the semantically relevant words of service keywords by Word2vec were used to train the word embeddings and then incorporated into the LDA training process. Finally, experiments conducted on a real-world dataset published on ProgrammableWeb show that the authors' proposed approach can achieve better clustering performance than several classical approaches.
引用
收藏
页码:29 / 44
页数:16
相关论文
共 35 条
[1]  
[Anonymous], 2008, Bmvc, DOI DOI 10.5244/C.22.50
[2]  
[Anonymous], 2008, P 17 INT C WORLD WID, DOI DOI 10.1145/1367497.1367605
[3]  
Arthur D., 2007, P 18 ANN ACM SIAM S, DOI DOI 10.1145/1283383.1283494
[4]  
Aznag M., 2013, INT J WEB SERVICE CO, V4
[5]   A Hybrid Meta-Heuristic Approach for QoS-Aware Cloud Service Composition [J].
Bhushan, S. Bharath ;
Reddy, Pradeep C. H. .
INTERNATIONAL JOURNAL OF WEB SERVICES RESEARCH, 2018, 15 (02) :1-20
[6]   A FOUR-LEVEL LINEAR DISCRIMINANT ANALYSIS BASED SERVICE SELECTION IN THE CLOUD ENVIRONMENT [J].
Bhushan, S. Bharath ;
Reddy, Pradeep C. H. .
INTERNATIONAL JOURNAL OF TECHNOLOGY, 2016, 7 (05) :859-870
[7]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[8]   Domain-aware Mashup service clustering based on LDA topic model from multiple data sources [J].
Cao, Buqing ;
Liu, Xiaoqing ;
Liu, Jianxun ;
Tang, Mingdong .
INFORMATION AND SOFTWARE TECHNOLOGY, 2017, 90 :40-54
[9]  
Cerami E., 2002, WEB SERVICES ESSENTI
[10]  
Chen L, 2013, LECT NOTES COMPUT SC, V8274, P162, DOI 10.1007/978-3-642-45005-1_12