Multi-Relational Topic Model-Based Approach for Web Services Clustering

被引:0
作者
Shi M. [1 ]
Liu J.-X. [1 ]
Zhou D. [1 ]
Cao B.-Q. [1 ]
Wen Y.-P. [1 ]
机构
[1] Key Laboratory of Knowledge Processing&Networked Manufacturing, Hunan University of Science&Technology, Xiangtan, 411201, Hunan
来源
Jisuanji Xuebao/Chinese Journal of Computers | 2019年 / 42卷 / 04期
基金
中国国家自然科学基金;
关键词
Clustering; Multi-relational network; Prior knowledge; Topic model; Web services;
D O I
10.11897/SP.J.1016.2019.00820
中图分类号
学科分类号
摘要
Web service discovery is a significant and nontrivial task in the domain of Web service computing. With the rapid growth in the number of Web services on the Internet, e.g., an increasing number of enterprises tend to make public their software and other resources in the form of services within and outside the organizations, locating exactly the desired Web services is becoming increasingly hard for users. It has been shown that clustering Web services according to their functionalities is an efficient way to facilitate Web services discovery as well as services management. The clustering results can help us better understanding the more fine-granted categorically functional features of Web services, and meanwhile significantly reduce the searching space and retrieval time with respect to a given user query. Existing methods on this topic mainly focus on mining the semantic functional information of Web services, etc., adopting LDA to firstly elicit the functional semantics of Web services and then clustering Web services according to their topic distributions based on some clustering methods such as K-means. However, the natural description documents of Web services generally contain limit number of words. It is hard for most existing LDA-based methods to model short text documents, which may seriously degrade the Web service clustering accuracy. To narrow such negative effect, this paper aims to mitigate the data sparse issue by mining and leveraging some types of auxiliary information that is helpful to the service clustering problem. After a careful exploration of the Web service multi-relational network that is naturally established from users' frequent behaviors (e.g., invoking and annotating) of using Web services, we found that the composition relationships between Web services and the annotation relationships between Web services sharing identical tags could be used to improve the semantics extraction and service clustering processes, i.e., services with annotation relationships tend to share similar functional semantics, whereas services with composition relationships should follow dissimilar latent topic distributions. Based on these observations, we first propose a multi-relational probabilistic topic model, MR-LDA, to simultaneously model the composition relationships as well as the annotation relationships, where services with either composition or annotation relationship will exert an impact on each other during the topic sampling process for each word. Based on the topic model, we further propose an efficient Web service clustering algorithm, MR-LDA+, to firstly revise the obtained topic distribution probabilities of Web services such that above two kinds of relationships information can be explicitly encoded, and then based on it performs the Web services clustering. We extensively evaluate the proposed topic model and clustering approach on a real world dataset crawled from ProgrammableWeb. The experimental comparisons demonstrate that our approach significantly outperforms other state-of-the-art Web services clustering methods. In addition, we also design experiments to verify if the used auxiliary information can help to extract more accurate semantics by conducting service classification and vector visualization tasks based on the Support Vector Machine and t-SNE algorithms, respectively, and both the classification performance and vector visualization results demonstrate the positive impact of the introduced auxiliary relationships information. © 2019, Science Press. All right reserved.
引用
收藏
页码:820 / 836
页数:16
相关论文
共 38 条
  • [1] Xia B., Fan Y., Tan W., Et al., Category-aware API clustering and distributed recommendation for automatic Mashup creation, IEEE Transactions on Services Computing, 8, 5, pp. 674-687, (2015)
  • [2] Cao B., Et al., Mashup service clustering based on an integration of service content and network via exploiting a two-level topic model, Proceedings of the IEEE International Conference on Web Services(ICWS), pp. 212-219, (2016)
  • [3] Shi M., Et al., WE-LDA: A word embeddings augmented LDA model for Web services clustering, Proceedings of the IEEE International Conference on Web Services(ICWS), pp. 9-16, (2017)
  • [4] Elgazzar K., Hassan A., Martin P., Clustering WSDL documents to bootstrap the discovery of Web services, Proceedings of the IEEE International Conference on Web Services (ICWS), pp. 147-154, (2010)
  • [5] Wen T., Sheng G., Li Y., Guo Q., Research on Web service discovery with semantics and clustering, Proceedings of the IEEE Joint International Information Technology and Artificial Intelligence Conference (IJCAI), pp. 62-67, (2011)
  • [6] Xia Y., Chen P., Bao L., Et al., A QoS-aware Web service selection algorithm based on clustering, Proceedings of the IEEE International Conference on Web Services(ICWS), pp. 428-435, (2011)
  • [7] Zhou Z., Sellami M., Gaaloul W., Et al., Data providing services clustering and management for facilitating service discovery and replacement, IEEE Transactions on Automation Science and Engineering, 10, 4, pp. 1131-1146, (2013)
  • [8] Zhang M., Liu X., Zhang R., Sun H., A Web service recommendation approach based on QoS prediction using fuzzy clustering, Proceedings of the IEEE International Conference on Services Computing (ICSOC), pp. 138-145, (2012)
  • [9] Zhu J., Kang Y., Zheng Z., Lyu M.R., A clustering-based QoS prediction approach for Web service recommendation, Proceedings of the IEEE International Symposium on Object/Component/Service-Oriented Real-Time Distributed Computing Workshops (ISORCW), pp. 93-98, (2012)
  • [10] Shi M., Liu J.X., Zhou D., Et al., A probabilistic topic model for Mashup tag recommendation, Proceedings of the IEEE International Conference on Web Services(ICWS), pp. 444-451, (2016)