
Web services clustering based on Word2Vec and LDA topic model (基于Word2Vec和LDA主题模型的Web服务聚类方法)

Cited by: 11
Abstract: To efficiently discover Web services that meet user needs, and to address the problem that Web service description texts are short and lack sufficient effective information, a Web service clustering method based on Word2Vec and the LDA topic model is proposed. First, the Wikipedia corpus is used as the expansion source, and Word2Vec is used to expand the content of the Web service description documents. The expanded description documents are then modeled with the topic model, which turns short-text topic modeling into long-text topic modeling and expresses the topics of the service content more accurately. Finally, similar services are found according to the topic distribution matrix of the documents, and clustering is completed. Experiments were carried out on real data collected from ProgrammableWeb. The results show that, compared with the TFIDF-K, LDA, WT-LDA and LDA-K methods, the F obtained by the proposed method increases by 419.74%, 20.11%, 15.60% and 27.80%, respectively. Clustering on the expanded Web service description documents can effectively improve Web service clustering results.
Authors: XIAO Qiaoxiang (肖巧翔); CAO Buqing (曹步清); ZHANG Xiangping (张祥平); LIU Jianxun (刘建勋); LI Yanxinwen (李晏新闻). Affiliations: Hunan University of Science & Technology, Xiangtan 411201, China; State Key Laboratory of Networking and Switching Technology, Beijing University of Posts and Telecommunications, Beijing 100876, China; College of Navigation, Quanzhou Normal University, Quanzhou 362699, China.
Source: Journal of Central South University: Science and Technology (《中南大学学报(自然科学版)》), 2018, No. 12, pp. 2979-2985 (7 pages). Indexed in: EI, CAS, CSCD, PKU Core (北大核心).
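Funding: National Natural Science Foundation of China (61873316, 61872139); Natural Science Foundation of Hunan Province (2017JJ2098); Open Project of the State Key Laboratory of Networking and Switching Technology, Beijing University of Posts and Telecommunications (SKLNST-2016-2-26).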
Keywords: Web services; Word2Vec; LDA topic model; K-means algorithm; Web service clustering
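
The expansion step described in the abstract (adding semantically close words to a short service description before topic modeling) can be illustrated with a minimal sketch. This is not the authors' implementation: the tiny hand-written vector table `TOY_VECTORS` stands in for a Word2Vec model trained on Wikipedia, its values are invented, and the helper names `most_similar` and `expand_document` and the parameter `top_n` are hypothetical. In a fuller pipeline, the expanded documents would then be passed to an LDA model and the resulting topic distributions clustered (the keywords name K-means).

```python
import math

# Invented 3-dimensional vectors standing in for a Word2Vec model trained on
# Wikipedia. Real embeddings would have hundreds of dimensions.
TOY_VECTORS = {
    "map": [0.9, 0.1, 0.0],
    "location": [0.8, 0.2, 0.1],
    "route": [0.85, 0.15, 0.05],
    "photo": [0.1, 0.9, 0.0],
    "image": [0.15, 0.85, 0.05],
    "payment": [0.0, 0.1, 0.9],
    "invoice": [0.05, 0.15, 0.85],
}


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def most_similar(word, vectors, top_n=2):
    """Return the top_n vocabulary words closest to `word` (hypothetical helper)."""
    if word not in vectors:
        return []  # out-of-vocabulary words cannot be expanded
    scored = [
        (other, cosine(vectors[word], vec))
        for other, vec in vectors.items()
        if other != word
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return [other for other, _ in scored[:top_n]]


def expand_document(tokens, vectors, top_n=2):
    """Append each token's nearest neighbours to the document, without duplicates."""
    expanded = list(tokens)
    for token in tokens:
        for neighbour in most_similar(token, vectors, top_n):
            if neighbour not in expanded:
                expanded.append(neighbour)
    return expanded


# A short, tokenized service description (illustrative only).
short_description = ["map", "api"]
expanded_description = expand_document(short_description, TOY_VECTORS)
print(expanded_description)
```

The original tokens are kept in place and neighbours are only appended, so the expanded document remains a superset of the original description. The number of neighbours per word (`top_n` here) is an assumed tuning knob; the abstract does not say how many words are added per term.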
