一种基于CUDA的局部敏感哈希算法被引量：1

A Locality Sensitive Hashing Algorithm Based on CUDA

下载PDF

导出

摘要传统的局部敏感哈希算法建立哈希表时往往需要较大的内存空间以及较长的建立时间.在查询阶段,查询样本K个最近邻数据项的所需时间超过整个运行时间的95%.针对这些问题,运用计算设备架构将局部敏感哈希算法移植至图形处理器,并用多线程并行计算数据项的哈希值来建立哈希表.查询阶段在全局内存中引入基于工作队列的多样本查询,以提高算法的运行效率.实验结果表明,所提出的算法与传统的局部敏感哈希算法相比,能在不降低运算精度的情况下将运算速度提高近12倍. The traditional locality sensitive hashing （LSH） algorithm often takes a large memory space and long settling time to build a hash table. In the query phase, it takes more than 95% of the overall running time to search K nearest neighbor data items of samples. To solve these problems, this paper uses the compute unified device architecture （CUDA） to transplant LSH algorithm into a GPU, using parallel multi-threads to calculate the values of data items to build the hash table. In the query phase, we introduce multi- sample query based on work queue to global memory to improve efficiency of the algorithm. Experimental results show that computing speed of the proposed LSH algorithm is 12 times faster than the traditional LSH algorithm.

作者张一凡余小清安炫东万旺根

机构地区上海大学通信与工程学院上海大学智慧城市研究院

出处《应用科学学报》 CAS CSCD 北大核心 2015年第5期550-558,共9页 Journal of Applied Sciences

基金国家"863"高技术研究发展计划基金(No.2013AA01A603) 上海市教育委员会科研创新项目基金(No.14YZ011)资助

关键词计算设备架构图形处理器局部敏感哈希 K最近邻 compute unified device architecture （CUDA）； graphics processing unit （GPU）；locality sensitive hashing； K-nearest neighbor

分类号 TP332 [自动化与计算机技术—计算机系统结构]

引文网络
相关文献

参考文献15

1凌康..基于位置敏感哈希的相似性搜索技术研究[D].南京大学,2012:
2DONKIN D. Multidimensional search problem [J]. Siam Journal on Computing, 1976, 5(2): 181- 186. 被引量：1
3YEH T, LEE J, DARRELL W. Adaptive vocabulary forests br dynamic indexing and category learning [C]//Proceedings of the llth IEEE International Conference on Computer Vision Rio de Janeiro, 2007. Washington, DC, USA: IEEE Computer Society, 2007: 1-8. 被引量：1
4INDYK P, MOTWANI R. Approximate nearest neighbors: towards removing the curse of dimen- sionality [C]//Proceedings of 30th ACM Symposium on Theory of Computing. New York, USA, 1998: 604-613. 被引量：1
5GIONIS A, INDYK P. Similarity search in high dimensions via hashing [C]//Proceedings of the 25th International Conference on Very Large Data Bases, Edinburgh, UK, 1999: 518-529. 被引量：1
6Wu Y, SHOu L D, CgEN G. CB-LSH: an efficient LSH indexing algorithm based on compressed bitmap [J]. Journal of Zhejiang University, 2012, 3(46): 376-385. 被引量：1
7YIN S Y, BADR M, VODISLAV D. Dynamic for approximate nearest neighbor search [J]. 48-62. multi-probe LSH: an I/O efficient index structure Lecture Notes in Computer Science, 2013, 1(8055). 被引量：1
8Gu X G, ZHANG Y D, ZHANG L. An improved method of locality sensitive hashing for indexing large-scale and high-dimensional features [J]. Signal Processing, 2013, 93(8): 2244-2255. 被引量：1
9JIANG K, QUE Q C, KULIS B. Revisiting kernelized locality-sensitive hashing for improved large- scale image retrieval [C]//IEEE Conference on Computer Vision and Pattern Recognition, 2014. 被引量：1
10雷德川..基于CUDA的GPU加速迭代重建算法研究[D].中国工程物理研究院,2012:

二级参考文献17

1INDYK P,MOTWANI R.Approximate nearest neighbors:Towards removing the curse of dimensionality[C].Pro-ceedings of 30th ACM Symposium on Theory ofComputing.New York,USA,1998.604-613. 被引量：1
2GIONIS AI,NDYK P,MOTWANI R.Similarity search inhigh dimensions via hashing[C].Proceedings of the 25thInternational Conference on Very Large Data Bases,Edin-burgh,UK,1999.518-529. 被引量：1
3CHARIKAR M.Similarity Estimation Techniques fromRounding Algorithms[C].ACM Symposium on Theory ofComputing,2002. 被引量：1
4DATAR M,IMMORLICA N,INDYK P,et al.Locality-Sensitive Hashing Scheme Based on p-Stable Distributions[C].Proceedings of the 20th ACM Symposium on Com-putational Geometry(SOCG),2004:253-262. 被引量：1
5GRAUMAN K,DARRELL T.Pyramid Match Hashing:Sub-Linear Time Indexing Over Partial Correspondences[C].IEEE Conference on Computer Vision and PatternRecognition(CVPR),2007:1-8. 被引量：1
6JAIN P,KULIS B,GRAUMAN K.Fast Image Search forLearned Metrics[C].IEEE Conference on ComputerVision and Pattern Recognition(CVPR),2008:1-8. 被引量：1
7INDYK P.Stable Distributions,PseudorandomGenerators,Embeddings,and Data Stream Computation[C].Proceedings.41st IEEE Symposium on Foundationsof Computer Science,2000:189-197. 被引量：1
8JING Y S,BALUJA S.VisualRank:Applying PageRankto Large-Scale Image Search[J].IEEE Transactions onPattern Analysis and Machine Intelligence.2008,30(11):1877-1890. 被引量：1
9RYYNNEN M,KLAPURI A.Query by humming ofmidi and audio using locality sensitive hashing[C].2008IEEE International Conference on Acoustics,Speech andSignal Processing,Las Vegas,Nevada,USA,2008.2249-2252. 被引量：1
10BALUJA S,COVELL M,IOFFE S.Permutationgrouping:intelligent Hash function design for audio&image retrieval[C].IEEE International Conference onAcoustics,Speech and Signal Processing,Las Vegas,Nevada,USA,2008.2137-2140. 被引量：1

共引文献2

1俞鹏飞,张新峰,王敏捷.基于乐纹特征和倒排索引的音乐检索系统[J].计算机应用与软件,2014,31(10):45-48. 被引量：2
2詹增荣,程丹.基于核化局部敏感哈希的快速文档检索方法[J].湖南科技大学学报（自然科学版）,2019,34(3):75-83. 被引量：1

同被引文献3

1许相莉,张利彪,于哲舟,周春光.多粒度颜色特征在图像检索中的应用(英文)[J].应用科学学报,2009,27(1):56-61. 被引量：5
2赵永威,李弼程,高毫林.一种基于精确欧氏位置敏感哈希的目标检索方法[J].应用科学学报,2012,30(4):349-355. 被引量：3
3郑永军,张连海.基于动态匹配词格检索的关键词检测[J].应用科学学报,2014,32(2):149-155. 被引量：2

引证文献1

1李蕾,岑翼刚,赵瑞珍,崔丽鸿,王艳红.基于欧氏距离双比特嵌入哈希的图像检索[J].应用科学学报,2017,35(2):193-206.

1刘根平.集中式环境下的局部敏感哈希算法综述[J].移动通信,2015,39(10):46-51. 被引量：1
2曹玉东,刘福英,蔡希彪.基于局部敏感哈希算法的图像高维数据索引技术的研究[J].辽宁工业大学学报（自然科学版）,2013,33(1):1-3. 被引量：6
3李红梅,郝文宁,陈刚.基于精确欧氏局部敏感哈希的协同过滤推荐算法[J].计算机应用,2014,34(12):3481-3486. 被引量：9
4阴爱英.基于线程并行计算的Apriori算法[J].西安科技大学学报,2014,34(1):71-74. 被引量：6
5郑志学,李长云,倪伟.一种基于多线程加密的防伪二维码的生成方法[J].湖南工业大学学报,2016,30(5):41-44. 被引量：2
6刘健,董倩兰.Observer模式在.NET多线程并行计算中的应用[J].计算机时代,2010(11):13-16. 被引量：3
7卢立托,李攀峰,马洪浩.基于GPU的不规则三角网向规则格网数字高程模型转换算法优化[J].计算机应用,2015,35(A01):32-34. 被引量：2
8Rob Farber.CUDA——了解和使用共享内存[J].程序员,2008(11):114-115.
9马俊强.Linux内核新旧工作队列机制的剖析和比较[J].电脑知识与技术,2014,10(2X):1227-1230.
10王启明,甘泉.云环境下基于SLA的工作队列调配算法研究[J].系统仿真技术,2015,11(4):307-312.

应用科学学报

2015年第5期

浏览历史

内容加载中请稍等...

一种基于CUDA的局部敏感哈希算法被引量：1

参考文献15

二级参考文献17

共引文献2

同被引文献3

引证文献1

相关作者

相关机构

相关主题

浏览历史

一种基于CUDA的局部敏感哈希算法 被引量：1

参考文献15

二级参考文献17

共引文献2

同被引文献3

引证文献1

相关作者

相关机构

相关主题

浏览历史

一种基于CUDA的局部敏感哈希算法被引量：1