Abstract
Existing fuzzy ciphertext search schemes for Chinese keywords rely on predefined fuzzy keyword sets, which leads to high search complexity and large storage overhead. To address this, a multi-keyword fuzzy ciphertext search scheme based on pinyin similarity is proposed for cloud storage. The scheme measures the similarity of Chinese characters using pinyin similarity and Euclidean distance and, building on the Bloom filter, constructs the index with locality-sensitive hash functions based on p-stable distributions, enabling fuzzy ciphertext search over multiple Chinese keywords. Experimental results show that, compared with a keyword-based fuzzy search scheme over encrypted cloud data, the proposed scheme achieves higher search efficiency and lower storage cost while ensuring information security.
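The index construction named in the abstract (a Bloom filter populated through p-stable locality-sensitive hashes) can be illustrated with a minimal sketch. This is not the authors' implementation: the letter-count encoding in `pinyin_vector`, the class names, and the parameters `m_bits`, `k`, `w`, and `seed` are all assumptions made for illustration, and the encryption of the index and query described in the paper is omitted entirely.

```python
import hashlib
import math
import random


def pinyin_vector(pinyin):
    """Hypothetical encoding (not from the paper): 26-dim letter counts."""
    vec = [0.0] * 26
    for ch in pinyin.lower():
        if "a" <= ch <= "z":
            vec[ord(ch) - ord("a")] += 1.0
    return vec


def euclidean(u, v):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(u, v)))


class PStableLSH:
    """h(v) = floor((a . v + b) / w), with a ~ N(0, 1), which is 2-stable."""

    def __init__(self, dim, w, rng):
        self.a = [rng.gauss(0.0, 1.0) for _ in range(dim)]
        self.b = rng.uniform(0.0, w)
        self.w = w

    def __call__(self, v):
        dot = sum(x * y for x, y in zip(self.a, v))
        return math.floor((dot + self.b) / self.w)


class LSHBloomIndex:
    """Bloom filter whose k hash functions are p-stable LSH functions."""

    def __init__(self, m_bits=256, k=3, dim=26, w=4.0, seed=0):
        rng = random.Random(seed)  # assumed parameters, chosen arbitrarily
        self.m = m_bits
        self.bits = [0] * m_bits
        self.hashes = [PStableLSH(dim, w, rng) for _ in range(k)]

    def _positions(self, v):
        # Map each LSH bucket id to a bit position in the filter.
        for i, h in enumerate(self.hashes):
            digest = hashlib.sha256(f"{i}:{h(v)}".encode()).digest()
            yield int.from_bytes(digest[:8], "big") % self.m

    def add(self, pinyin):
        for pos in self._positions(pinyin_vector(pinyin)):
            self.bits[pos] = 1

    def might_contain(self, pinyin):
        vec = pinyin_vector(pinyin)
        return all(self.bits[pos] for pos in self._positions(vec))


index = LSHBloomIndex()
for keyword in ["yuncunchu", "mohu", "sousuo"]:
    index.add(keyword)
print(index.might_contain("sousuo"))   # True: an inserted keyword always matches
print(index.might_contain("shousuo"))  # near miss: likely to match, not guaranteed
```

Because vectors that are close in Euclidean distance tend to fall into the same LSH bucket, a misspelled pinyin query usually sets the same bit positions as the intended keyword, so no predefined fuzzy keyword set has to be stored. As with any Bloom filter, false positives are possible.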
Authors
黄保华
吕琦
莫家威
HUANG Baohua; LU Qi; MO Jiawei (School of Computer and Electronic Information, Guangxi University, Nanning 530004, China)
Source
《计算机工程》
CAS
CSCD
PKU Core (Peking University Core Journals)
2019, Issue 1, pp. 103-108 (6 pages)
Computer Engineering
Funding
National Natural Science Foundation of China (61262072)
Keywords
cloud storage (云存储)
pinyin similarity (拼音相似度)
Bloom filter (布隆过滤器)
Locality-Sensitive Hashing (LSH) (局部敏感哈希)
fuzzy search (模糊搜索)
encrypted query (加密查询)