摘要
为了保证语料库的安全性,提出基于分布式集群的语料库防篡改检索方法。通过分布式集群运行结构,采用决策树ID3算法分析节点的检测属性,构建语料库行为样本判定数,挖掘语料库浏览行为数据;采用数据关键特征防篡改检索方法,将监控代码植入到语料库浏览行为数据中,对语料库资源数据各特征函数进行加密处理,从而实现资源数据加密及防篡改检索。实验结果表明,上述方法的浏览行为挖掘效率较高,且能够将篡改信息全部检索出来,防篡改检索错误率较低,应用该方法后可较大程度提高语料库的安全性。
This paper proposes a tamper proof retrieval method based on distributed cluster for ensuring the security of corpus.According to the operation structure of distributed cluster,ID3 algorithm was introduced to analyze the detection attributes of nodes in detail to construct the decision number of corpus behavior samples,thus mining the behavior data of corpus browsing.The key features of data tamper resistant retrieval method were applied.Meanwhile,the feature functions of the corpus resource data were encrypted.Finally,the encryption and tamper proof retrieval of the resource data was achieved.The experimental results show that this method has high efficiency of browsing behavior mining,low tamper proof retrieval error rate,and good application prospect in the field of improving the security performance of corpus.
作者
安玉香
李檀
AN Yu-xiang;LI Tan(International School,Shenyang Jianzhu University,ShenyangLiaoning 110168,China;Shenyang JianzhuUniversity,Liaoning Shenyang 110168,China)
出处
《计算机仿真》
北大核心
2021年第9期460-464,共5页
Computer Simulation