摘要
针对数据仓库压缩存储技术,在基于Dwarf立方体模型的基础上,改进立方体存储结构,提出R-Dwarf立方体,进一步对立方体进行压缩,从而达到减少存储冗余以及算法时间复杂度的效果。使用Python对算法进行实现,并以基于TPC-H基准的星型模式数据进行演示,与原算法进行对比。
Aiming at the data warehouse compression storage technology,on the basis of Dwarf cube model,puts forward R-Dwarf cube to improve the cube storage structure and compress the cube further,so as to reduce the storage redundancy and algorithm time complexity.The algorithm is implemented with Python,and uses the star schema benchmark data based on the TPC-H to demonstrate the model,then does the comparison with the original algorithm.
作者
路钰莹
高茂庭
LU Yu-ying, GAO Mao-ting(College of Information Engineering, Shanghai Maritime University, Shanghai 20130)
出处
《现代计算机》
2018年第12期25-31,共7页
Modern Computer