摘要
首先,从分析电信运营商传统数据架构入手,识别当前面临的数据质量不高、实时性不够、灵活性不足及存储和应用相互制约等问题。然后,讨论引入数据湖技术的可行性,明确数据湖的存储规模化和低成本、数据"原汁原味"和方便易用、应用按需建模等特点,能够为电信运营商数据架构优化提供非常有益的参考。最后,提出数据统一存储、统一标准、近源采集、与应用分离的电信运营商数据湖建设方案,并针对数据湖数据分区、部署、入湖及应用数据动态加载、数据统一管理等关键要点给予明确的阐述。
Firstly, the traditional data architecture of telecom operators was analyzed and the problems such as low quality, lack of real-time, lack of flexibility and mutual constraints of storage and application were identified. Then, the feasibility of introducing data lake technology was discussed, the storage scale and low cost of data lake, the originality of data and ease of use, and the application of on-demand modeling were clarified, which could provide great benefits for telecom operators’ data architecture optimization. Finally, the data lake construction plan of unified storage, unified standards, near-source acquisition, and application separation was proposed, and key points such as data lake data partitioning, deployment, lake entry and application data dynamic loading, and unified data management were clearly stated.
作者
胡军军
谢晓军
石彦彬
喻琦
HU Junjun;XIE Xiaojun;SHI Yanbin;YU Qi(Guangzhou Research Institute of China Telecom Co.,Ltd.,Guangzhou 510630,China)
出处
《电信科学》
2019年第2期84-94,共11页
Telecommunications Science
关键词
数据湖
原生数据存储
存储计算分离
数据目录
data lake
native data storage
loose coupling between storage and computation
data catalog