摘要
ETL即数据抽取、转换、装载的过程,它是构建数据仓库的重要环节,而数据仓库是面向主题的、集成的、稳定的且随时间不断变化的数据集合。数据清洗是一个减少错误和不一致性、解决对象识别的过程,目前有很多数据清洗研究和ETL研究,但是如何在ETL过程中进行有效的数据清洗,此方面研究不多。本文将以此为问题出发点,探讨ETL中的数据清洗技术在税务系统(贵州省省直属局和九个地市州的原始数据)中的应用。
ETL namely data extraction,conversion,loading process.It is the building the important link of the data warehouse.The data warehouse is the theme for,integrated,stable and the changed with time data set. Data cleaning is a reducing errors and inconsistencies,and solve the object recognition process.At present there are many data cleaning research and study,but how to ETL ETL process of effective data cleaning,this research is not much.This paper will be based on the starting point,this paper discusses the problem of cleaning technology in data ETL tax system(guizhou province ZhiShuJu and nine cities and states of the original data) application.
出处
《科技广场》
2011年第11期65-67,共3页
Science Mosaic
关键词
ETL
中间数据库
目标数据库
MIS系统(管理信息系统)
Extraction-Transformation-Loading
Staging Database
Target Database
Management Information System