Abstract
To meet data quality requirements in distributed application systems, a shared environment for data cleaning methods and components is needed. This paper proposes an integrated model of data cleaning methods and components, comprising a method model, a process model, and a component model, designed to support the retrieval and selection of components when they are used. A network mapping diagram describes how the process model and the method model combine, and instances of data cleaning methods are given in it. Data cleaning components are described with a formal language, and XML Schema is used to define five facets (Header, Deployment, Form, Function, and Implementation) together with their sub-facets. Taking the data deletion task as a case study, the paper details the design process and algorithm description of the data deletion and recovery method, gives the XML Schema representation of the corresponding component, and shows two operation interfaces of its implementation. The proposed integration of methods and components has been applied in actual scientific research projects.
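As a rough illustration of the five-facet component description, the following Python sketch builds an XML instance with one element per facet and checks a component against a retrieval criterion. It is only a sketch under stated assumptions: the abstract names the five facets but not their sub-facets, so the sub-facet names (`Name`, `Platform`, `Input`, `Task`, `Language`), the `Component` root element, and the helper functions `describe_component` and `matches` are all hypothetical, and the code produces an XML instance rather than the XML Schema designed in the paper.

```python
import xml.etree.ElementTree as ET

# The five facet names come from the abstract; everything else is assumed.
FACETS = ("Header", "Deployment", "Form", "Function", "Implementation")


def describe_component(values):
    """Build a <Component> element with one child element per facet.

    `values` maps facet name -> {sub-facet name: text}. The sub-facet
    names are hypothetical; the abstract does not list them.
    """
    missing = [facet for facet in FACETS if facet not in values]
    if missing:
        raise ValueError(f"missing facets: {missing}")
    root = ET.Element("Component")
    for facet in FACETS:
        facet_el = ET.SubElement(root, facet)
        for sub_name, text in values[facet].items():
            ET.SubElement(facet_el, sub_name).text = text
    return root


def matches(component, facet, sub_facet, wanted):
    """Return True if the given sub-facet of a facet has the text `wanted`."""
    return component.findtext(f"{facet}/{sub_facet}") == wanted


# Hypothetical description of a data deletion component.
delete_component = describe_component({
    "Header": {"Name": "DataDelete"},
    "Deployment": {"Platform": "any"},
    "Form": {"Input": "table"},
    "Function": {"Task": "deletion"},
    "Implementation": {"Language": "unspecified"},
})
```

A retrieval step could then filter a collection of such descriptions, for example with `matches(delete_component, "Function", "Task", "deletion")`, which is one simple way to read the abstract's requirement that the description support searching and selection.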
Source
Journal of Petrochemical Universities (《石油化工高等学校学报》)
CAS
2005, No. 2, pp. 67-71 (5 pages)
Funding
General Program of the Science and Technology Development Plan of the Beijing Municipal Commission of Education (KM200510017006).