摘要
目的探索空气污染与疾病关系的时间序列分析中门急诊数据快速清洗及自动分类统计汇总的方法。方法根据数据特征,制定清洗规则集,使用SQL语言分模块编写程序组,实现数据快速清洗和统计报表自动生成。结果该方法能够快速准确地清洗不同特征的个案数据,根据ICD-10自动计算分病种日就诊量,自动生成统计报表。结论该方法可以灵活、准确、快速地处理大量级数据,适用于医院、急救、死因、健康档案等个案数据,具有较强的实用价值,是开展空气污染健康风险评估的必要手段。
Objective To develop a method for cleaning the outpatient data rapidly and generating the statistical reports automatically. Methods Formulating the cleaning rules according to the data characters,writing programs to clean the individual cases and generate the statistical reports using the SQL language. Results It could clean the different individual cases rapidly,calculate the daily outpatient visits and generate the statistical reports automatically with high accuracy. Conclusion The method can apply to the data processing of the hospital cases,first aid cases,cause-of-death data and health records. It not only can process large amounts of data flexibly,conveniently and quickly,but also has great practical value. So it is the necessary way to the health risk assessment of air pollution.
出处
《卫生研究》
CAS
CSCD
北大核心
2016年第4期624-630,共7页
Journal of Hygiene Research
基金
公益性行业科研专项(No.201402022)
关键词
时间序列分析
数据清洗
疾病分类
统计报表
SQL
analysis of time series
data cleaning
classification of disease
statistical report
SQL