小域估计(Small Area Estimation)是抽样调查领域里一个重要的研究方向,国计民生中的许多重要问题,如关于失业率、残疾率、传染病的发病率和民意测验的抽样调查都需要采用不同的小域估计方法。本文针对小域估计问题,以估计方法发展脉络...小域估计(Small Area Estimation)是抽样调查领域里一个重要的研究方向,国计民生中的许多重要问题,如关于失业率、残疾率、传染病的发病率和民意测验的抽样调查都需要采用不同的小域估计方法。本文针对小域估计问题,以估计方法发展脉络为主线,以基于分层贝叶斯分析的小域估计为重点,对小域估计问题的理论、方法和最新进展进行简述,并利用澳大利亚残疾、老龄化和护理人员(SDAC2003)抽样调查实际数据,从分层贝叶斯分析角度对西澳大利亚残疾率进行估计,最后对估计结果进行比较和讨论。展开更多
Generalized Linear Mixed Model (GLMM) has been widely used in small area estimation for health indicators. Bayesian estimation is usually used to construct statistical intervals, however, its computational intensity i...Generalized Linear Mixed Model (GLMM) has been widely used in small area estimation for health indicators. Bayesian estimation is usually used to construct statistical intervals, however, its computational intensity is a big challenge for large complex surveys. Frequentist approaches, such as bootstrapping, and Monte Carlo (MC) simulation, are also applied but not evaluated in terms of the interval magnitude, width, and the computational time consumed. The 2013 Florida Behavioral Risk Factor Surveillance System data was used as a case study. County-level estimated prevalence of three health-related outcomes was obtained through a GLMM;and their 95% confidence intervals (CIs) were generated from bootstrapping and MC simulation. The intervals were compared to 95% credential intervals through a hierarchial Bayesian model. The results showed that 95% CIs for county-level estimates of each outcome by using MC simulation were similar to the 95% credible intervals generated by Bayesian estimation and were the most computationally efficient. It could be a viable option for constructing statistical intervals for small area estimation in public health practice.展开更多
文摘小域估计(Small Area Estimation)是抽样调查领域里一个重要的研究方向,国计民生中的许多重要问题,如关于失业率、残疾率、传染病的发病率和民意测验的抽样调查都需要采用不同的小域估计方法。本文针对小域估计问题,以估计方法发展脉络为主线,以基于分层贝叶斯分析的小域估计为重点,对小域估计问题的理论、方法和最新进展进行简述,并利用澳大利亚残疾、老龄化和护理人员(SDAC2003)抽样调查实际数据,从分层贝叶斯分析角度对西澳大利亚残疾率进行估计,最后对估计结果进行比较和讨论。
文摘Generalized Linear Mixed Model (GLMM) has been widely used in small area estimation for health indicators. Bayesian estimation is usually used to construct statistical intervals, however, its computational intensity is a big challenge for large complex surveys. Frequentist approaches, such as bootstrapping, and Monte Carlo (MC) simulation, are also applied but not evaluated in terms of the interval magnitude, width, and the computational time consumed. The 2013 Florida Behavioral Risk Factor Surveillance System data was used as a case study. County-level estimated prevalence of three health-related outcomes was obtained through a GLMM;and their 95% confidence intervals (CIs) were generated from bootstrapping and MC simulation. The intervals were compared to 95% credential intervals through a hierarchial Bayesian model. The results showed that 95% CIs for county-level estimates of each outcome by using MC simulation were similar to the 95% credible intervals generated by Bayesian estimation and were the most computationally efficient. It could be a viable option for constructing statistical intervals for small area estimation in public health practice.