Abstract: This article addresses one-sided hypothesis testing for disparity, i.e., testing whether the mean of one group is larger than that of another, when it is uncertain from which group each datum is drawn. For each datum, the uncertainty is captured by a given discrete probability distribution over the groups. Such situations arise, for example, when Bayesian imputation methods are used to assess race and ethnicity disparities in certain insurance, health, and financial data. A widely used method for this assessment is Bayesian Improved Surname Geocoding (BISG), which assigns to an individual a discrete probability distribution over six race/ethnicity groups given the individual's surname and address location. The probability of a disparity hypothesis is estimated within a Bayesian framework by Markov chain Monte Carlo sampling from the joint posterior distribution of the group means. Four methods are developed and compared on an illustrative data set. Three of these methods are implemented in R and one in WinBUGS. The methods are programmed for any number of groups from two to six inclusive. All code is provided in the appendices.
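The kind of estimate described in this abstract can be illustrated with a minimal Gibbs sampler. This is only a sketch under stated assumptions, not one of the article's four methods: it assumes a normal likelihood with a known common standard deviation `sigma`, independent normal priors on the group means, and per-datum group probabilities (such as BISG output) used as prior membership probabilities. The function name `disparity_probability` and all of its parameters are hypothetical.

```python
import math
import random


def disparity_probability(y, probs, a=0, b=1, sigma=1.0, prior_mean=0.0,
                          prior_sd=10.0, n_iter=2000, burn_in=500, seed=0):
    """Estimate P(mu_a > mu_b | data) by Gibbs sampling.

    y      -- list of observed values
    probs  -- list of per-datum probability vectors over the groups
    Assumed model (illustrative only): y_i | group g ~ Normal(mu_g, sigma^2),
    mu_g ~ Normal(prior_mean, prior_sd^2), sigma known.
    """
    rng = random.Random(seed)
    k = len(probs[0])
    mu = [prior_mean] * k
    hits = 0
    kept = 0
    for it in range(n_iter):
        # Step 1: sample each datum's group label given the current means.
        sums = [0.0] * k
        counts = [0] * k
        for yi, pi in zip(y, probs):
            logw = [-0.5 * ((yi - mu[g]) / sigma) ** 2 for g in range(k)]
            # Subtract the largest log-weight to avoid numerical underflow.
            top = max(lw for lw, p in zip(logw, pi) if p > 0)
            weights = [p * math.exp(lw - top) if p > 0 else 0.0
                       for lw, p in zip(logw, pi)]
            g = rng.choices(range(k), weights=weights)[0]
            sums[g] += yi
            counts[g] += 1
        # Step 2: sample each group mean from its conjugate normal posterior.
        for g in range(k):
            precision = 1.0 / prior_sd ** 2 + counts[g] / sigma ** 2
            mean = (prior_mean / prior_sd ** 2 + sums[g] / sigma ** 2) / precision
            mu[g] = rng.gauss(mean, math.sqrt(1.0 / precision))
        # Step 3: after burn-in, record whether the disparity hypothesis holds.
        if it >= burn_in:
            kept += 1
            if mu[a] > mu[b]:
                hits += 1
    return hits / kept
```

Calling `disparity_probability(y, probs, a=0, b=1)` returns the fraction of retained draws in which the mean of group 0 exceeds that of group 1, which serves as a Monte Carlo estimate of the posterior probability of the disparity hypothesis under the assumed model.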
Funding: National Social Science Foundation of China (No. 20BYY045).
Abstract: Population migration in Hubei Province was frequent throughout history and was accompanied by the migration of surnames, so it is important to study the surnames of its population. We take Xiantao City as an example to explore the isonymic structure of small and medium-sized cities in Hubei. The surname distributions of 223,327 residents registered in 2013 were analyzed in 5 towns and 105 villages of Xiantao. A total of 422 different surnames were found. The α-value reflects the influence of ethnic composition on the abundance of surnames. The correlation between the isonymic distance and the geographic distance between villages was calculated; it indicated that the Euclidean distance was weakly correlated with the geographic distance (r = 0.177 ± 0.012) and that the isonymic distance increased with the geographic distance. Furthermore, the dendrogram and PCA built from the matrix of Euclidean distances between villages identified a main surname differentiation between urban and rural areas.
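The abstract does not state how the Euclidean isonymic distance is defined. The sketch below assumes one definition commonly used in isonymy studies, sqrt(1 - sum_k sqrt(p_ki * p_kj)), where p_ki is the relative frequency of surname k in village i, and pairs it with a plain Pearson correlation against geographic distance. The function names are hypothetical, and the study's actual computation may differ.

```python
import math
from collections import Counter


def surname_frequencies(surnames):
    """Relative frequency of each surname in one village's register."""
    counts = Counter(surnames)
    total = sum(counts.values())
    return {name: c / total for name, c in counts.items()}


def isonymic_euclidean_distance(freq_i, freq_j):
    """Assumed definition: sqrt(1 - sum_k sqrt(p_ki * p_kj))."""
    shared = set(freq_i) & set(freq_j)
    cos_theta = sum(math.sqrt(freq_i[s] * freq_j[s]) for s in shared)
    # Clamp at zero to absorb floating-point rounding.
    return math.sqrt(max(0.0, 1.0 - cos_theta))


def pearson_r(xs, ys):
    """Pearson correlation between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)
```

Given one list of isonymic distances and one list of geographic distances for the same village pairs, `pearson_r` yields a correlation coefficient of the kind reported in the abstract. Because pairwise distances are not independent, a permutation-based test would usually be needed to assess significance.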