论文标题

嵌入式多级回归和延伸后:基于模型的推断,具有不完整的辅助信息

Embedded Multilevel Regression and Poststratification: Model-based Inference with Incomplete Auxiliary Information

论文作者

Li, Katherine, Si, Yajuan

论文摘要

健康差异研究经常评估人口亚组之间的健康结果。多级回归和延伸后(MRP)是小型亚组估计的一种流行方法,因为它通过拟合多级模型稳定估计的能力并通过对辅助变量进行后的流传化来调整选择偏差,这是对分析结果的人群特征。但是,MRP产生的估计值的粒度和质量受辅助变量的关节分布的可用性限制。数据分析师通常只能访问边际分布。为了克服这一局限性,我们将估计在MRP工作流程中嵌入了所需的种群细胞计数:嵌入MRP(EMRP)。在EMRP下,我们在实施MRP之前生成辅助变量的合成群体。所有估计不确定性的来源都通过完全贝叶斯框架传播。通过仿真研究,我们比较了不同的方法,并证明了EMRP对偏见差异方面的替代方案的改进,以产生有效的利息亚群来推断。作为例证,我们将EMRP应用于对福祉的纵向调查,并估计纽约市脆弱群体之间的粮食不安全率。我们发现,所有EMRP估计器都可以纠正经典MRP的偏差,同时保持较低的标准错误和较窄的置信区间,而不是直接使用WFPBB和基于设计的估计值。 EMRP估计器的性能彼此之间没有很大差异,尽管我们通常会建议WFPBB-MRP持续高的覆盖率。

Health disparity research often evaluates health outcomes across demographic subgroups. Multilevel regression and poststratification (MRP) is a popular approach for small subgroup estimation due to its ability to stabilize estimates by fitting multilevel models and to adjust for selection bias by poststratifying on auxiliary variables, which are population characteristics predictive of the analytic outcome. However, the granularity and quality of the estimates produced by MRP are limited by the availability of the auxiliary variables' joint distribution; data analysts often only have access to the marginal distributions. To overcome this limitation, we embed the estimation of population cell counts needed for poststratification into the MRP workflow: embedded MRP (EMRP). Under EMRP, we generate synthetic populations of the auxiliary variables before implementing MRP. All sources of estimation uncertainty are propagated with a fully Bayesian framework. Through simulation studies, we compare different methods and demonstrate EMRP's improvements over alternatives on the bias-variance tradeoff to yield valid subpopulation inferences of interest. As an illustration, we apply EMRP to the Longitudinal Survey of Wellbeing and estimate food insecurity prevalence among vulnerable groups in New York City. We find that all EMRP estimators can correct for the bias in classical MRP while maintaining lower standard errors and narrower confidence intervals than directly imputing with the WFPBB and design-based estimates. Performances from the EMRP estimators do not differ substantially from each other, though we would generally recommend the WFPBB-MRP for its consistently high coverage rates.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源