贝叶斯模型概率何时过度自信？

论文标题

贝叶斯模型概率何时过度自信？

When are Bayesian model probabilities overconfident?

论文作者

Oelrich, Oscar, Ding, Shutong, Magnusson, Måns, Vehtari, Aki, Villani, Mattias

论文摘要

贝叶斯模型比较通常基于比较模型集的后验分布。即使其他模型拟合或预测能力的其他度量也没有强烈的偏好，通常也会观察到这种分布集中在单个模型上。此外，数据样本的适度变化可以轻松地转移后验模型概率以专注于另一个模型。我们记录了在经济学和神经科学方面的两个备受瞩目的应用中的过度自信。为了更多地了解过度自信的来源，我们在单变量和多元线性回归中得出了贝叶斯因子的采样方差。结果表明，当i）比较模型给出数据生成过程的近似值时可能会发生过度自信，ii）模型在模型之间没有共享的较大自由度非常灵活，并且iii iii）模型低估了数据的真实变异性。

Bayesian model comparison is often based on the posterior distribution over the set of compared models. This distribution is often observed to concentrate on a single model even when other measures of model fit or forecasting ability indicate no strong preference. Furthermore, a moderate change in the data sample can easily shift the posterior model probabilities to concentrate on another model. We document overconfidence in two high-profile applications in economics and neuroscience. To shed more light on the sources of overconfidence we derive the sampling variance of the Bayes factor in univariate and multivariate linear regression. The results show that overconfidence is likely to happen when i) the compared models give very different approximations of the data-generating process, ii) the models are very flexible with large degrees of freedom that are not shared between the models, and iii) the models underestimate the true variability in the data.

下载PDF全文

下载文献需遵守相关版权规定

论文标题