论文标题
贝叶斯数据融合与共享先验
Bayesian data fusion with shared priors
论文作者
论文摘要
来自多个来源的数据和知识的集成称为数据融合。当仅以分布式方式或使用不同的传感器来推断一定兴趣的数据时,数据融合就变得至关重要。在贝叶斯环境中,可以使用未知数量的先验信息,并且可能在不同的分布式估计器中存在。当融合本地估计值时,除非融合节点对此进行解释并对其进行纠正,否则用于构造几个本地后代的先验知识可能会被过度使用。在本文中,我们分析了贝叶斯数据融合环境中共享先验的影响。根据不同的共同融合规则,我们的分析有助于理解性能行为,这是协作代理数量的函数以及不同类型的先验的结果。分析是通过使用贝叶斯推论中常见的两个差异进行的,结果的一般性允许分析非常通用的分布。通过各种估计和分类问题,包括线性和非线性模型以及联合学习方案,通过实验来证实这些理论结果。
The integration of data and knowledge from several sources is known as data fusion. When data is only available in a distributed fashion or when different sensors are used to infer a quantity of interest, data fusion becomes essential. In Bayesian settings, a priori information of the unknown quantities is available and, possibly, present among the different distributed estimators. When the local estimates are fused, the prior knowledge used to construct several local posteriors might be overused unless the fusion node accounts for this and corrects it. In this paper, we analyze the effects of shared priors in Bayesian data fusion contexts. Depending on different common fusion rules, our analysis helps to understand the performance behavior as a function of the number of collaborative agents and as a consequence of different types of priors. The analysis is performed by using two divergences which are common in Bayesian inference, and the generality of the results allows to analyze very generic distributions. These theoretical results are corroborated through experiments in a variety of estimation and classification problems, including linear and nonlinear models, and federated learning schemes.