论文标题
VFD:贝叶斯神经网络中的变异远见动态选择,以有效的人类活动识别
VFDS: Variational Foresight Dynamic Selection in Bayesian Neural Networks for Efficient Human Activity Recognition
论文作者
论文摘要
在许多机器学习任务中,以不同的成本获得具有不同程度预测能力的输入功能。为了优化性能成本折衷,将选择特征以观察先验的功能。但是,鉴于以前的观察结果变化的上下文,要选择的预测特征的子集可能会动态变化。因此,我们面临着挑战性的远见动态选择(FDS)的挑战:寻找动态和轻巧的政策,以决定接下来要观察哪些功能,然后实际观察它们,以实现整体性能成本的权衡。为了解决FDS,本文提出了一个贝叶斯学习框架,该框架是变异的预见动态选择(VFD)。 VFD通过优化一个特征模型性能和功能成本之间的权衡取舍的差异贝叶斯目标来选择一个要观察下一个特征子集的策略。其核心是二进制门上的隐式变分分布,取决于先前的观察结果,该观察结果将选择要观察的下一个特征子集。我们将VFD应用于人类活动识别(HAR)任务,在该任务中,绩效成本权衡在其实践中至关重要。广泛的结果表明,VFD在不断变化的环境下选择不同的功能,特别是在保持或提高HAR准确性的同时节省了感官成本。此外,VFD动态选择的功能被证明是可解释的,并且与不同的活动类型相关联。我们将发布代码。
In many machine learning tasks, input features with varying degrees of predictive capability are acquired at varying costs. In order to optimize the performance-cost trade-off, one would select features to observe a priori. However, given the changing context with previous observations, the subset of predictive features to select may change dynamically. Therefore, we face the challenging new problem of foresight dynamic selection (FDS): finding a dynamic and light-weight policy to decide which features to observe next, before actually observing them, for overall performance-cost trade-offs. To tackle FDS, this paper proposes a Bayesian learning framework of Variational Foresight Dynamic Selection (VFDS). VFDS learns a policy that selects the next feature subset to observe, by optimizing a variational Bayesian objective that characterizes the trade-off between model performance and feature cost. At its core is an implicit variational distribution on binary gates that are dependent on previous observations, which will select the next subset of features to observe. We apply VFDS on the Human Activity Recognition (HAR) task where the performance-cost trade-off is critical in its practice. Extensive results demonstrate that VFDS selects different features under changing contexts, notably saving sensory costs while maintaining or improving the HAR accuracy. Moreover, the features that VFDS dynamically select are shown to be interpretable and associated with the different activity types. We will release the code.