论文标题
绑定和视角将生成神经网络模型的推断作为推断
Binding and Perspective Taking as Inference in a Generative Neural Network Model
论文作者
论文摘要
从不同的角度,可以灵活地将特征绑定到相干大规模的能力是认知和智力的标志。重要的是,绑定问题不仅与视觉有关,还与一般智力,感觉运动集成,事件处理和语言有关。各种人工神经网络模型已通过动态神经领域和相关方法解决了这个问题。在这里,我们专注于生成的编码器架构,该体系结构适应其视角并通过回顾性推断绑定特征。我们首先训练模型,以学习动态生物运动或其他谐波运动模式(例如钟摆)的足够准确的生成模型。然后,我们在一定程度上争夺输入,可能会改变视角上的视角,然后将预测误差传播回结合矩阵,即确定特征结合的隐藏神经状态。此外,我们将误差进一步传播到采集神经元的透视图,后者将输入特征旋转并转化为已知的参考框架。评估表明,所得的基于梯度的推理过程解决了已知生物运动模式的观点采取和结合问题,从根本上产生了格式塔感知机制。此外,冗余特征性能和总体编码非常有用。当我们评估生物运动模式的算法时,原则性方法应适用于其他领域中的结合和格式塔感知问题。
The ability to flexibly bind features into coherent wholes from different perspectives is a hallmark of cognition and intelligence. Importantly, the binding problem is not only relevant for vision but also for general intelligence, sensorimotor integration, event processing, and language. Various artificial neural network models have tackled this problem with dynamic neural fields and related approaches. Here we focus on a generative encoder-decoder architecture that adapts its perspective and binds features by means of retrospective inference. We first train a model to learn sufficiently accurate generative models of dynamic biological motion or other harmonic motion patterns, such as a pendulum. We then scramble the input to a certain extent, possibly vary the perspective onto it, and propagate the prediction error back onto a binding matrix, that is, hidden neural states that determine feature binding. Moreover, we propagate the error further back onto perspective taking neurons, which rotate and translate the input features onto a known frame of reference. Evaluations show that the resulting gradient-based inference process solves the perspective taking and binding problem for known biological motion patterns, essentially yielding a Gestalt perception mechanism. In addition, redundant feature properties and population encodings are shown to be highly useful. While we evaluate the algorithm on biological motion patterns, the principled approach should be applicable to binding and Gestalt perception problems in other domains.