论文标题
迈向海马的生物学模型作为后继代理
Toward the biological model of the hippocampus as the successor representation agent
论文作者
论文摘要
海马是空间记忆和学习的重要大脑区域。最近,已经发布了基于时间差异(TD)学习的海马理论模型。受到连续代表(SR)学习算法的启发,将TD学习的价值函数分解为奖励和状态过渡,他们认为,海马中CA1放置细胞的触发速率代表了状态过渡的可能性。该理论称为预测图理论,声称代表空间的海马学会了从当前状态到未来状态的过渡的可能性。预期未来状态的神经相关性是CA1位置细胞的发射速率。这种解释对于行为实验中记录的结果是合理的,但缺乏神经生物学的含义。 修改SR学习算法将生物学含义添加到预测图理论中。与SR学习算法中当前和未来状态的信息的同时信息相似,CA1位置单元将获得CA3和Intorhinal Cortex的两个输入。数学转化表明,SR学习算法等效于异突触可塑性规则。讨论了CA1中的异突触可塑性现象,并将其与修改后的SR更新规则进行了比较。这项研究试图将TD算法解释为出现的神经生物学机制,并将神经科学和人工智能方法整合到该领域。
The hippocampus is an essential brain region for spatial memory and learning. Recently, a theoretical model of the hippocampus based on temporal difference (TD) learning has been published. Inspired by the successor representation (SR) learning algorithms, which decompose value function of TD learning into reward and state transition, they argued that the rate of firing of CA1 place cells in the hippocampus represents the probability of state transition. This theory, called predictive map theory, claims that the hippocampus representing space learns the probability of transition from the current state to the future state. The neural correlates of expecting the future state are the firing rates of the CA1 place cells. This explanation is plausible for the results recorded in behavioral experiments, but it is lacking the neurobiological implications. Modifying the SR learning algorithm added biological implications to the predictive map theory. Similar with the simultaneous needs of information of the current and future state in the SR learning algorithm, the CA1 place cells receive two inputs from CA3 and entorhinal cortex. Mathematical transformation showed that the SR learning algorithm is equivalent to the heterosynaptic plasticity rule. The heterosynaptic plasticity phenomena in CA1 were discussed and compared with the modified SR update rule. This study attempted to interpret the TD algorithm as the neurobiological mechanism occurring in place learning, and to integrate the neuroscience and artificial intelligence approaches in the field.