论文标题
发现差异:在不断变化的环境中体现代理的新任务
Spot the Difference: A Novel Task for Embodied Agents in Changing Environments
论文作者
论文摘要
体现AI是一个最近的研究领域,旨在创建可以在环境中移动和运行的智能代理。该领域的现有方法要求代理在全新且未开发的场景中行动。但是,此设置远非现实的用例,而需要在同一环境中执行多个任务。即使环境随时间变化,代理商仍然可以依靠其对场景的全球知识,同时试图使其内部表示形式适应当前环境状态。为了迈出此设置,我们提出了一个区别:体现AI的新任务,代理可以访问过时的环境地图,并且需要在固定时间预算中恢复正确的布局。为此,我们从3D空间的现有数据集开始收集一个新的占用图数据集,并为单个环境生成许多可能的布局。该数据集可以在流行的栖息地模拟器中使用,并且完全符合在导航过程中采用重建占用图的现有方法。此外,我们提出了一项勘探政策,该政策可以利用以前的环境知识,并比现有代理更快,更有效地确定场景的变化。实验结果表明,所提出的体系结构的表现优于现有的最新模型,用于探索这种新环境。
Embodied AI is a recent research area that aims at creating intelligent agents that can move and operate inside an environment. Existing approaches in this field demand the agents to act in completely new and unexplored scenes. However, this setting is far from realistic use cases that instead require executing multiple tasks in the same environment. Even if the environment changes over time, the agent could still count on its global knowledge about the scene while trying to adapt its internal representation to the current state of the environment. To make a step towards this setting, we propose Spot the Difference: a novel task for Embodied AI where the agent has access to an outdated map of the environment and needs to recover the correct layout in a fixed time budget. To this end, we collect a new dataset of occupancy maps starting from existing datasets of 3D spaces and generating a number of possible layouts for a single environment. This dataset can be employed in the popular Habitat simulator and is fully compliant with existing methods that employ reconstructed occupancy maps during navigation. Furthermore, we propose an exploration policy that can take advantage of previous knowledge of the environment and identify changes in the scene faster and more effectively than existing agents. Experimental results show that the proposed architecture outperforms existing state-of-the-art models for exploration on this new setting.