论文标题
果冻豆世界:无休止学习的测试床
Jelly Bean World: A Testbed for Never-Ending Learning
论文作者
论文摘要
机器学习近年来已经表现出越来越多的成功。但是,当前的机器学习系统是高度专业化的,对特定问题或域进行了培训,通常是在一个狭窄的数据集上。另一方面,人类的学习是高度通用和适应性的。永无止境的学习是一种机器学习范式,旨在弥合这一差距,目的是鼓励研究人员设计机器学习系统,这些机器学习系统可以学习在更复杂的环境中执行更广泛的相互关联的任务。迄今为止,还没有环境或测试来促进对永无止境的学习系统的开发和评估。为此,我们提出了果冻豆世界测试台。果冻豆世界允许在二维网格世界上进行实验,这些网格世界充满了物品,并且代理可以在其中导航。该测试床提供了足够复杂的环境,并且在其中更一般的智能算法应该比当前的最新强化学习方法更好。它通过产生非平稳环境并使用多任务,多代理,多模式和课程学习设置来促进实验。我们希望这个新的自由开放的软件能够促进对永无止境的学习系统以及更广泛的通用情报系统的开发和评估的新研究和兴趣。
Machine learning has shown growing success in recent years. However, current machine learning systems are highly specialized, trained for particular problems or domains, and typically on a single narrow dataset. Human learning, on the other hand, is highly general and adaptable. Never-ending learning is a machine learning paradigm that aims to bridge this gap, with the goal of encouraging researchers to design machine learning systems that can learn to perform a wider variety of inter-related tasks in more complex environments. To date, there is no environment or testbed to facilitate the development and evaluation of never-ending learning systems. To this end, we propose the Jelly Bean World testbed. The Jelly Bean World allows experimentation over two-dimensional grid worlds which are filled with items and in which agents can navigate. This testbed provides environments that are sufficiently complex and where more generally intelligent algorithms ought to perform better than current state-of-the-art reinforcement learning approaches. It does so by producing non-stationary environments and facilitating experimentation with multi-task, multi-agent, multi-modal, and curriculum learning settings. We hope that this new freely-available software will prompt new research and interest in the development and evaluation of never-ending learning systems and more broadly, general intelligence systems.