论文标题
蒙特卡洛的方法来缩小现实差距
A Monte Carlo Approach to Closing the Reality Gap
论文作者
论文摘要
我们提出了一种新颖的方法来解决“现实差距”问题,即修改机器人模拟,以使其性能变得更加类似于现实世界现象。是否正在使用人类设计师或自动化政策制定机制中使用模拟。我们希望使用模拟制定程序/策略,然后将其部署在真实系统上。我们进一步假设该程序包括一个带有标量输出的监视程序,以确定其何时实现其绩效目标。所提出的方法收集模拟和现实世界的观察,并构建有条件的概率函数。这些用于生成配对的推出以识别行为差异点。这些用于生成{\ it状态空间内核},该{\ IT状态空间内核将模拟迫使表现更像观察到的现实。 使用ROS/Gazebo评估该方法进行仿真,并在室外部署中进行了重大修改的TRAAXAS平台。结果不仅支持内核方法可以迫使模拟更像现实,而且修改使得改进的控制策略在修改后的模拟中测试的改进的控制策略在现实世界中的性能也更好。
We propose a novel approach to the 'reality gap' problem, i.e., modifying a robot simulation so that its performance becomes more similar to observed real world phenomena. This problem arises whether the simulation is being used by human designers or in an automated policy development mechanism. We expect that the program/policy is developed using simulation, and subsequently deployed on a real system. We further assume that the program includes a monitor procedure with scalar output to determine when it is achieving its performance objectives. The proposed approach collects simulation and real world observations and builds conditional probability functions. These are used to generate paired roll-outs to identify points of divergence in behavior. These are used to generate {\it state-space kernels} that coerce the simulation into behaving more like observed reality. The method was evaluated using ROS/Gazebo for simulation and a heavily modified Traaxas platform in outdoor deployment. The results support not just that the kernel approach can force the simulation to behave more like reality, but that the modification is such that an improved control policy tested in the modified simulation also performs better in the real world.