通过多保真贝叶斯优化伪造基于学习的控制器

论文标题

通过多保真贝叶斯优化伪造基于学习的控制器

Falsification of Learning-Based Controllers through Multi-Fidelity Bayesian Optimization

论文作者

Shahrooei, Zahra, Kochenderfer, Mykel J., Baheri, Ali

论文摘要

基于仿真的伪造是一种实用的测试方法，可以提高对系统将满足安全要求的信心。由于全因此模拟可以在计算上是要求的，因此我们调查了具有不同水平的忠诚度的模拟器的使用。作为第一步，我们就环境参数表达了总体安全规范，并将此安全规范构建为优化问题。我们提出了使用贝叶斯优化的多保真伪造框架，该框架能够确定除了从环境中找到导致系统失败的环境的可能实例之外，我们还应在哪个水平的忠诚度上进行安全评估。这种方法使我们能够以低保真模拟器的价格自动切换，并以具有成本效益的方式从高保真模拟器中从高保真模拟器中切换出来。我们对模拟各种环境的实验表明，多保真贝叶斯优化的伪造性能可与单财标贝叶斯优化相当，但成本较低。

Simulation-based falsification is a practical testing method to increase confidence that the system will meet safety requirements. Because full-fidelity simulations can be computationally demanding, we investigate the use of simulators with different levels of fidelity. As a first step, we express the overall safety specification in terms of environmental parameters and structure this safety specification as an optimization problem. We propose a multi-fidelity falsification framework using Bayesian optimization, which is able to determine at which level of fidelity we should conduct a safety evaluation in addition to finding possible instances from the environment that cause the system to fail. This method allows us to automatically switch between inexpensive, inaccurate information from a low-fidelity simulator and expensive, accurate information from a high-fidelity simulator in a cost-effective way. Our experiments on various environments in simulation demonstrate that multi-fidelity Bayesian optimization has falsification performance comparable to single-fidelity Bayesian optimization but with much lower cost.

下载PDF全文

下载文献需遵守相关版权规定

论文标题