分布式深入的强化学习以进行模拟控制

论文标题

分布式深入的强化学习以进行模拟控制

Distributed deep reinforcement learning for simulation control

论文作者

Pawar, Suraj, Maulik, Romit

论文摘要

物理系统的科学模拟中的一些应用可以作为控制/优化问题提出。此类系统的计算模型通常包含超参数，该模型控制解决方案保真度和计算费用。这些参数的调整是非平凡的，一般的方法是手动“检查点检查”以进行良好的组合。这是因为当参数空间较大并且何时可能动态变化时，最佳的超参数配置搜索变得不切实际。为了解决这个问题，我们提出了一个基于深钢筋学习（RL）的框架，以训练深层神经网络代理，该框架通过动态改变参数来控制模型求解。首先，我们通过动态更改系统参数来验证RL框架在混乱系统中控制混乱的问题。随后，我们通过自动调整运行时内离散的Navier-Stokes方程的松弛因子来说明框架加速稳态CFD求解器收敛的功能。结果表明，与弛豫因子的随机选择相比，学到的策略对放松因子的运行时间控制导致收敛次数大大减少。我们的结果表明，在不同几何和边界条件上学习自适应超参数学习策略的潜在好处，对减少计算活动费用的影响。 \ footNote {数据和代码可在\ url {https://github.com/romit-maulik/par-rl}}}

Several applications in the scientific simulation of physical systems can be formulated as control/optimization problems. The computational models for such systems generally contain hyperparameters, which control solution fidelity and computational expense. The tuning of these parameters is non-trivial and the general approach is to manually `spot-check' for good combinations. This is because optimal hyperparameter configuration search becomes impractical when the parameter space is large and when they may vary dynamically. To address this issue, we present a framework based on deep reinforcement learning (RL) to train a deep neural network agent that controls a model solve by varying parameters dynamically. First, we validate our RL framework for the problem of controlling chaos in chaotic systems by dynamically changing the parameters of the system. Subsequently, we illustrate the capabilities of our framework for accelerating the convergence of a steady-state CFD solver by automatically adjusting the relaxation factors of discretized Navier-Stokes equations during run-time. The results indicate that the run-time control of the relaxation factors by the learned policy leads to a significant reduction in the number of iterations for convergence compared to the random selection of the relaxation factors. Our results point to potential benefits for learning adaptive hyperparameter learning strategies across different geometries and boundary conditions with implications for reduced computational campaign expenses. \footnote{Data and codes available at \url{https://github.com/Romit-Maulik/PAR-RL}}

下载PDF全文

下载文献需遵守相关版权规定

论文标题