Safe Decision-making for Lane-change of Autonomous Vehicles via Human Demonstration-aided Reinforcement Learning

Paper Title

Safe Decision-making for Lane-change of Autonomous Vehicles via Human Demonstration-aided Reinforcement Learning

Authors

Jingda Wu, Wenhui Huang, Niels de Boer, Yanghui Mo, Xiangkun He, Chen Lv

Abstract

Decision-making is critical for lane changes in autonomous driving. Reinforcement learning (RL) algorithms aim to identify the values of behaviors in various situations, which makes them a promising way to address the decision-making problem. However, poor runtime safety hinders the practical use of RL-based decision-making strategies in complex driving tasks. To address this problem, this paper incorporates human demonstrations into the RL-based decision-making strategy. Decisions made by human subjects in a driving simulator are treated as safe demonstrations, stored in the replay buffer, and then used to enhance the RL training process. A complex lane-change task in an off-ramp scenario is established to examine the performance of the developed strategy. Simulation results suggest that human demonstrations can effectively improve the safety of RL decisions, and that the proposed strategy surpasses other existing learning-based decision-making strategies on multiple driving performance measures.
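
The abstract states that human demonstrations are stored in the replay buffer and used during RL training, but it does not say how they are mixed with the agent's own experience. The following is a minimal sketch of one common way to do this, not the authors' implementation. The names `Transition`, `DemoReplayBuffer`, and `demo_fraction`, and the choice of keeping demonstrations permanently while agent transitions are evicted first-in, first-out, are all assumptions made for illustration.

```python
import random
from collections import deque
from typing import List, NamedTuple


class Transition(NamedTuple):
    state: tuple
    action: int
    reward: float
    next_state: tuple
    done: bool
    is_demo: bool  # True if recorded from a human driver (assumed flag)


class DemoReplayBuffer:
    """Hypothetical replay buffer that mixes human demonstrations with agent data."""

    def __init__(self, capacity: int, demo_fraction: float = 0.25, seed: int = 0):
        self.demos: List[Transition] = []    # demonstrations are never evicted (assumption)
        self.agent = deque(maxlen=capacity)  # agent transitions, oldest dropped first
        self.demo_fraction = demo_fraction   # assumed share of each batch drawn from demos
        self.rng = random.Random(seed)

    def add_demo(self, state, action, reward, next_state, done) -> None:
        self.demos.append(Transition(state, action, reward, next_state, done, True))

    def add_agent(self, state, action, reward, next_state, done) -> None:
        self.agent.append(Transition(state, action, reward, next_state, done, False))

    def sample(self, batch_size: int) -> List[Transition]:
        # Draw up to demo_fraction of the batch from demonstrations,
        # and fill the remainder from the agent's own experience.
        n_demo = min(len(self.demos), round(batch_size * self.demo_fraction))
        n_agent = min(len(self.agent), batch_size - n_demo)
        batch = self.rng.sample(self.demos, n_demo)
        batch += self.rng.sample(list(self.agent), n_agent)
        self.rng.shuffle(batch)
        return batch


# Usage: 10 demonstration transitions and 100 agent transitions with toy states.
buffer = DemoReplayBuffer(capacity=50, demo_fraction=0.25)
for i in range(10):
    buffer.add_demo((i,), 0, 1.0, (i + 1,), False)
for i in range(100):
    buffer.add_agent((i,), 1, 0.0, (i + 1,), False)
batch = buffer.sample(8)
```

With these settings each batch of 8 contains 2 demonstration transitions and 6 agent transitions, so the safe human decisions keep influencing updates even after the agent buffer has been overwritten many times. An off-policy learner would consume `batch` exactly as it would a batch from an ordinary replay buffer.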
