人类机器人相互作用的主动不确定性降低：一种隐式双控制方法

论文标题

人类机器人相互作用的主动不确定性降低：一种隐式双控制方法

Active Uncertainty Reduction for Human-Robot Interaction: An Implicit Dual Control Approach

论文作者

Hu, Haimin, Fisac, Jaime F.

论文摘要

准确预测人类行为的能力对于互动环境中机器人自主权的安全性和效率至关重要。不幸的是，机器人通常无法访问这些预测可能取决的关键信息，例如人们的目标，关注和合作意愿。双重控制理论通过将预测模型的未知参数视为随机隐藏状态，并使用系统操作期间收集的信息来推断其值，从而解决了这一挑战。虽然能够最佳，自动自动进行探索和开发，但双重控制在一般的互动运动计划上却是可靠的，这主要是由于机器人轨迹优化和人类意图推断之间的基本耦合。在本文中，我们提出了一种新型的算法方法，以基于隐式双控制范式来实现互动运动计划的主动不确定性。我们的方法依赖于基于抽样的随机动态编程的近似，从而导致模型预测控制问题，该问题可以通过基于实时梯度的优化方法很容易解决。结果策略显示出具有连续和分类不确定性的广泛预测人类模型的双重控制效果。通过模拟驾驶示例证明了我们方法的功效。

The ability to accurately predict human behavior is central to the safety and efficiency of robot autonomy in interactive settings. Unfortunately, robots often lack access to key information on which these predictions may hinge, such as people's goals, attention, and willingness to cooperate. Dual control theory addresses this challenge by treating unknown parameters of a predictive model as stochastic hidden states and inferring their values at runtime using information gathered during system operation. While able to optimally and automatically trade off exploration and exploitation, dual control is computationally intractable for general interactive motion planning, mainly due to the fundamental coupling between robot trajectory optimization and human intent inference. In this paper, we present a novel algorithmic approach to enable active uncertainty reduction for interactive motion planning based on the implicit dual control paradigm. Our approach relies on sampling-based approximation of stochastic dynamic programming, leading to a model predictive control problem that can be readily solved by real-time gradient-based optimization methods. The resulting policy is shown to preserve the dual control effect for a broad class of predictive human models with both continuous and categorical uncertainty. The efficacy of our approach is demonstrated with simulated driving examples.

下载PDF全文

下载文献需遵守相关版权规定

论文标题