强化学习的体验解释

论文标题

强化学习的体验解释

Experiential Explanations for Reinforcement Learning

论文作者

Alabdulkarim, Amal, Singh, Madhuri, Mansi, Gennie, Hall, Kaely, Ehsan, Upol, Riedl, Mark O.

论文摘要

强化学习（RL）系统可能是复杂且不可解剖的，这使得非AI专家了解或干预决策的挑战。这部分归因于RL的顺序性质，在该性质中，由于获得未来奖励的可能性，因此选择了动作。但是，RL代理丢弃了培训的定性特征，因此很难恢复选择“为什么”操作的用户可理解信息。我们提出了一种经验解释技术，以通过培训影响预测因素以及RL政策来产生反事实解释。影响预测因素是学习不同国家中不同奖励来源如何影响代理的模型，从而恢复有关政策如何反映环境的信息。两项人力评估研究表明，与其他标准类型的解释相比，与其他标准类型的解释相比，具有体验式解释的参与者能够正确猜测代理会做什么。参与者还发现，体验式解释更易于理解，令人满意，完整，有用和准确。定性分析提供了有关最有用的体验解释因素以及参与者从解释中寻求的所需特征的信息。

Reinforcement learning (RL) systems can be complex and non-interpretable, making it challenging for non-AI experts to understand or intervene in their decisions. This is due in part to the sequential nature of RL in which actions are chosen because of their likelihood of obtaining future rewards. However, RL agents discard the qualitative features of their training, making it difficult to recover user-understandable information for "why" an action is chosen. We propose a technique Experiential Explanations to generate counterfactual explanations by training influence predictors along with the RL policy. Influence predictors are models that learn how different sources of reward affect the agent in different states, thus restoring information about how the policy reflects the environment. Two human evaluation studies revealed that participants presented with Experiential Explanations were better able to correctly guess what an agent would do than those presented with other standard types of explanation. Participants also found that Experiential Explanations are more understandable, satisfying, complete, useful, and accurate. Qualitative analysis provides information on the factors of Experiential Explanations that are most useful and the desired characteristics that participants seek from the explanations.

下载PDF全文

下载文献需遵守相关版权规定

论文标题