更快的CE：快速，稀疏，透明和强大的反事实解释

论文标题

更快的CE：快速，稀疏，透明和强大的反事实解释

FASTER-CE: Fast, Sparse, Transparent, and Robust Counterfactual Explanations

论文作者

Sharma, Shubham, Gee, Alan H., Henderson, Jette, Ghosh, Joydeep

论文摘要

反事实解释在过去几年中的流行程度大大增加，这是一种有用的以人为中心的理解单个黑盒模型预测的方式。尽管在文献中已经确定了高质量反事实所需的几种特性，但三个关键问题：解释产生的速度，鲁棒性/敏感性和解释的简洁性（稀疏性）相对尚未探索。在本文中，我们提出了更快的CE：一组新型算法，以生成快速，稀疏和强大的反事实解释。关键思想是有效地在通过自动编码器指定的潜在空间中找到反事实的有希望的搜索说明。这些方向是基于在潜在空间中估计的每个原始输入特征以及目标的梯度确定的。快速检查最有前途的梯度方向的组合以及合并其他用户定义的约束的能力，使我们能够生成多种反事实解释，这些解释对于输入操作而言稀疏，现实且强大。通过在各种复杂性的三个数据集上进行的实验，我们表明，更快的CE不仅比其他最先进的方法快得多，可以生成多个解释，而且在考虑一组更大的理想（且通常是冲突的）属性时，它也非常出色。具体而言，我们介绍了多个性能指标的结果：稀疏性，邻近性，有效性，发电速度和解释的稳健性，以突出更快的CE家族的能力。

Counterfactual explanations have substantially increased in popularity in the past few years as a useful human-centric way of understanding individual black-box model predictions. While several properties desired of high-quality counterfactuals have been identified in the literature, three crucial concerns: the speed of explanation generation, robustness/sensitivity and succinctness of explanations (sparsity) have been relatively unexplored. In this paper, we present FASTER-CE: a novel set of algorithms to generate fast, sparse, and robust counterfactual explanations. The key idea is to efficiently find promising search directions for counterfactuals in a latent space that is specified via an autoencoder. These directions are determined based on gradients with respect to each of the original input features as well as of the target, as estimated in the latent space. The ability to quickly examine combinations of the most promising gradient directions as well as to incorporate additional user-defined constraints allows us to generate multiple counterfactual explanations that are sparse, realistic, and robust to input manipulations. Through experiments on three datasets of varied complexities, we show that FASTER-CE is not only much faster than other state of the art methods for generating multiple explanations but also is significantly superior when considering a larger set of desirable (and often conflicting) properties. Specifically we present results across multiple performance metrics: sparsity, proximity, validity, speed of generation, and the robustness of explanations, to highlight the capabilities of the FASTER-CE family.

下载PDF全文

下载文献需遵守相关版权规定

论文标题