论文标题

边际贡献的特征很重要 - 自然案例的公理方法

Marginal Contribution Feature Importance -- an Axiomatic Approach for The Natural Case

论文作者

Catav, Amnon, Fu, Boyang, Ernst, Jason, Sankararaman, Sriram, Gilad-Bachrach, Ran

论文摘要

当训练预测模型对医学数据训练时,目标有时是为了获得有关某种疾病的见解。在这种情况下,通常将特征重要性用作突出导致该疾病的重要因素的工具。由于现有的计算特征重要性得分的方法,因此了解其相对优点并不是微不足道的。此外,使用它们的情景多样性导致与特征重要性得分不同的期望。尽管通常将重点放在单个预测上的本地分数和研究特征对模型的贡献的全球分数之间的区别是很常见的,但另一个重要的部门区分了模型方案,在该方案中,目标是了解给定模型的预测与自然情景的预测,在该场景中,该目标是理解诸如疾病等现象的靶标。我们开发一组公理,代表自然场景中特征重要性函数所期望的属性,并证明只有一个满足所有这些功能的函数,边际贡献特征的重要性(MCI)。我们分析了此功能的理论和经验特性,并将其与其他特征重要性得分进行了比较。虽然我们的重点是自然的情况,但我们建议在其他情况下也可以进行公理方法。

When training a predictive model over medical data, the goal is sometimes to gain insights about a certain disease. In such cases, it is common to use feature importance as a tool to highlight significant factors contributing to that disease. As there are many existing methods for computing feature importance scores, understanding their relative merits is not trivial. Further, the diversity of scenarios in which they are used lead to different expectations from the feature importance scores. While it is common to make the distinction between local scores that focus on individual predictions and global scores that look at the contribution of a feature to the model, another important division distinguishes model scenarios, in which the goal is to understand predictions of a given model from natural scenarios, in which the goal is to understand a phenomenon such as a disease. We develop a set of axioms that represent the properties expected from a feature importance function in the natural scenario and prove that there exists only one function that satisfies all of them, the Marginal Contribution Feature Importance (MCI). We analyze this function for its theoretical and empirical properties and compare it to other feature importance scores. While our focus is the natural scenario, we suggest that our axiomatic approach could be carried out in other scenarios too.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源