Apprai：黑盒监督机器学习模型的多因素评估统一框架的理论

论文标题

Apprai：黑盒监督机器学习模型的多因素评估统一框架的理论

ComplAI: Theory of A Unified Framework for Multi-factor Assessment of Black-Box Supervised Machine Learning Models

论文作者

De, Arkadipta, Gudipudi, Satya Swaroop, Panchanan, Sourab, Desarkar, Maunendra Sankar

论文摘要

人工智能的进步正在创造新的机会，以改善从业务到医疗保健，从生活方式到教育的全球人的生活。例如，某些系统使用其人口统计学和行为特征来介绍用户，以进行某些特定领域的预测。通常，这种预测会直接或间接影响用户的寿命（例如，贷款支出，确定保险范围，入围申请等）。结果，对这种支持AI-Systems的担忧也在增加。为了解决这些问题，此类系统被要求负责，即对开发人员和最终用户的透明，公平和可解释。在本文中，我们介绍了一个独特的框架，该框架可以启用，观察，分析和量化在漂移方案中的解释性，鲁棒性，绩效，公平性和模型行为，并提供一个单一的信任因素，以评估不同监督的机器学习模型，而不仅仅是从他们做出正确的预测能力，而是从整体责任的角度来看。该框架可帮助用户（a）连接其模型并启用解释，（b）评估和可视化模型的不同方面，例如鲁棒性，漂移敏感性和公平性，以及（c）比较不同的模型（来自不同模型家族或通过不同的超级参数获得的模型）从整体的角度来看，从而促进该模型改进该模型的依据。它是模型不可知论，可与不同监督的机器学习方案（即二进制分类，多类分类和回归）和框架一起使用。它可以与任何ML生命周期框架无缝集成。因此，这个已经部署的框架旨在统一负责人AI系统的关键方面，以调节这种真实系统的开发过程。

The advances in Artificial Intelligence are creating new opportunities to improve lives of people around the world, from business to healthcare, from lifestyle to education. For example, some systems profile the users using their demographic and behavioral characteristics to make certain domain-specific predictions. Often, such predictions impact the life of the user directly or indirectly (e.g., loan disbursement, determining insurance coverage, shortlisting applications, etc.). As a result, the concerns over such AI-enabled systems are also increasing. To address these concerns, such systems are mandated to be responsible i.e., transparent, fair, and explainable to developers and end-users. In this paper, we present ComplAI, a unique framework to enable, observe, analyze and quantify explainability, robustness, performance, fairness, and model behavior in drift scenarios, and to provide a single Trust Factor that evaluates different supervised Machine Learning models not just from their ability to make correct predictions but from overall responsibility perspective. The framework helps users to (a) connect their models and enable explanations, (b) assess and visualize different aspects of the model, such as robustness, drift susceptibility, and fairness, and (c) compare different models (from different model families or obtained through different hyperparameter settings) from an overall perspective thereby facilitating actionable recourse for improvement of the models. It is model agnostic and works with different supervised machine learning scenarios (i.e., Binary Classification, Multi-class Classification, and Regression) and frameworks. It can be seamlessly integrated with any ML life-cycle framework. Thus, this already deployed framework aims to unify critical aspects of Responsible AI systems for regulating the development process of such real systems.

下载PDF全文

下载文献需遵守相关版权规定

论文标题