论文标题
学习公平分类器的基因编程方法
Genetic programming approaches to learning fair classifiers
论文作者
论文摘要
社会已经依靠诸如分类器之类的算法来进行重要的决策,从而导致需要诸如公平之类的道德保证。公平性通常是通过要求分类器的某些统计量与人口中受保护的群体大致相等的统计数据。在本文中,讨论并使用当前的公平方法来激励算法提案,这些建议将公平性纳入分类的基因编程中。我们提出了两个想法。首先是将公平目标纳入多目标优化。第二个是适应词汇酶选择,以动态定义受保护基团的相交的情况。我们描述了为什么词汇酶选择非常适合压力模型,以在可能无限的许多亚组中表现良好。我们使用最近的遗传编程方法在四个数据集上构建模型,这是必要公平限制的,并从经验上比较了使用游戏理论解决方案的先前方法。方法是根据其产生帕累托最佳准确性和准确性权衡取舍的能力来评估方法的。结果表明,遗传编程方法一般,尤其是随机搜索非常适合此任务。
Society has come to rely on algorithms like classifiers for important decision making, giving rise to the need for ethical guarantees such as fairness. Fairness is typically defined by asking that some statistic of a classifier be approximately equal over protected groups within a population. In this paper, current approaches to fairness are discussed and used to motivate algorithmic proposals that incorporate fairness into genetic programming for classification. We propose two ideas. The first is to incorporate a fairness objective into multi-objective optimization. The second is to adapt lexicase selection to define cases dynamically over intersections of protected groups. We describe why lexicase selection is well suited to pressure models to perform well across the potentially infinitely many subgroups over which fairness is desired. We use a recent genetic programming approach to construct models on four datasets for which fairness constraints are necessary, and empirically compare performance to prior methods utilizing game-theoretic solutions. Methods are assessed based on their ability to generate trade-offs of subgroup fairness and accuracy that are Pareto optimal. The result show that genetic programming methods in general, and random search in particular, are well suited to this task.