论文标题
Blockflow:用于联合学习的负责和保密的解决方案
BlockFLow: An Accountable and Privacy-Preserving Solution for Federated Learning
论文作者
论文摘要
联合学习可以在协作代理商中开发机器学习模型,而无需他们共享其基本数据。但是,在随机数据或更糟糕的数据集上倒置的结果类别的恶意药物会倒置,可以削弱组合模型。 Blockflow是一个负责任的联合学习系统,已完全分散并保留隐私。它的主要目标是奖励与其贡献质量成正比的代理商,同时保护基础数据集的隐私,并对恶意对手有弹性。具体而言,Blockflow融合了差异隐私,引入了一种新颖的审计机制来进行模型贡献,并使用以太坊智能合约来激励良好的行为。与联合学习系统的现有审计和问责方法不同,我们的系统不需要集中式测试数据集,代理之间的数据集共享或一个或多个值得信赖的审计师;在恶意信任模型中,它完全分散且具有弹性,可达到50%的勾结攻击。当在公共以太坊区块链上运行时,Blockflow使用审计的结果根据其贡献的质量来奖励具有加密货币的派对。我们评估了两个数据集上的块流,这些数据集提供了可通过逻辑回归模型来解决的分类任务。我们的结果表明,最终的审计分数反映了诚实代理数据集的质量。此外,不诚实代理商的分数在统计学上低于诚实代理商的分数。这些结果以及合理的区块链成本,证明了区块流作为一个负责任的联合学习系统的有效性。
Federated learning enables the development of a machine learning model among collaborating agents without requiring them to share their underlying data. However, malicious agents who train on random data, or worse, on datasets with the result classes inverted, can weaken the combined model. BlockFLow is an accountable federated learning system that is fully decentralized and privacy-preserving. Its primary goal is to reward agents proportional to the quality of their contribution while protecting the privacy of the underlying datasets and being resilient to malicious adversaries. Specifically, BlockFLow incorporates differential privacy, introduces a novel auditing mechanism for model contribution, and uses Ethereum smart contracts to incentivize good behavior. Unlike existing auditing and accountability methods for federated learning systems, our system does not require a centralized test dataset, sharing of datasets between the agents, or one or more trusted auditors; it is fully decentralized and resilient up to a 50% collusion attack in a malicious trust model. When run on the public Ethereum blockchain, BlockFLow uses the results from the audit to reward parties with cryptocurrency based on the quality of their contribution. We evaluated BlockFLow on two datasets that offer classification tasks solvable via logistic regression models. Our results show that the resultant auditing scores reflect the quality of the honest agents' datasets. Moreover, the scores from dishonest agents are statistically lower than those from the honest agents. These results, along with the reasonable blockchain costs, demonstrate the effectiveness of BlockFLow as an accountable federated learning system.