论文标题
HASHVFL:防御垂直联合学习中的数据重建攻击
HashVFL: Defending Against Data Reconstruction Attacks in Vertical Federated Learning
论文作者
论文摘要
垂直联合学习(VFL)是一种流行的协作机器学习模型培训解决方案。现有的工业框架采用安全的多方计算技术,例如同构加密,以确保数据安全和隐私。尽管进行了这些努力,但研究表明,由于中间表示与原始数据之间的相关性,数据泄漏仍然是VFL的风险。神经网络可以准确捕获这些相关性,从而使对手重建数据。这强调了继续研究确保VFL系统的必要性。 我们的工作表明,哈希是对抗数据重建攻击的有前途的解决方案。哈希的单向性质使对手很难从哈希代码中恢复数据。但是,在VFL中实施散列会带来新的挑战,包括消失的梯度和信息丢失。为了解决这些问题,我们提出了HASHVFL,该问题将哈希和同时达到可学习性,平衡和一致性。 实验结果表明,HASHVFL有效地保持任务性能,同时捍卫数据重建攻击。它还为减少标签泄漏,减轻对抗性攻击和检测异常输入的程度带来了其他好处。我们希望我们的工作能够激发人们对HASHVFL潜在应用的进一步研究。
Vertical Federated Learning (VFL) is a trending collaborative machine learning model training solution. Existing industrial frameworks employ secure multi-party computation techniques such as homomorphic encryption to ensure data security and privacy. Despite these efforts, studies have revealed that data leakage remains a risk in VFL due to the correlations between intermediate representations and raw data. Neural networks can accurately capture these correlations, allowing an adversary to reconstruct the data. This emphasizes the need for continued research into securing VFL systems. Our work shows that hashing is a promising solution to counter data reconstruction attacks. The one-way nature of hashing makes it difficult for an adversary to recover data from hash codes. However, implementing hashing in VFL presents new challenges, including vanishing gradients and information loss. To address these issues, we propose HashVFL, which integrates hashing and simultaneously achieves learnability, bit balance, and consistency. Experimental results indicate that HashVFL effectively maintains task performance while defending against data reconstruction attacks. It also brings additional benefits in reducing the degree of label leakage, mitigating adversarial attacks, and detecting abnormal inputs. We hope our work will inspire further research into the potential applications of HashVFL.