使用隐私的联合学习来改善设备演讲者的验证

论文标题

使用隐私的联合学习来改善设备演讲者的验证

Improving on-device speaker verification using federated learning with privacy

论文作者

Granqvist, Filip, Seigel, Matt, van Dalen, Rogier, Cahill, Áine, Shum, Stephen, Paulik, Matthias

论文摘要

关于说话者特征的信息可以作为副信息，可在提高说话者识别精度时提供。但是，这些信息通常是私人的。本文调查了保护隐私的学习如何通过允许使用对隐私敏感的扬声器数据来培训预测说话者声音特征的辅助分类模型来改善说话者验证系统。特别是，本文探讨了通过结合不同联合学习和差异隐私机制的方法实现的效用。这些方法使得在保护用户隐私的同时训练中心模型成为可能，并且用户的数据保留在其设备上。此外，它们使对大量的说话者的学习成为可能，从而在训练模型时确保了对扬声器特征的良好覆盖。此处描述的辅助模型使用从触发扬声器验证系统的短语中提取的功能。从这些功能中，该模型可以预测说话者特性标签，被认为是侧面信息。使用多任务学习将辅助模型的知识蒸馏到扬声器验证系统中，该辅助模型预测的侧面信息标签是附加任务。这种方法导致基线系统相同错误率的相对相对提高6％。

Information on speaker characteristics can be useful as side information in improving speaker recognition accuracy. However, such information is often private. This paper investigates how privacy-preserving learning can improve a speaker verification system, by enabling the use of privacy-sensitive speaker data to train an auxiliary classification model that predicts vocal characteristics of speakers. In particular, this paper explores the utility achieved by approaches which combine different federated learning and differential privacy mechanisms. These approaches make it possible to train a central model while protecting user privacy, with users' data remaining on their devices. Furthermore, they make learning on a large population of speakers possible, ensuring good coverage of speaker characteristics when training a model. The auxiliary model described here uses features extracted from phrases which trigger a speaker verification system. From these features, the model predicts speaker characteristic labels considered useful as side information. The knowledge of the auxiliary model is distilled into a speaker verification system using multi-task learning, with the side information labels predicted by this auxiliary model being the additional task. This approach results in a 6% relative improvement in equal error rate over a baseline system.

下载PDF全文

下载文献需遵守相关版权规定

论文标题