论文标题
关于与重叠区域的多服务器联合学习的收敛
On the Convergence of Multi-Server Federated Learning with Overlapping Area
论文作者
论文摘要
多服务器联合学习(FL)被认为是解决单服务器FL的有限通信资源问题的有前途解决方案。我们考虑了典型的多服务器FL体系结构,区域服务器的覆盖范围可能会重叠。该体系结构的关键点是,位于重叠区域中的客户端根据所有可访问的区域模型的平均模型更新其本地模型,这使不同区域服务器之间的间接模型共享。由于网络拓扑的复杂,收敛分析比单人服务器FL更具挑战性。在本文中,我们首先为该多服务器FL架构提出了一种新颖的MS-FEDAVG算法,并分析其在非IID数据集上的一般非凸面设置的收敛性。由于位于每个区域服务器中的客户端数量远低于单服务器FL中,因此每个客户端的带宽应足够大,可以与服务器成功通信培训模型,这表明完整的客户参与可以在多服务器fl中使用。此外,我们还提供了部分客户参与计划的收敛分析,并制定了新的有偏见的部分参与策略,以进一步加速收敛。我们的结果表明,收敛结果很大程度上取决于每个区域类型中客户端数量与所有三种策略中客户总数的比率。广泛的实验表现出显着的性能并支持我们的理论结果。
Multi-server Federated learning (FL) has been considered as a promising solution to address the limited communication resource problem of single-server FL. We consider a typical multi-server FL architecture, where the coverage areas of regional servers may overlap. The key point of this architecture is that the clients located in the overlapping areas update their local models based on the average model of all accessible regional models, which enables indirect model sharing among different regional servers. Due to the complicated network topology, the convergence analysis is much more challenging than single-server FL. In this paper, we firstly propose a novel MS-FedAvg algorithm for this multi-server FL architecture and analyze its convergence on non-iid datasets for general non-convex settings. Since the number of clients located in each regional server is much less than in single-server FL, the bandwidth of each client should be large enough to successfully communicate training models with the server, which indicates that full client participation can work in multi-server FL. Also, we provide the convergence analysis of the partial client participation scheme and develop a new biased partial participation strategy to further accelerate convergence. Our results indicate that the convergence results highly depend on the ratio of the number of clients in each area type to the total number of clients in all three strategies. The extensive experiments show remarkable performance and support our theoretical results.