论文标题
用于层次多任务学习的编码分布式计算
Coded Distributed Computing for Hierarchical Multi-task Learning
论文作者
论文摘要
在本文中,我们考虑了一个层次分布式多任务学习(MTL)系统,在该系统中,分布式用户希望借助多个继电器一层,共同学习由中央服务器协调的不同模型。由于用户需要在下行链路传输中下载不同的学习模型,因此与单任务学习系统相比,分布式的MTL在通信瓶颈上更严重。为了解决此问题,我们提出了一个编码的层次MTL方案,该方案利用连接拓扑并引入编码技术以减少通信负载。结果表明,所提出的方案可以显着减少继电器和服务器之间上行链路和下行链路传输中的通信负载。此外,我们在最佳上行链路和下行链路通信载荷上提供了信息理论的下限,并证明可实现的上限和下限之间的差距在所有继电器之间的最小连接用户数量之内。特别是,当可以精心设计网络连接拓扑时,建议的方案可以实现信息理论的最佳通信负载。实际数据集的实验表明,与传统的未编码方案相比,我们提出的计划可以将整体培训时间降低17%$ \ sim $ 26%。
In this paper, we consider a hierarchical distributed multi-task learning (MTL) system where distributed users wish to jointly learn different models orchestrated by a central server with the help of a layer of multiple relays. Since the users need to download different learning models in the downlink transmission, the distributed MTL suffers more severely from the communication bottleneck compared to the single-task learning system. To address this issue, we propose a coded hierarchical MTL scheme that exploits the connection topology and introduces coding techniques to reduce communication loads. It is shown that the proposed scheme can significantly reduce the communication loads both in the uplink and downlink transmissions between relays and the server. Moreover, we provide information-theoretic lower bounds on the optimal uplink and downlink communication loads, and prove that the gaps between achievable upper bounds and lower bounds are within the minimum number of connected users among all relays. In particular, when the network connection topology can be delicately designed, the proposed scheme can achieve the information-theoretic optimal communication loads. Experiments on real datasets show that our proposed scheme can reduce the overall training time by 17% $\sim$ 26% compared to the conventional uncoded scheme.