论文标题

最佳的超级量度负载平衡,并严格的队列限制

Optimal Hyper-Scalable Load Balancing with a Strict Queue Limit

论文作者

van der Boor, Mark, Borst, Sem, van Leeuwaarden, Johan

论文摘要

负载平衡在有效派遣云网络和数据中心等并行服务系统中的作业方面起着至关重要的作用。负载平衡算法设计的基本挑战是在延迟性能和实施开销(例如通信或内存使用情况)之间实现最佳权衡。这一权衡主要是从远离实现渐近性能所需的高架量的角度进行研究的,尤其是大规模系统中延迟的消失。相比之下,在本文中,我们专注于任意稀疏的沟通预算,这可能远低于消失延迟的最低要求,这被称为“超量表”操作区域。此外,只有在可以保证工作队列位置的特定限制时,才能被录取。 我们分析的核心是用于给定通信预算和队列限制的任何调度员驱动算法的可实现吞吐量的通用上限。我们还提出了一个特定的超级尺度方案,该方案可以以任何给定的消息速率运行并执行任何给定的队列限制,同时允许通过封闭的产品形式网络捕获服务器状态,在该网络中,服务器充当遍历各种节点的客户。利用产品形式的分布来证明界限是紧密的,并且鉴于通信和队列限制限制,提出的超尺度方案在多服务器方面是吞吐量的最佳选择。进行了广泛的仿真实验以说明结果。

Load balancing plays a critical role in efficiently dispatching jobs in parallel-server systems such as cloud networks and data centers. A fundamental challenge in the design of load balancing algorithms is to achieve an optimal trade-off between delay performance and implementation overhead (e.g. communication or memory usage). This trade-off has primarily been studied so far from the angle of the amount of overhead required to achieve asymptotically optimal performance, particularly vanishing delay in large-scale systems. In contrast, in the present paper, we focus on an arbitrarily sparse communication budget, possibly well below the minimum requirement for vanishing delay, referred to as the hyper-scalable operating region. Furthermore, jobs may only be admitted when a specific limit on the queue position of the job can be guaranteed. The centerpiece of our analysis is a universal upper bound for the achievable throughput of any dispatcher-driven algorithm for a given communication budget and queue limit. We also propose a specific hyper-scalable scheme which can operate at any given message rate and enforce any given queue limit, while allowing the server states to be captured via a closed product-form network, in which servers act as customers traversing various nodes. The product-form distribution is leveraged to prove that the bound is tight and that the proposed hyper-scalable scheme is throughput-optimal in a many-server regime given the communication and queue limit constraints. Extensive simulation experiments are conducted to illustrate the results.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源