论文标题
DWTCP:数据中心的超低潜伏期控制协议
DWTCP: Ultra Low Latency Congestion Control Protocol for Data Centers
论文作者
论文摘要
拥塞控制算法依赖于各种拥塞信号(数据包丢失,明确的拥塞通知,延迟等)来实现快速收敛,高利用率和流量之间的公平性。这些拥堵信号的关键局限性是它们要么在反馈中迟到,要么会产生大量的开销。理想的拥塞控制必须发现网络中的任何可用带宽,一旦链接利用接近满负荷,就会发现拥塞,并及时做出反应以避免排队和数据包下降,而没有明显的开销。为此,这项工作提出了侦察服务,该服务利用优先级队列推断出带宽的可用性并在主机上链接忙碌。这里的主要观察结果是,随着高优先级队列(HPQ)变得更忙,低优先级队列(LPQ)的服务较少。因此,可以从LPQ观察到链接的状态,并且与观察HPQ相比,可以更早地检测到几个RTT。我们提出了一个新的运输协议,双窗口传输控制协议(DWTCP),该协议构建在侦察服务上,以动态调整其拥塞窗口。我们的基于测试床和基于模拟的评估表明,侦察兵可以使数据中心的传输获得高吞吐量,接近零的队列,较低的潜伏期和高公平。
Congestion control algorithms rely on a variety of congestion signals (packet loss, Explicit Congestion Notification, delay, etc.) to achieve fast convergence, high utilization, and fairness among flows. A key limitation of these congestion signals is that they are either late in feedback or they incur significant overheads. An ideal congestion control must discover any available bandwidth in the network, detect congestion as soon as link utilization approaches full capacity, and react timely to avoid queuing and packet drops, without significant overheads. To this end, this work proposes Scout service that leverages priority queues to infer bandwidth availability and link busyness at the host. The key observation here is that as the high priority queue (HPQ) gets busier, the low priority queue (LPQ) is served less. Therefore, the state of the link can be observed from the LPQ and any congestion can be detected several RTTs earlier than observing the HPQ. We propose a new transport protocol, Double-Window Transmission Control Protocol (DWTCP) that builds upon the Scout service to dynamically adjust its congestion window. Our testbed and simulation-based evaluation demonstrates that Scout enables a data center transport to achieve high throughput, near-zero queues, lower latency, and high fairness.