论文标题

Couder:光电开关数据中心网络的强大拓扑工程

COUDER: Robust Topology Engineering for Optical Circuit Switched Data Center Networks

论文作者

Teh, Min Yee, Zhao, Shizhen, Cao, Peirui, Bergman, Keren

论文摘要

过去已经提出了许多光电路交换数据中心网络(DCN),以达到更高的容量和拓扑结构性,尽管这些架构的商业采用量很小。这些架构面对的一个主要挑战是难以使用具有较高开关延迟的商用光电开关(OCS)来处理不确定的交通需求。先前的工作通常集中于开发快速开关的OCS原型,以通过频繁的重新配置来快速对流量变化做出反应。但是,这种方法为控制平面增添了巨大的复杂性,并提高了光电路开关数据中心网络的商业采用障碍。 我们提出了Couder,这是可重新配置的光电路切换数据中心的强大拓扑和路由优化框架。 Couder根据一组凸流矩阵优化了拓扑和路由,并为任何未来的流量矩阵提供了严格的吞吐量保证。对于凸组集合的爆发交通需求,我们采用一种脱敏技术来降低性能命中率。这使Couder能够生成拓扑和路由解决方案,能够处理意外的流量变化,而无需依赖频繁的拓扑重新配置。我们基于Facebook的生产DCN痕迹的广泛评估表明,即使每天重新配置,与成本等效的静态拓扑相比,Couder的吞吐量高约20 \%,平均HOP计数低约32 \%。我们的工作表明,即使没有快速的OCS,在商业DCN中采用可重构拓扑也是可行的。

Many optical circuit switched data center networks (DCN) have been proposed in the past to attain higher capacity and topology reconfigurability, though commercial adoption of these architectures have been minimal. One major challenge these architectures face is the difficulty of handling uncertain traffic demands using commercial optical circuit switches (OCS) with high switching latency. Prior works have generally focused on developing fast-switching OCS prototypes to quickly react to traffic changes through frequent reconfigurations. This approach, however, adds tremendous complexity to the control plane, and raises the barrier for commercial adoption of optical circuit switched data center networks. We propose COUDER, a robust topology and routing optimization framework for reconfigurable optical circuit switched data centers. COUDER optimizes topology and routing based on a convex set of traffic matrices, and offers strict throughput guarantees for any future traffic matrices bounded by the convex set. For the bursty traffic demands that are unbounded by the convex set, we employ a desensitization technique to reduce performance hit. This enables COUDER to generate topology and routing solutions capable of handling unexpected traffic changes without relying on frequent topology reconfigurations. Our extensive evaluations based on Facebook's production DCN traces show that, even with daily reconfiguration, COUDER achieves about 20\% higher throughput, and about 32\% lower average hop count compared to cost-equivalent static topologies. Our work shows that adoption of reconfigurable topologies in commercial DCNs is feasible even without fast OCSs.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源