论文标题
Ridgeline:分布式系统的2D车顶线模型
Ridgeline: A 2D Roofline Model for Distributed Systems
论文作者
论文摘要
在这篇简短的论文中,我们介绍了Ridgeline模型,这是分布式系统的屋顶线模型[4]的扩展。车顶线模型针对共享内存系统,根据其操作强度以及执行系统的峰值计算吞吐量和内存带宽来界定内核的性能。在分布式设置中,具有多个通信计算实体,必须考虑网络以准确地对系统行为进行建模。 Ridgeline以一个2D图汇总了有关计算,内存和网络限制的信息,以直观的方式显示哪个资源是预期的瓶颈。我们显示了基于数据并行多层感知器(MLP)实例的案例研究中山脊线的适用性。
In this short paper, we introduce the Ridgeline model, an extension of the Roofline model [4] for distributed systems. The Roofline model targets shared memory systems, bounding the performance of a kernel based on its operational intensity, and the peak compute throughput and memory bandwidth of the execution system. In a distributed setting, with multiple communicating compute entities, the network must be taken into account to model the system behavior accurately. The Ridgeline aggregates information on compute, memory, and network limits in one 2D plot to show, in an intuitive way, which of the resources is the expected bottleneck. We show the applicability of the Ridgeline in a case study based on a data-parallel Multi-Layer Perceptron (MLP) instance.