Fast Networks and Slow Memories: A Mechanism for Mitigating Bandwidth Mismatches

2017 IEEE 25th Annual Symposium on High-Performance Interconnects (HOTI)(2017)

引用 2|浏览105
暂无评分
摘要
The advent of non-volatile memory (NVM) technologies has added an interesting nuance to the node level memory hierarchy. With modern 100 Gb/s networks, the NVM tier of storage can often be slower than the high performance network in the system; thus, a new challenge arises in the datacenter. Whereas prior efforts have studied the impacts of multiple sources targeting one node (i.e., incast) and have studied multiple flows causing congestion in inter-switch links, it is now possible for a single flow from a single source to overwhelm the bandwidth of a key portion of the memory hierarchy. This can subsequently spread to the switches and lead to congestion trees in a flow-controlled network or excessive packet drops without flow control. In this work we describe protocols which avoid overwhelming the receiver in the case of a source/sink rate mismatch. We design our protocols on top of Portals 4, which enables us to make use of network offload. Our protocol yields up to 4x higher throughput in a 5k node Dragonfly topology for a permutation traffic pattern in which only 1% of all nodes have a memory write-bandwidth limitation of 1/8th of the network bandwidth.
更多
查看译文
关键词
node level memory hierarchy,flow-controlled network,network offload,memory write-bandwidth limitation,nonvolatile memory technologies,protocol,bandwidth mismatch mitigation,NVM technologies,datacenter,interswitch links,source-sink rate mismatch,Portals 4,Dragonfly topology,bit rate 100 Gbit/s
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要