Chrome Extension
WeChat Mini Program
Use on ChatGLM

Improving Network Availability with Protective ReRoute

David Wetherall, Abdul Kabbani,Van Jacobson, Jim Winget,Yuchung Cheng, Charles B. Morrey III, Uma Moravapalle,Phillipa Gill, Steven Knight,Amin Vahdat

ACM SIGCOMM '23 Proceedings of the ACM SIGCOMM 2023 Conference(2023)

Cited 0|Views58
No score
Abstract
We present PRR (Protective ReRoute), a transport technique for shortening user-visible outages that complements routing repair. It can be added to any transport to provide benefits in multipath networks. PRR responds to flow connectivity failure signals, e.g., retransmission timeouts, by changing the FlowLabel on packets of the flow, which causes switches and hosts to choose a different network path that may avoid the outage. To enable it, we shifted our IPv6 network architecture to use the FlowLabel, so that hosts can change the paths of their flows without application involvement. PRR is deployed fleetwide at Google for TCP and Pony Express, where it has been protecting all production traffic for several years. It is also available to our Cloud customers. We find it highly effective for real outages. In a measurement study on our network backbones, adding PRR reduced the cumulative region-pair outage time for RPC traffic by 63--84%. This is the equivalent of adding 0.4--0.8 "nines" of availability.
More
Translated text
Key words
Network availability,Multipathing,FlowLabel
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined