Visible to the public A Survey of End-System Optimizations for High-Speed Networks

TitleA Survey of End-System Optimizations for High-Speed Networks
Publication TypeJournal Article
Year of Publication2018
AuthorsHanford, Nathan, Ahuja, Vishal, Farrens, Matthew K., Tierney, Brian, Ghosal, Dipak
JournalACM Comput. Surv.
Volume51
Pagination54:1-54:36
ISSN0360-0300
Keywordsclean slate, Collaboration, end-system bottleneck, experimental analysis, flow control, high-speed networks, Human Behavior, human factors, Metrics, policy, Policy Based Governance, policy governance, pubcrawl, queueing model, rate-based protocol, resilience, Resiliency
Abstract

The gap is widening between the processor clock speed of end-system architectures and network throughput capabilities. It is now physically possible to provide single-flow throughput of speeds up to 100 Gbps, and 400 Gbps will soon be possible. Most current research into high-speed data networking focuses on managing expanding network capabilities within datacenter Local Area Networks (LANs) or efficiently multiplexing millions of relatively small flows through a Wide Area Network (WAN). However, datacenter hyper-convergence places high-throughput networking workloads on general-purpose hardware, and distributed High-Performance Computing (HPC) applications require time-sensitive, high-throughput end-to-end flows (also referred to as ``elephant flows'') to occur over WANs. For these applications, the bottleneck is often the end-system and not the intervening network. Since the problem of the end-system bottleneck was uncovered, many techniques have been developed which address this mismatch with varying degrees of effectiveness. In this survey, we describe the most promising techniques, beginning with network architectures and NIC design, continuing with operating and end-system architectures, and concluding with clean-slate protocol design.

URLhttps://dl.acm.org/doi/10.1145/3184899
DOI10.1145/3184899
Citation Keyhanford_survey_2018