A (In)Cast of Thousands: Scaling Datacenter TCP to Kiloservers and Gigabits (CMU-PDL-09-101)

Vasudevan, Vijay; Phanishayee, Amar; Shah, Hiral; Krevat, Elie; Anderson, David G.; Ganger, Gregory R.; Gibson, Garth A.

doi:10.1184/R1/6619352.v1

file.pdf (316.07 kB)

A (In)Cast of Thousands: Scaling Datacenter TCP to Kiloservers and Gigabits (CMU-PDL-09-101)

journal contribution

posted on 2009-02-01, 00:00 authored by Vijay Vasudevan, Amar Phanishayee, Hiral Shah, Elie Krevat, David G. Anderson, Gregory R. Ganger, Garth A. Gibson

This paper presents a practical solution to the problem of high-fan-in, high-bandwidth synchronized TCP workloads in datacenter Ethernets—the Incast problem. In these networks, receivers often experience a drastic reduction in throughput when simultaneously requesting data from many servers using TCP. Inbound data overfills small switch buffers, leading to TCP timeouts lasting hundreds of milliseconds. For many datacenter workloads that have a synchronization requirement (e.g., filesystem reads and parallel dataintensive queries), incast can reduce throughput by up to 90%. Our solution for incast uses high-resolution timers in TCP to allow for microsecond-granularity timeouts. We show that this technique is effective in avoiding incast using simulation and real-world experiments. Last, we show that eliminating the minimum retransmission timeout bound is safe for all environments, including the wide-area.

History

Publisher Statement

Date

2009-02-01

Usage metrics

Keywords

Cluster-based storage systems TCP performance measurement and analysis

Licence

In Copyright

Exports

RefWorks

BibTeX

Ref. manager

Endnote

DataCite

NLM

DC

A (In)Cast of Thousands: Scaling Datacenter TCP to Kiloservers and Gigabits (CMU-PDL-09-101)

History

Publisher Statement

Date

Usage metrics

Categories

Keywords

Licence

Exports