Enabling ECN for datacenter networks with RTT variations

Junxue Zhang, Wei Bai, Kai Chen

Research output: Chapter in Book/Conference Proceeding/ReportConference Paper published in a bookpeer-review

60 Citations (Scopus)

Abstract

ECN has been widely employed in production datacenters to deliver high throughput low latency communications. Despite being successful, prior ECN-based transports have an important drawback: they adopt a fixed RTT value in calculating instantaneous ECN marking threshold while overlooking the RTT variations in practice. In this paper, we reveal that the current practice of using a fixed high-percentile RTT for ECN threshold calculation can lead to persistent queue buildups, significantly increasing packet latency. On the other hand, directly adopting lower percentile RTTs results in throughput degradation. To handle the problem, we introduce ECN#, a simple yet effective solution to enable ECN for RTT variations. At its heart, ECN# inherits the current instantaneous ECN marking (based on a high-percentile RTT) to achieve high throughput and burst tolerance, while further marking packets (conservatively) upon detecting long-term queue buildups to eliminate unnecessary queueing delay without degrading throughput. We implement ECN# on a Barefoot Tofino switch and evaluate it through extensive testbed experiments and large-scale simulations. Our evaluation confirms that ECN# can effectively reduce latency without hurting throughput. For example, compared to the current practice, ECN# achieves up to 23.4% (31.2%) lower average (99th percentile) flow completion time (FCT) for short flows while delivering similar FCT for large flows under production workloads.

Original languageEnglish
Title of host publicationCoNEXT 2019 - Proceedings of the 15th International Conference on Emerging Networking Experiments and Technologies
PublisherAssociation for Computing Machinery, Inc
Pages233-245
Number of pages13
ISBN (Electronic)9781450369985
DOIs
Publication statusPublished - 3 Dec 2019
Event15th ACM International Conference on Emerging Networking Experiments and Technologies, CoNEXT 2019 - Orlando, United States
Duration: 9 Dec 201912 Dec 2019

Publication series

NameCoNEXT 2019 - Proceedings of the 15th International Conference on Emerging Networking Experiments and Technologies

Conference

Conference15th ACM International Conference on Emerging Networking Experiments and Technologies, CoNEXT 2019
Country/TerritoryUnited States
CityOrlando
Period9/12/1912/12/19

Bibliographical note

Publisher Copyright:
© 2019 ACM.

Keywords

  • AQM
  • Datacenters
  • ECN
  • RTT variations

Fingerprint

Dive into the research topics of 'Enabling ECN for datacenter networks with RTT variations'. Together they form a unique fingerprint.

Cite this