Astraea: Towards Fair and Efficient Learning-based Congestion Control

Xudong Liao, Han Tian, Chaoliang Zeng, Xinchen Wan, Kai Chen*

*Corresponding author for this work

Research output: Chapter in Book/Conference Proceeding/ReportConference Paper published in a bookpeer-review

12 Citations (Scopus)

Abstract

Recent years have witnessed a plethora of learning-based solutions for congestion control (CC) that demonstrate better performance over traditional TCP schemes. However, they fail to provide consistently good convergence properties, including fairness, fast convergence and stability, due to the mismatch between their objective functions and these properties. Despite being intuitive, integrating these properties into existing learning-based CC is challenging, because: 1) their training environments are designed for the performance optimization of single flow but incapable of cooperative multi-flow optimization, and 2) there is no directly measurable metric to represent these properties into the training objective function. We present Astraea, a new learning-based congestion control that ensures fast convergence to fairness with stability. At the heart of Astraea is a multi-agent deep reinforcement learning framework that explicitly optimizes these convergence properties during the training process by enabling the learning of interactive policy between multiple competing flows, while maintaining high performance. We further build a faithful multi-flow environment that emulates the competing behaviors of concurrent flows, explicitly expressing convergence properties to enable their optimization during training. We have fully implemented Astraea and our comprehensive experiments show that Astraea can quickly converge to fairness point and exhibit better stability than its counterparts. For example, Astraea achieves near-optimal bandwidth sharing (i.e., fairness) when multiple flows compete for the same bottleneck, delivers up to 8.4× faster convergence speed and 2.8× smaller throughput deviation, while achieving comparable or even better performance over prior solutions.

Original languageEnglish
Title of host publicationEuroSys 2024 - Proceedings of the 2024 European Conference on Computer Systems
PublisherAssociation for Computing Machinery, Inc
Pages99-114
Number of pages16
ISBN (Electronic)9798400704376
DOIs
Publication statusPublished - 22 Apr 2024
Event19th European Conference on Computer Systems, EuroSys 2024 - Athens, Greece
Duration: 22 Apr 202425 Apr 2024

Publication series

NameEuroSys 2024 - Proceedings of the 2024 European Conference on Computer Systems

Conference

Conference19th European Conference on Computer Systems, EuroSys 2024
Country/TerritoryGreece
CityAthens
Period22/04/2425/04/24

Bibliographical note

Publisher Copyright:
© 2024 ACM.

Keywords

  • Congestion Control
  • Reinforcement Learning
  • Transport Protocol

Fingerprint

Dive into the research topics of 'Astraea: Towards Fair and Efficient Learning-based Congestion Control'. Together they form a unique fingerprint.

Cite this