Multi-resource Fair Sharing for Datacenter Jobs with Placement Constraints

Wei Wang, Baochun Li, Ben Liang, Jun Li

Research output: Chapter in Book/Conference Proceeding/ReportConference Paper published in a bookpeer-review

43 Citations (Scopus)

Abstract

Providing quality-of-service guarantees by means of fair sharing has never been more challenging in datacenters. Due to the heterogeneity of machine configurations, datacenter jobs frequently specify placement constraints, restricting them to run on a particular class of machines meeting specific hardware/software requirements. In addition, jobs have diverse demands across multiple resource types, and may saturate any of the CPU, memory, or storage resources. Despite the rich body of recent work on datacenter scheduling, it remains unclear how multi-resource fair sharing is defined and achieved for jobs with placement constraints. In this paper, we propose a new sharing policy called Task Share Fairness (TSF). With TSF, jobs are better off sharing the datacenter, and are better off reporting demands and constraints truthfully. We have prototyped TSF on Apache Mesos and confirmed its service guarantees in a 50-node EC2 cluster. Trace-driven simulations have further revealed that TSF speeds up 60% of tasks over existing fair schedulers.

Original languageEnglish
Title of host publicationProceedings of SC 2016
Subtitle of host publicationThe International Conference for High Performance Computing, Networking, Storage and Analysis
PublisherIEEE Computer Society
Pages1003-1014
Number of pages12
ISBN (Electronic)9781467388153
DOIs
Publication statusPublished - 2 Jul 2016
Event2016 International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2016 - Salt Lake City, United States
Duration: 13 Nov 201618 Nov 2016

Publication series

NameInternational Conference for High Performance Computing, Networking, Storage and Analysis, SC
Volume0
ISSN (Print)2167-4329
ISSN (Electronic)2167-4337

Conference

Conference2016 International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2016
Country/TerritoryUnited States
CitySalt Lake City
Period13/11/1618/11/16

Bibliographical note

Publisher Copyright:
© 2016 IEEE.

Keywords

  • Cluster schedulers
  • fairness
  • multi-resource allocation
  • placement constraints

Fingerprint

Dive into the research topics of 'Multi-resource Fair Sharing for Datacenter Jobs with Placement Constraints'. Together they form a unique fingerprint.

Cite this