SplitServe: Efficiently splitting Apache Spark jobs across FaaS and IaaS

Aman Jain, Bhuvan Urgaonkar, Ata F. Baarzi, Nader Alfares, George Kesidis, Mahmut Kandemir

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

11 Scopus citations

Abstract

Due to their lower startup latencies and finer-grained pricing than virtual machines (VMs), Amazon Lambdas and other cloud functions (CFs) have been identified as ideal candidates for handling unexpected spikes in simple, stateless workloads. However, it is not immediately clear whether CFs would be similarly effective in autoscaling complex workloads involving significant state transfer across distributed application components. We have found that, through careful design, currently available CFs can indeed be useful even for complex workloads. To demonstrate this, we design and implement SplitServe, an enhancement of Apache Spark. If not enough executors on existing VMs are available for a newly arriving latency-sensitive job, SplitServe is able to use CFs to quickly bridge this shortfall in VMs, thereby avoiding the startup latencies of newly requested VMs. When newly requested VMs, or executors on existing VMs, do become available, SplitServe is able to move ongoing work from CFs to them if doing so is desirable in terms of performance or cost. Our experimental evaluation of SplitServe using four different workloads (either on a mixture of VM-based executors and CFs or just CFs) shows that it improves execution time by up to (a) 55% for workloads with small to modest amounts of shuffling, and (b) 31% for workloads with large amounts of shuffling, when compared to only VM-based autoscaling.
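
The shortfall-bridging idea in the abstract can be illustrated with a minimal sketch. This is not SplitServe's implementation or API: the names `Placement`, `plan_placement`, and `migrate_to_vms` are hypothetical, and the sketch assumes that free VM executors are always preferred and that moving work from CFs to VMs is always desirable, whereas the abstract makes that move conditional on performance or cost.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Placement:
    """How many executors a job gets from each source (hypothetical type)."""
    vm_executors: int
    cf_executors: int


def plan_placement(required: int, free_vm_executors: int) -> Placement:
    """Use free VM executors first; cover any remaining shortfall with CFs."""
    if required < 0 or free_vm_executors < 0:
        raise ValueError("counts must be non-negative")
    from_vms = min(required, free_vm_executors)
    return Placement(vm_executors=from_vms, cf_executors=required - from_vms)


def migrate_to_vms(current: Placement, newly_free_vm_executors: int) -> Placement:
    """Shift work from CFs to VM executors as they become available.

    Assumption: migration is always worthwhile. A real policy would
    weigh performance and cost before moving ongoing work.
    """
    if newly_free_vm_executors < 0:
        raise ValueError("counts must be non-negative")
    moved = min(current.cf_executors, newly_free_vm_executors)
    return Placement(
        vm_executors=current.vm_executors + moved,
        cf_executors=current.cf_executors - moved,
    )
```

For example, a job that needs 8 executors when only 3 VM executors are free would start with 3 VM executors and 5 CFs. If 4 more VM executors later become available, 4 of the CF-hosted executors could be moved over, leaving 1 on a CF.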

Original language: English (US)
Title of host publication: Middleware 2020 - Proceedings of the 2020 21st International Middleware Conference
Publisher: Association for Computing Machinery, Inc
Pages: 236-250
Number of pages: 15
ISBN (Electronic): 9781450381536
State: Published - Dec 7 2020
Event: 21st International Middleware Conference, Middleware 2020 - Virtual, Online, Netherlands
Duration: Dec 7 2020 → Dec 11 2020

Publication series

Name: Middleware 2020 - Proceedings of the 2020 21st International Middleware Conference

Conference

Conference: 21st International Middleware Conference, Middleware 2020
Country/Territory: Netherlands
City: Virtual, Online
Period: 12/7/20 → 12/11/20

All Science Journal Classification (ASJC) codes

  • Computer Networks and Communications
  • Information Systems
  • Software
