Online VM Service Selection with Spot Cores for Dynamic Workloads

Nader Alfares, G. Kesidis, Bhuvan Urgaonkar, Ata Fatahi Baarzi, Aman Jain

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Over the past ten years, many different approaches have been proposed for different aspects of the problem of cost-effective resources management for long running, dynamic and diverse workloads such as processing query streams or distributed deep learning. Particularly for applications consisting of containerized microservices, researchers have attempted to address problems of dynamic selection of, for example: types and quantities of virtualized services (e.g., VMs, serverless functions, data-storage), vertical and horizontal scaling of different mi-croservices, assigning microservices to VMs, task scheduling, or some combination thereof. Herein focusing on selection decisions of on-demand VM services, we consider the problem of creating and actively maintaining a training dataset for supervised machine-learned frameworks like deep neural networks and more light-weight, adaptable online optimization frameworks. For both decision frameworks, we make a case for the usefulness of spot cores and incremental search techniques like simulated annealing to reduce workload preemption while searching the decision space to explore the trade-offs between service-level objectives (SLOs) and cloud-spend. Based on user input, a macroscopic objective that captures both performance and cost will be used. We are particularly interested in scenarios with complex workloads and cloud-service offerings that vary over time.

Original languageEnglish (US)
Title of host publicationProceeding - 2024 IEEE Cloud Summit, Cloud Summit 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages54-60
Number of pages7
ISBN (Electronic)9798350370065
DOIs
StatePublished - 2024
Event2024 IEEE Cloud Summit, Cloud Summit 2024 - Washington, United States
Duration: Jun 27 2024Jun 28 2024

Publication series

NameProceeding - 2024 IEEE Cloud Summit, Cloud Summit 2024

Conference

Conference2024 IEEE Cloud Summit, Cloud Summit 2024
Country/TerritoryUnited States
CityWashington
Period6/27/246/28/24

All Science Journal Classification (ASJC) codes

  • Computer Networks and Communications
  • Information Systems and Management
  • Safety, Risk, Reliability and Quality
  • Hardware and Architecture

Fingerprint

Dive into the research topics of 'Online VM Service Selection with Spot Cores for Dynamic Workloads'. Together they form a unique fingerprint.

Cite this