Skip to main navigation Skip to search Skip to main content

Workload Intelligence: Workload-Aware IaaS abstraction for Cloud Efficiency

  • Lexiang Huang
  • , Anjaly Parayil
  • , Jue Zhang
  • , Xiaoting Qin
  • , Chetan Bansal
  • , Jovan Stojkovic
  • , Pantea Zardoshti
  • , Pulkit Misra
  • , Eli Cortez
  • , Raphael Ghelman
  • , Íñigo Goiri
  • , Saravan Rajmohan
  • , Jim Kleewein
  • , Rodrigo Fonseca
  • , Timothy Zhu
  • , Ricardo Bianchini

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Today, cloud workloads are largely opaque to the cloud platform. Typically, the only information the platform receives is the virtual machine (VM) type and possibly a decoration to the type (e.g., the VM is evictable). Similarly, workloads receive minimal information from the platform; generally, only telemetry from their VMs or occasional signals (e.g., just before a VM is evicted). The narrow interface between workloads and platforms has several drawbacks: (1) a surge in VM types and decorations in public cloud platforms complicates customer selection; (2) key workload characteristics (e.g., low availability requirements) are often unspecified, hindering platform customization for optimized resource usage and cost savings; and (3) workloads may be unaware of potential optimizations or lack sufficient time to react to platform events. To resolve these issues and improve cloud efficiency, we propose Workload Intelligence (WI), a framework for enabling dynamic bi-directional communication between cloud workloads and cloud platform.

Original languageEnglish (US)
Title of host publicationProceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis, SC 2025
PublisherAssociation for Computing Machinery, Inc
Pages2203-2215
Number of pages13
ISBN (Electronic)9798400714665
DOIs
StatePublished - Nov 15 2025
Event2025 International Conference for High Performance Computing, Networking, Storage, and Analysis, SC 2025 - St. Louis, United States
Duration: Nov 16 2025Nov 21 2025

Publication series

NameProceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis, SC 2025

Conference

Conference2025 International Conference for High Performance Computing, Networking, Storage, and Analysis, SC 2025
Country/TerritoryUnited States
CitySt. Louis
Period11/16/2511/21/25

All Science Journal Classification (ASJC) codes

  • Computational Theory and Mathematics
  • Computer Networks and Communications
  • Hardware and Architecture

Fingerprint

Dive into the research topics of 'Workload Intelligence: Workload-Aware IaaS abstraction for Cloud Efficiency'. Together they form a unique fingerprint.

Cite this