Fast and Accurate DNN Performance Estimation across Diverse Hardware Platforms

Vishwas Vasudeva Kakrannaya, Siddhartha Balakrishna Rai, Anand Sivasubramaniam, Timothy Zhu

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Performance modeling is an important tool for many purposes such as designing hardware accelerators, improving scheduling, optimizing system parameters, procuring new hardware, etc. This paper provides a new methodology for constructing performance models for Deep Neural Networks (DNNs), a popular machine learning workload. Prior works require running DNNs on existing hardware, which may not be available, or simulating the computation on futuristic hardware, which is slow and not scalable. We instead take an analytical approach based on analyzing the raw operations within DNN algorithms, which allows us to estimate performance across any hardware, even hardware that is in the process of being designed. Evaluations show our approach is fast and gives a good first order approximation pm 10-15% accuracy) across many DNNs and hardware platforms including GPUs, CPUs, and a futuristic Processing In Memory (PIM) accelerator called BLIMP.

Original languageEnglish (US)
Title of host publicationProceedings - 2024 IEEE 32nd International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems, MASCOTS 2024
PublisherIEEE Computer Society
ISBN (Electronic)9798331531300
DOIs
StatePublished - 2024
Event32nd IEEE International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems, MASCOTS 2024 - Krakow, Poland
Duration: Oct 21 2024Oct 23 2024

Publication series

NameProceedings - IEEE Computer Society's Annual International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunications Systems, MASCOTS
ISSN (Print)1526-7539

Conference

Conference32nd IEEE International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems, MASCOTS 2024
Country/TerritoryPoland
CityKrakow
Period10/21/2410/23/24

All Science Journal Classification (ASJC) codes

  • Electrical and Electronic Engineering
  • Computer Networks and Communications
  • Software
  • Modeling and Simulation

Fingerprint

Dive into the research topics of 'Fast and Accurate DNN Performance Estimation across Diverse Hardware Platforms'. Together they form a unique fingerprint.

Cite this