DRackSim: Simulating CXL-enabled Large-Scale Disaggregated Memory Systems

Amit Puri, Kartheek Bellamkonda, Kailash Narreddy, John Jose, Venkatesh Tamarapalli, Vijaykrishnan Narayanan

Research output: Chapter in Book/Report/Conference proceedingConference contribution

3 Scopus citations

Abstract

Memory disaggregation has emerged as an alternative to traditional server architecture in data centers to target better memory utilization and higher scalability. It involves multiple independent compute nodes and remote memory pools that get hardware support through high-speed cache-coherent interconnects such as CXL. This paper introduces DRackSim, a simulation infrastructure for scalable disaggregated memory systems. DRackSim primarily models multiple compute nodes, memory pools, local/global memory managers, and a network interconnect for coherent memory access. An application-level simulation approach simulates an out-of-order x86 multi-core processor and a multi-level cache hierarchy at compute nodes. The network interface is simulated through a queue-based approach to handle remote memory access at multiple granularity. It also models a global memory manager for remote address space at the memory pools. Finally, we integrate a modified DRAMSim2 to perform local/remote memory simulation by declaring multiple instances of DRAMSim2. We rigorously validate DRackSim subsystems against Gem5 and a hardware prototype. Finally, we explore the design space by modeling various use-case scenarios for disaggregated memory systems and evaluate their performance over various HPC and cloud benchmarks.

Original languageEnglish (US)
Title of host publicationSIGSIM PADS 2024 - Proceedings of the 38th ACM SIGSIM International Conference on Principles of Advanced Discrete Simulation
PublisherAssociation for Computing Machinery
Pages3-14
Number of pages12
ISBN (Electronic)9798400703638
DOIs
StatePublished - Jun 24 2024
Event38th ACM SIGSIM Conference on Principles of Advanced Discrete Simulation, SIGSIM-PADS 2024 - Atlanta, United States
Duration: Jun 24 2024Jun 26 2024

Publication series

NameACM International Conference Proceeding Series

Conference

Conference38th ACM SIGSIM Conference on Principles of Advanced Discrete Simulation, SIGSIM-PADS 2024
Country/TerritoryUnited States
CityAtlanta
Period6/24/246/26/24

All Science Journal Classification (ASJC) codes

  • Human-Computer Interaction
  • Computer Networks and Communications
  • Computer Vision and Pattern Recognition
  • Software

Fingerprint

Dive into the research topics of 'DRackSim: Simulating CXL-enabled Large-Scale Disaggregated Memory Systems'. Together they form a unique fingerprint.

Cite this