TY - GEN
T1 - DRackSim
T2 - 38th ACM SIGSIM Conference on Principles of Advanced Discrete Simulation, SIGSIM-PADS 2024
AU - Puri, Amit
AU - Bellamkonda, Kartheek
AU - Narreddy, Kailash
AU - Jose, John
AU - Tamarapalli, Venkatesh
AU - Narayanan, Vijaykrishnan
N1 - Publisher Copyright:
© 2024 ACM.
PY - 2024/6/24
Y1 - 2024/6/24
N2 - Memory disaggregation has emerged as an alternative to traditional server architecture in data centers to target better memory utilization and higher scalability. It involves multiple independent compute nodes and remote memory pools that get hardware support through high-speed cache-coherent interconnects such as CXL. This paper introduces DRackSim, a simulation infrastructure for scalable disaggregated memory systems. DRackSim primarily models multiple compute nodes, memory pools, local/global memory managers, and a network interconnect for coherent memory access. An application-level simulation approach simulates an out-of-order x86 multi-core processor and a multi-level cache hierarchy at compute nodes. The network interface is simulated through a queue-based approach to handle remote memory access at multiple granularity. It also models a global memory manager for remote address space at the memory pools. Finally, we integrate a modified DRAMSim2 to perform local/remote memory simulation by declaring multiple instances of DRAMSim2. We rigorously validate DRackSim subsystems against Gem5 and a hardware prototype. Finally, we explore the design space by modeling various use-case scenarios for disaggregated memory systems and evaluate their performance over various HPC and cloud benchmarks.
AB - Memory disaggregation has emerged as an alternative to traditional server architecture in data centers to target better memory utilization and higher scalability. It involves multiple independent compute nodes and remote memory pools that get hardware support through high-speed cache-coherent interconnects such as CXL. This paper introduces DRackSim, a simulation infrastructure for scalable disaggregated memory systems. DRackSim primarily models multiple compute nodes, memory pools, local/global memory managers, and a network interconnect for coherent memory access. An application-level simulation approach simulates an out-of-order x86 multi-core processor and a multi-level cache hierarchy at compute nodes. The network interface is simulated through a queue-based approach to handle remote memory access at multiple granularity. It also models a global memory manager for remote address space at the memory pools. Finally, we integrate a modified DRAMSim2 to perform local/remote memory simulation by declaring multiple instances of DRAMSim2. We rigorously validate DRackSim subsystems against Gem5 and a hardware prototype. Finally, we explore the design space by modeling various use-case scenarios for disaggregated memory systems and evaluate their performance over various HPC and cloud benchmarks.
UR - http://www.scopus.com/inward/record.url?scp=85197554250&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85197554250&partnerID=8YFLogxK
U2 - 10.1145/3615979.3656059
DO - 10.1145/3615979.3656059
M3 - Conference contribution
AN - SCOPUS:85197554250
T3 - ACM International Conference Proceeding Series
SP - 3
EP - 14
BT - SIGSIM PADS 2024 - Proceedings of the 38th ACM SIGSIM International Conference on Principles of Advanced Discrete Simulation
PB - Association for Computing Machinery
Y2 - 24 June 2024 through 26 June 2024
ER -