TY - GEN
T1 - MorphCache
T2 - 17th International Symposium on High-Performance Computer Architecture, HPCA 2011
AU - Srikantaiah, Shekhar
AU - Kultursay, Emre
AU - Zhang, Tao
AU - Kandemir, Mahmut
AU - Irwin, Mary Jane
AU - Xie, Yuan
PY - 2011
Y1 - 2011
N2 - Given the diverse range of application characteristics that chip multiprocessors (CMPs) need to cater to, a "one-cache-topology-fits- all" design philosophy will clearly be inadequate. In this paper, we propose MorphCache, a Reconfigurable Adaptive Multi-level Cache hierarchy. Mor-phCache dynamically tunes a multi-level cache topology in a CMP to allow significantly different cache topologies to exist on the same architecture. Starting from per-core L2 and L3 cache slices as the basic design point, MorphCache alters the cache topology dynamically by merging or splitting cache slices and modifying the accessibility of different cache slice groups to different cores in a CMP. We evaluated MorphCache on a 16 core CMP on a full system simulator and found that it significantly improves both average throughput and harmonic mean of speedups of diverse multithreaded and multiprogrammed workloads. Specifically, our results show that MorphCache improves throughput of the multiprogrammed mixes by 29.9% over a topology with all-shared L2 and L3 caches and 27.9% over a topology with per core private L2 cache and shared L3 cache. In addition, we also compared MorphCache to partitioning a single shared cache at each level using promotion/insertion pseudo-partitioning (PIPP) [28] and managing per-core private cache at each level using dynamic spill receive caches (DSR) [18]. We found that MorphCache improves average throughput by 6.6% over PIPP and by 5.7% over DSR when applied to both L2 and L3 caches.
AB - Given the diverse range of application characteristics that chip multiprocessors (CMPs) need to cater to, a "one-cache-topology-fits- all" design philosophy will clearly be inadequate. In this paper, we propose MorphCache, a Reconfigurable Adaptive Multi-level Cache hierarchy. Mor-phCache dynamically tunes a multi-level cache topology in a CMP to allow significantly different cache topologies to exist on the same architecture. Starting from per-core L2 and L3 cache slices as the basic design point, MorphCache alters the cache topology dynamically by merging or splitting cache slices and modifying the accessibility of different cache slice groups to different cores in a CMP. We evaluated MorphCache on a 16 core CMP on a full system simulator and found that it significantly improves both average throughput and harmonic mean of speedups of diverse multithreaded and multiprogrammed workloads. Specifically, our results show that MorphCache improves throughput of the multiprogrammed mixes by 29.9% over a topology with all-shared L2 and L3 caches and 27.9% over a topology with per core private L2 cache and shared L3 cache. In addition, we also compared MorphCache to partitioning a single shared cache at each level using promotion/insertion pseudo-partitioning (PIPP) [28] and managing per-core private cache at each level using dynamic spill receive caches (DSR) [18]. We found that MorphCache improves average throughput by 6.6% over PIPP and by 5.7% over DSR when applied to both L2 and L3 caches.
UR - http://www.scopus.com/inward/record.url?scp=79955910113&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=79955910113&partnerID=8YFLogxK
U2 - 10.1109/HPCA.2011.5749732
DO - 10.1109/HPCA.2011.5749732
M3 - Conference contribution
AN - SCOPUS:79955910113
SN - 9781424494323
T3 - Proceedings - International Symposium on High-Performance Computer Architecture
SP - 231
EP - 242
BT - Proceedings - 17th International Symposium on High-Performance Computer Architecture, HPCA 2011
Y2 - 12 February 2011 through 16 February 2011
ER -