TY - GEN
T1 - Design and management of 3D chip multiprocessors using network-in-memory
AU - Li, Feihui
AU - Nicopoulos, Chrysostomos
AU - Richardson, Thomas
AU - Xie, Yuan
AU - Narayanan, Vijaykrishnan
AU - Kandemir, Mahmut
PY - 2006
Y1 - 2006
N2 - Long interconnects are becoming an increasingly important problem from both power and performance perspectives. This motivates designers to adopt on-chip network-based communication infrastructures and three-dimensional (3D) designs where multiple device layers are stacked together. Considering the current trends towards increasing use of chip multiprocessing, it is timely to consider 3D chip multiprocessor design and memory networking issues, especially in the context of data management in large L2 caches. The overall goal of this paper is to study the challenges for L2 design and management in 3D chip multiprocessors. Our first contribution is to propose a router architecture and a topology design that makes use of a network architecture embedded into the L2 cache memory. Our second contribution is to demonstrate, through extensive experiments, that a 3D L2 memory architecture generates much better results than the conventional two-dimensional (2D) designs under different numbers of layers and vertical (inter-wafer) connections. In particular, our experiments show that a 3D architecture with no dynamic data migration generates better performance than a 2D architecture that employs data migration. This also helps reduce power consumption in L2 due to a reduced number of data movements.
UR - http://www.scopus.com/inward/record.url?scp=33845914023&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=33845914023&partnerID=8YFLogxK
U2 - 10.1109/ISCA.2006.18
DO - 10.1109/ISCA.2006.18
M3 - Conference contribution
AN - SCOPUS:33845914023
SN - 076952608X
SN - 9780769526089
T3 - Proceedings - International Symposium on Computer Architecture
SP - 130
EP - 141
BT - Proceedings - 33rd International Symposium on Computer Architecture, ISCA 2006
T2 - 33rd International Symposium on Computer Architecture, ISCA 2006
Y2 - 17 June 2006 through 21 June 2006
ER -