TY - GEN
T1 - Towards resilient micro-architectures
T2 - 2011 IEEE Computer Society Annual Symposium on VLSI, ISVLSI 2011
AU - Swaminathan, Karthik
AU - Mukundrajan, Ravindhiran
AU - Soundararajan, Niranjan
AU - Narayanan, Vijaykrishnan
PY - 2011
Y1 - 2011
N2 - Transistor scaling and reduction in operating voltages have resulted in cosmic-ray induced soft errors becoming a major threat for reliable processor operation. With the raw device soft error rate expected to remain constant in future generations, the explosion in on-chip transistor count is expected to have a corresponding impact on overall error rate. Consequently it becomes necessary to incorporate resiliency into the pipeline datapath. However, existing methods like redundant execution or error correction used in memory are non-ideal for the pipeline due to their impact on overall performance. In this paper, we propose a novel technique that exploits the characteristics of Spin Transfer Torque - Magnetic Random Access Memory (STT-MRAM) for providing protection of all storage structures in the pipeline, namely the Reorder Buffer, Issue Queue and Load-Store Queue. We identify specific periods during an application's runtime when the MRAM can capture a "snapshot" of the invariant micro-architectural state and restore it later thereby reducing soft error vulnerability. We quantify the reduction in Architectural Vulnerability Factor (AVF) to be 32% on average, with a performance overhead of less than 6%. Further, we present a case where the proposed technique can be combined with existing soft error protection techniques like Instruction Duplication to further enhance system reliability.
AB - Transistor scaling and reduction in operating voltages have resulted in cosmic-ray induced soft errors becoming a major threat for reliable processor operation. With the raw device soft error rate expected to remain constant in future generations, the explosion in on-chip transistor count is expected to have a corresponding impact on overall error rate. Consequently it becomes necessary to incorporate resiliency into the pipeline datapath. However, existing methods like redundant execution or error correction used in memory are non-ideal for the pipeline due to their impact on overall performance. In this paper, we propose a novel technique that exploits the characteristics of Spin Transfer Torque - Magnetic Random Access Memory (STT-MRAM) for providing protection of all storage structures in the pipeline, namely the Reorder Buffer, Issue Queue and Load-Store Queue. We identify specific periods during an application's runtime when the MRAM can capture a "snapshot" of the invariant micro-architectural state and restore it later thereby reducing soft error vulnerability. We quantify the reduction in Architectural Vulnerability Factor (AVF) to be 32% on average, with a performance overhead of less than 6%. Further, we present a case where the proposed technique can be combined with existing soft error protection techniques like Instruction Duplication to further enhance system reliability.
UR - http://www.scopus.com/inward/record.url?scp=80052564953&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=80052564953&partnerID=8YFLogxK
U2 - 10.1109/ISVLSI.2011.84
DO - 10.1109/ISVLSI.2011.84
M3 - Conference contribution
AN - SCOPUS:80052564953
SN - 9780769544472
T3 - Proceedings - 2011 IEEE Computer Society Annual Symposium on VLSI, ISVLSI 2011
SP - 236
EP - 241
BT - Proceedings - 2011 IEEE Computer Society Annual Symposium on VLSI, ISVLSI 2011
Y2 - 4 July 2011 through 6 July 2011
ER -