Towards resilient micro-architectures: Datapath reliability enhancement using STT-MRAM

Karthik Swaminathan, Ravindhiran Mukundrajan, Niranjan Soundararajan, Vijaykrishnan Narayanan

Research output: Chapter in Book/Report/Conference proceedingConference contribution

7 Scopus citations

Abstract

Transistor scaling and reduction in operating voltages have resulted in cosmic-ray induced soft errors becoming a major threat for reliable processor operation. With the raw device soft error rate expected to remain constant in future generations, the explosion in on-chip transistor count is expected to have a corresponding impact on overall error rate. Consequently it becomes necessary to incorporate resiliency into the pipeline datapath. However, existing methods like redundant execution or error correction used in memory are non-ideal for the pipeline due to their impact on overall performance. In this paper, we propose a novel technique that exploits the characteristics of Spin Transfer Torque - Magnetic Random Access Memory (STT-MRAM) for providing protection of all storage structures in the pipeline, namely the Reorder Buffer, Issue Queue and Load-Store Queue. We identify specific periods during an application's runtime when the MRAM can capture a "snapshot" of the invariant micro-architectural state and restore it later thereby reducing soft error vulnerability. We quantify the reduction in Architectural Vulnerability Factor (AVF) to be 32% on average, with a performance overhead of less than 6%. Further, we present a case where the proposed technique can be combined with existing soft error protection techniques like Instruction Duplication to further enhance system reliability.

Original languageEnglish (US)
Title of host publicationProceedings - 2011 IEEE Computer Society Annual Symposium on VLSI, ISVLSI 2011
Pages236-241
Number of pages6
DOIs
StatePublished - 2011
Event2011 IEEE Computer Society Annual Symposium on VLSI, ISVLSI 2011 - Chennai, India
Duration: Jul 4 2011Jul 6 2011

Publication series

NameProceedings - 2011 IEEE Computer Society Annual Symposium on VLSI, ISVLSI 2011

Other

Other2011 IEEE Computer Society Annual Symposium on VLSI, ISVLSI 2011
Country/TerritoryIndia
CityChennai
Period7/4/117/6/11

All Science Journal Classification (ASJC) codes

  • Hardware and Architecture
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'Towards resilient micro-architectures: Datapath reliability enhancement using STT-MRAM'. Together they form a unique fingerprint.

Cite this