SlicK: Slice-based locality exploitation for efficient redundant multithreading

Angshuman Parashar, Anand Sivasubramaniam, Sudhanva Gurumurthi

Research output: Chapter in Book/Report/Conference proceedingConference contribution

34 Scopus citations

Abstract

Transient faults are expected a be a major design consideration in future microprocessors. Recent proposals for transient fault detection in processor cores have revolved around the idea of redundant threading, which involves redundant execution of a program across multiple execution contexts. This paper presents a new approach to redundant threading by bringing together the concepts of slice-level execution and value and control-flow locality into a novel partial redundant threading mechanism called SlicK.The purpose of redundant execution is to check the integrity of the outputs propagating out of the core (typically through stores). SlicK implements redundancy at the granularity of backward-slices of these output instructions and exploits value and control-flow locality to avoid redundantly executing slices that lead to predictable outputs, thereby avoiding redundant execution of a significant fraction of instructions while maintaining extremely low vulnerabilities for critical processor structures.We propose the microarchitecture of a backward-slice extractor called SliceEM that is able to identify backward slices without interrupting the instruction flow, and show how this extractor and a set of predictors can be integrated into a redundant threading mechanism to form SlicK. Detailed simulations with SPEC CPU2000 benchmarks show that SlicK can provide around 10.2% performance improvement over a well known redundant threading mechanism, buying back over 50% of the loss suffered due to redundant execution. SlicK can keep the Architectural Vulnerability Factors of processor structures to typically 0%-2%. More importantly, SlicK's slice-based mechanisms provide future opportunities for exploring interesting points in the performance-reliability design space based on market segment needs.

Original languageEnglish (US)
Title of host publicationASPLOS XII
Subtitle of host publicationTwelfth International Conference on Architectural Support for Programming Languages and Operating Systems
Pages95-105
Number of pages11
DOIs
StatePublished - 2006

Publication series

NameInternational Conference on Architectural Support for Programming Languages and Operating Systems - ASPLOS

All Science Journal Classification (ASJC) codes

  • Software
  • Information Systems
  • Hardware and Architecture

Fingerprint

Dive into the research topics of 'SlicK: Slice-based locality exploitation for efficient redundant multithreading'. Together they form a unique fingerprint.

Cite this