TY - GEN
T1 - Exploring performance-power tradeoffs in providing reliability for NoC-based MPSoCs
AU - Zhao, Hui
AU - Kandemir, Mahmut
AU - Irwin, Mary Jane
PY - 2011
Y1 - 2011
N2 - Performance and power consumption are important challenges faced by Network-on-Chip (NoC) designers. The situation is exacerbated when error control techniques are employed to provide reliability, since such techniques can lead to extra power consumption and execution cycles. In many systems today, ECC codes are used for error detection. Once an error is detected, recovery schemes are invoked to correct it. In this paper, we focus on tuning error recovery schemes to explore performance, power and reliability tradeoffs. Previous reliability work targeting NoCs proposed two retransmission techniques to recover from errors: End-to-End retransmission and Hop-by-Hop retransmission. End-to-End retransmission can save power but can also incur longer delays for recovery by checking errors only at the destination. In comparison, Hop-by-Hop retransmission checks for errors at every router and has better performance at the expense of increased power overhead. We propose a novel retransmission scheme that employs feedback control theory to dynamically choose the time for error checking based on the performance requirements of the applications. Our scheme ensures that applications meet performance QoS and save power at the same time. Our experimental evaluation shows that, if a 10% slack in delay is allowed, our scheme can save as much as 80% of the power consumed by the underlying error control scheme.
AB - Performance and power consumption are important challenges faced by Network-on-Chip (NoC) designers. The situation is exacerbated when error control techniques are employed to provide reliability, since such techniques can lead to extra power consumption and execution cycles. In many systems today, ECC codes are used for error detection. Once an error is detected, recovery schemes are invoked to correct it. In this paper, we focus on tuning error recovery schemes to explore performance, power and reliability tradeoffs. Previous reliability work targeting NoCs proposed two retransmission techniques to recover from errors: End-to-End retransmission and Hop-by-Hop retransmission. End-to-End retransmission can save power but can also incur longer delays for recovery by checking errors only at the destination. In comparison, Hop-by-Hop retransmission checks for errors at every router and has better performance at the expense of increased power overhead. We propose a novel retransmission scheme that employs feedback control theory to dynamically choose the time for error checking based on the performance requirements of the applications. Our scheme ensures that applications meet performance QoS and save power at the same time. Our experimental evaluation shows that, if a 10% slack in delay is allowed, our scheme can save as much as 80% of the power consumed by the underlying error control scheme.
UR - http://www.scopus.com/inward/record.url?scp=79959284687&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=79959284687&partnerID=8YFLogxK
U2 - 10.1109/ISQED.2011.5770773
DO - 10.1109/ISQED.2011.5770773
M3 - Conference contribution
AN - SCOPUS:79959284687
SN - 9781612849140
T3 - Proceedings of the 12th International Symposium on Quality Electronic Design, ISQED 2011
SP - 495
EP - 501
BT - Proceedings of the 12th International Symposium on Quality Electronic Design, ISQED 2011
T2 - 12th International Symposium on Quality Electronic Design, ISQED 2011
Y2 - 14 March 2011 through 16 March 2011
ER -