TY - GEN
T1 - Exploiting processor workload heterogeneity for reducing energy consumption in chip multiprocessors
AU - Kadayif, I.
AU - Kandemir, M.
AU - Kolcu, I.
PY - 2004/7/12
Y1 - 2004/7/12
N2 - Advances in semiconductor technology are enabling designs with several hundred million transistors. Since building sophisticated single processor based systems is a complex process from design, verification, and software development perspectives, the use of chip multiprocessing is inevitable in future microprocessors. In fact, the abundance of explicit loop-level parallelism in many embedded applications helps us identify chip multiprocessing as one of the most promising directions in designing systems for embedded applications. Another architectural trend that we observe in embedded systems, namely, multi-voltage processors, is driven by the need of reducing energy consumption during program execution. Practical implementations such as Transmeta's Crusoe and Intel's XScale tune processor voltage/frequency depending on current execution load. Considering these two trends, chip multiprocessing and voltage/frequency scaling, this paper presents an optimization strategy for an architecture that makes use of both chip parallelism and voltage scaling. In our proposal, the compiler takes advantage of heterogeneity in parallel execution between the loads of different processors and assigns different voltages/frequencies to different processors if doing so reduces energy consumption without increasing overall execution cycles significantly. Our experiments with a set of applications show that this optimization can bring large energy benefits without much performance loss.
AB - Advances in semiconductor technology are enabling designs with several hundred million transistors. Since building sophisticated single processor based systems is a complex process from design, verification, and software development perspectives, the use of chip multiprocessing is inevitable in future microprocessors. In fact, the abundance of explicit loop-level parallelism in many embedded applications helps us identify chip multiprocessing as one of the most promising directions in designing systems for embedded applications. Another architectural trend that we observe in embedded systems, namely, multi-voltage processors, is driven by the need of reducing energy consumption during program execution. Practical implementations such as Transmeta's Crusoe and Intel's XScale tune processor voltage/frequency depending on current execution load. Considering these two trends, chip multiprocessing and voltage/frequency scaling, this paper presents an optimization strategy for an architecture that makes use of both chip parallelism and voltage scaling. In our proposal, the compiler takes advantage of heterogeneity in parallel execution between the loads of different processors and assigns different voltages/frequencies to different processors if doing so reduces energy consumption without increasing overall execution cycles significantly. Our experiments with a set of applications show that this optimization can bring large energy benefits without much performance loss.
UR - http://www.scopus.com/inward/record.url?scp=3042662150&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=3042662150&partnerID=8YFLogxK
U2 - 10.1109/DATE.2004.1269048
DO - 10.1109/DATE.2004.1269048
M3 - Conference contribution
AN - SCOPUS:3042662150
SN - 0769520855
SN - 9780769520858
T3 - Proceedings - Design, Automation and Test in Europe Conference and Exhibition
SP - 1158
EP - 1163
BT - Proceedings - Design, Automation and Test in Europe Conference and Exhibition, DATE 04
A2 - Gielen, G.
A2 - Figueras, J.
T2 - Proceedings - Design, Automation and Test in Europe Conference and Exhibition, DATE 04
Y2 - 16 February 2004 through 20 February 2004
ER -