TY - GEN
T1 - Usas
T2 - 30th IEEE International Symposium on High-Performance Computer Architecture, HPCA 2024
AU - Mishra, Cyan Subhra
AU - Sampson, Jack
AU - Kandemir, Mahmut Taylan
AU - Narayanan, Vijaykrishnan
AU - Das, Chita R.
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Edge servers have recently become very popular for performing localized analytics, especially on video, as they reduce data traffic and protect privacy. However, due to their resource constraints, these servers often employ compressed models, which are typically prone to data drift. Consequently, for edge servers to provide cloud-comparable quality, they must also perform continuous learning to mitigate this drift. However, at expected deployment scales, performing continuous training on every edge server is not sustainable due to their aggregate power demands on grid supply and associated sustainability footprints. To address these challenges, we propose Us.as, an approach combining algorithmic adjustments, hardware-software co-design, and morphable acceleration hardware to enable the training of workloads on these edge servers to be powered by renewable, but intermittent, solar power that can sustainably scale alongside data sources. Our evaluation of Us.as on a real-world traffic dataset indicates that our continuous learning approach simultaneously improves both accuracy and efficiency: Us.as offers a 4.96% greater mean accuracy than prior approaches while our morphable accelerator that adapts to solar variance can save up to {234.95kWH, 2.63MWH}/year/edge-server compared to a {DNN accelerator, data center scale GPU}, respectively.
AB - Edge servers have recently become very popular for performing localized analytics, especially on video, as they reduce data traffic and protect privacy. However, due to their resource constraints, these servers often employ compressed models, which are typically prone to data drift. Consequently, for edge servers to provide cloud-comparable quality, they must also perform continuous learning to mitigate this drift. However, at expected deployment scales, performing continuous training on every edge server is not sustainable due to their aggregate power demands on grid supply and associated sustainability footprints. To address these challenges, we propose Us.as, an approach combining algorithmic adjustments, hardware-software co-design, and morphable acceleration hardware to enable the training of workloads on these edge servers to be powered by renewable, but intermittent, solar power that can sustainably scale alongside data sources. Our evaluation of Us.as on a real-world traffic dataset indicates that our continuous learning approach simultaneously improves both accuracy and efficiency: Us.as offers a 4.96% greater mean accuracy than prior approaches while our morphable accelerator that adapts to solar variance can save up to {234.95kWH, 2.63MWH}/year/edge-server compared to a {DNN accelerator, data center scale GPU}, respectively.
UR - http://www.scopus.com/inward/record.url?scp=85190311457&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85190311457&partnerID=8YFLogxK
U2 - 10.1109/HPCA57654.2024.00073
DO - 10.1109/HPCA57654.2024.00073
M3 - Conference contribution
AN - SCOPUS:85190311457
T3 - Proceedings - International Symposium on High-Performance Computer Architecture
SP - 891
EP - 907
BT - Proceedings - 2024 IEEE International Symposium on High-Performance Computer Architecture, HPCA 2024
PB - IEEE Computer Society
Y2 - 2 March 2024 through 6 March 2024
ER -