TY - GEN
T1 - Performance Evaluation of Video Analytics Workloads on Emerging Processing-In-Memory Architectures
AU - Challapalle, Nagadastagiri
AU - Narayanan, Vijaykrishnan
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - Deep learning based artificial intelligence algorithms are widely deployed in the video analytics pipelines as they drastically reduce the need for manual analysis and achieve human-like accuracy. However, they have high compute memory/storage requirements due to ever increasing model architecture size and large volumes of data. Processing-in-memory architectures are gaining prominence for efficient execution of deep learning workloads as they reduce the data movement bottlenecks by moving compute closer to the data. In this work, we present the system level analysis of processing-in-memory architectures across the memory hierarchy for the execution of deep learning algorithms in the video analytics workloads using the proposed SysPIM methodology. We compare processing-in-memory architectures at cache memory, main memory, and non-volatile memory in terms of their execution latency, energy consumption, and overall data movement for representative video analytics workloads.
AB - Deep learning based artificial intelligence algorithms are widely deployed in the video analytics pipelines as they drastically reduce the need for manual analysis and achieve human-like accuracy. However, they have high compute memory/storage requirements due to ever increasing model architecture size and large volumes of data. Processing-in-memory architectures are gaining prominence for efficient execution of deep learning workloads as they reduce the data movement bottlenecks by moving compute closer to the data. In this work, we present the system level analysis of processing-in-memory architectures across the memory hierarchy for the execution of deep learning algorithms in the video analytics workloads using the proposed SysPIM methodology. We compare processing-in-memory architectures at cache memory, main memory, and non-volatile memory in terms of their execution latency, energy consumption, and overall data movement for representative video analytics workloads.
UR - http://www.scopus.com/inward/record.url?scp=85140880366&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85140880366&partnerID=8YFLogxK
U2 - 10.1109/ISVLSI54635.2022.00040
DO - 10.1109/ISVLSI54635.2022.00040
M3 - Conference contribution
AN - SCOPUS:85140880366
T3 - Proceedings of IEEE Computer Society Annual Symposium on VLSI, ISVLSI
SP - 158
EP - 163
BT - Proceedings - 2022 IEEE Computer Society Annual Symposium on VLSI, ISVLSI 2022
PB - IEEE Computer Society
T2 - 2022 IEEE Computer Society Annual Symposium on VLSI, ISVLSI 2022
Y2 - 4 July 2022 through 6 July 2022
ER -