TY - GEN
T1 - Heuristic Approximation of Early-Stage CNN Data Representation for Vision Intelligence Systems
AU - Choi, Jinhang
AU - Sampson, Jack
AU - Narayanan, Vijaykrishnan
N1 - Funding Information:
This work was supported in part by NSF Expeditions in Computing Program: Visual Cortex on Silicon CCF 1317560, and Semiconductor Research Corporation (SRC) Center for Brain-inspired Computing Enabling Autonomous Intelligence (C-BRIC).
Publisher Copyright:
© 2018 IEEE.
PY - 2019/1/16
Y1 - 2019/1/16
N2 - Massive memory requirements and disruptive I/O traffic in Convolutional Neural Networks (CNNs) restrict hardware/software optimization for training and inference tasks using larger networks despite their success in improving the intelligence of systems into which they are integrated. Interestingly, it has been observed that the early CNN layers in vision tasks consistently produce edge-like feature representations that can be mimicked by traditional vision algorithms. In this work, we demonstrate how to exploit this to create common edge-like features for sharing among multiple CNNs, and how to heuristically approximate them in a reduced dimensionality. In our proposed approximation, the feature space of three representative CNNs decreases by 1.6×-5.1×, and the size of training dataset is halved. As a result, we enhance both inference throughput and training speed by 2×, while providing accuracies that are still close to the original versions. We anticipate that this approach will lead to new opportunities in distributed intelligence systems, and the technique for redesigning CNN models based on dimensional reduction of feature space is orthogonal to and compatible with many other existing hardware/software optimizations.
AB - Massive memory requirements and disruptive I/O traffic in Convolutional Neural Networks (CNNs) restrict hardware/software optimization for training and inference tasks using larger networks despite their success in improving the intelligence of systems into which they are integrated. Interestingly, it has been observed that the early CNN layers in vision tasks consistently produce edge-like feature representations that can be mimicked by traditional vision algorithms. In this work, we demonstrate how to exploit this to create common edge-like features for sharing among multiple CNNs, and how to heuristically approximate them in a reduced dimensionality. In our proposed approximation, the feature space of three representative CNNs decreases by 1.6×-5.1×, and the size of training dataset is halved. As a result, we enhance both inference throughput and training speed by 2×, while providing accuracies that are still close to the original versions. We anticipate that this approach will lead to new opportunities in distributed intelligence systems, and the technique for redesigning CNN models based on dimensional reduction of feature space is orthogonal to and compatible with many other existing hardware/software optimizations.
UR - http://www.scopus.com/inward/record.url?scp=85062208772&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85062208772&partnerID=8YFLogxK
U2 - 10.1109/ICCD.2018.00041
DO - 10.1109/ICCD.2018.00041
M3 - Conference contribution
AN - SCOPUS:85062208772
T3 - Proceedings - 2018 IEEE 36th International Conference on Computer Design, ICCD 2018
SP - 218
EP - 225
BT - Proceedings - 2018 IEEE 36th International Conference on Computer Design, ICCD 2018
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 36th International Conference on Computer Design, ICCD 2018
Y2 - 7 October 2018 through 10 October 2018
ER -