TY - GEN
T1 - Enhancing Inference Time and Memory Utilization for Machine Learning in Resource Constrained Internet of Battlefield Things
AU - Ghimire, Bimal
AU - Rawat, Danda B.
N1 - Funding Information:
This research was funded by the DoD Center of Excellence in AI and Machine Learning (CoE-AIML) at Howard University under Contract Number W911NF-20-2-0277 with the U.S. Army Research Laboratory and DoE/NNSA MSIPP grant. However, any opinion, finding, and conclusions or recommendations expressed in this document are those of the authors and should not be interpreted as necessarily representing the official policies, either expressed or implied, of the funding agencies.
Publisher Copyright:
© 2022 SPIE.
PY - 2022
Y1 - 2022
N2 - Convolution Neural Network (CNN) models have demonstrated remarkable success for many applications including computer vision in recent years. However, the increase in accuracy achieved so far is at the cost of memory and complex computations. The CNN models are growing deeper and wider making it difficult to be fit in a single device that has limited resources. Moreover, the inference time of such models is large enough to be unfit to apply for the real-time mission critical applications like the Internet of Battlefield Things (IoBT). In IoBT, where the unmanned aerial vehicles (UAVs) flying in the battlefield zone and capturing images, require accurate learning and immediate inference. It becomes problematic if the learning model does not fit in a single UAV when it has limited resources. Considering the aforementioned issues, in this paper, we study a formal approach to improve the inference time and memory utilization in resource constrained IoBT. We consider that multiple UAVs are involved in the inference process in which we apply spatially parallel convolution and pooling operations for all the convolution layers and pooling layers as well as model parallelism for fully connected (FC) layers. Finally, we present the numerical results for varying number of participating UAVs, input data/image sizes, and communication speeds.
AB - Convolution Neural Network (CNN) models have demonstrated remarkable success for many applications including computer vision in recent years. However, the increase in accuracy achieved so far is at the cost of memory and complex computations. The CNN models are growing deeper and wider making it difficult to be fit in a single device that has limited resources. Moreover, the inference time of such models is large enough to be unfit to apply for the real-time mission critical applications like the Internet of Battlefield Things (IoBT). In IoBT, where the unmanned aerial vehicles (UAVs) flying in the battlefield zone and capturing images, require accurate learning and immediate inference. It becomes problematic if the learning model does not fit in a single UAV when it has limited resources. Considering the aforementioned issues, in this paper, we study a formal approach to improve the inference time and memory utilization in resource constrained IoBT. We consider that multiple UAVs are involved in the inference process in which we apply spatially parallel convolution and pooling operations for all the convolution layers and pooling layers as well as model parallelism for fully connected (FC) layers. Finally, we present the numerical results for varying number of participating UAVs, input data/image sizes, and communication speeds.
UR - http://www.scopus.com/inward/record.url?scp=85146555032&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85146555032&partnerID=8YFLogxK
U2 - 10.1117/12.2621287
DO - 10.1117/12.2621287
M3 - Conference contribution
AN - SCOPUS:85146555032
T3 - Proceedings of SPIE - The International Society for Optical Engineering
BT - Artificial Intelligence and Machine Learning for Multi-Domain Operations Applications IV
A2 - Pham, Tien
A2 - Solomon, Latasha
PB - SPIE
T2 - Artificial Intelligence and Machine Learning for Multi-Domain Operations Applications IV 2022
Y2 - 6 June 2022 through 12 June 2022
ER -