TY - JOUR
T1 - Accelerating multiresolution gabor feature extraction for real time vision applications
AU - Cho, Yong Cheol Peter
AU - Chandramoorthy, Nandhini
AU - Irick, Kevin M.
AU - Narayanan, Vijaykrishnan
N1 - Funding Information:
Acknowledgments This work was supported in part by NSF grants 0916887, 1147388, 1213052 and 0903432. Opinions expressed in this paper are solely of the authors.
Funding Information:
This work was supported in part by the IT R&D program of MKE/ KEIT [10035152, Energy Scalable Vector Processor - Primary Technology]
PY - 2014/8
Y1 - 2014/8
N2 - Multiresolution Gabor filter banks are used for feature extraction in a variety of applications as Gabor filters have shown to be exceptional feature extractors with a close correspondence to the simple cells in the primary visual cortex (V1) of the brain. Yet applying the Gabor filter is a computationally intensive task. Most applications that utilize the Gabor feature space require real time results; however, the large quantity of computations involved has hindered systems from achieving real time performance. The natural solution for such compute intensive tasks is parallelization. FPGAs have emerged as attractive platforms for compute intensive signal processing applications due to their massively parallel computation resources as well as low power consumption and affordability. We present a configurable architecture for Gabor feature extraction on FPGA that enhances the resource utilization of the FPGA hardware fabric while maintaining a streaming data flow to yield exceptional performance. The increased resource utilization resulting from configurability, optimizations, and resource sharing allows for higher levels of parallelism to achieve real time feature extraction of high resolution images. Two architectures are introduced. The first is an architecture for multiresolution feature extraction with extensive resource sharing for enhanced resource utilization. The second is an architecture for many-orientation applications using a coarse to fine grain method to enhance resource utilization by reducing the number of filters applied at different orientations. Our results show that our multiresolution implementation achieves real-time performance on 2048×1526 images and exhibits 6X speed up over a GPU implementation while exhibiting energy efficiency with 0.4fps/W compared to the GPU that achieves 0.036fps/W.[1] The implementation for many-orientation applications using the coarse to fine grain method exhibits resource saving of at most 2√O for O number of orientations and higher, compared to a fully parallel architecture and 25× speedup compared to a GPU implementation for 16 orientations.
AB - Multiresolution Gabor filter banks are used for feature extraction in a variety of applications as Gabor filters have shown to be exceptional feature extractors with a close correspondence to the simple cells in the primary visual cortex (V1) of the brain. Yet applying the Gabor filter is a computationally intensive task. Most applications that utilize the Gabor feature space require real time results; however, the large quantity of computations involved has hindered systems from achieving real time performance. The natural solution for such compute intensive tasks is parallelization. FPGAs have emerged as attractive platforms for compute intensive signal processing applications due to their massively parallel computation resources as well as low power consumption and affordability. We present a configurable architecture for Gabor feature extraction on FPGA that enhances the resource utilization of the FPGA hardware fabric while maintaining a streaming data flow to yield exceptional performance. The increased resource utilization resulting from configurability, optimizations, and resource sharing allows for higher levels of parallelism to achieve real time feature extraction of high resolution images. Two architectures are introduced. The first is an architecture for multiresolution feature extraction with extensive resource sharing for enhanced resource utilization. The second is an architecture for many-orientation applications using a coarse to fine grain method to enhance resource utilization by reducing the number of filters applied at different orientations. Our results show that our multiresolution implementation achieves real-time performance on 2048×1526 images and exhibits 6X speed up over a GPU implementation while exhibiting energy efficiency with 0.4fps/W compared to the GPU that achieves 0.036fps/W.[1] The implementation for many-orientation applications using the coarse to fine grain method exhibits resource saving of at most 2√O for O number of orientations and higher, compared to a fully parallel architecture and 25× speedup compared to a GPU implementation for 16 orientations.
UR - http://www.scopus.com/inward/record.url?scp=84907599600&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84907599600&partnerID=8YFLogxK
U2 - 10.1007/s11265-014-0873-4
DO - 10.1007/s11265-014-0873-4
M3 - Article
AN - SCOPUS:84907599600
SN - 1939-8018
VL - 76
SP - 149
EP - 168
JO - Journal of Signal Processing Systems
JF - Journal of Signal Processing Systems
IS - 2
ER -