An FPGA implementation of information theoretic visual-saliency system and its optimization

Sungmin Bae, Yong Cheol Peter Cho, Sungho Park, Kevin M. Irick, Yongseok Jin, Vijaykrishnan Narayanan

Research output: Chapter in Book/Report/Conference proceedingConference contribution

26 Scopus citations

Abstract

Biological vision systems use saliency-based visual attention mechanisms to limit higher-level vision processing on the most visually-salient subsets of an input image. Among several computational models that capture the visual-saliency in biological system, an information theoretic AIM(Attention based on Information Maximization) algorithm has been demonstrated to predict human gaze patterns better than other existing models. We present an FPGA based implementation of this computationally intensive AIM algorithm to support embedded vision applications. Our implementation provides performance of processing about 4M pixels/sec for 25 basis functions with a convolution kernel size of 21 by 21 for each of the R, G, and B color-channels, when implemented on a Virtex-6 LX240T. We also provide an optimization aimed at controlling the trade-off between power consumption and latency, and performance comparisons with a GPU implementation.

Original languageEnglish (US)
Title of host publicationProceedings - IEEE International Symposium on Field-Programmable Custom Computing Machines, FCCM 2011
Pages41-48
Number of pages8
DOIs
StatePublished - 2011
Event19th IEEE International Symposium on Field-Programmable Custom Computing Machines, FCCM 2011 - Salt Lake City, UT, United States
Duration: May 1 2011May 3 2011

Publication series

NameProceedings - IEEE International Symposium on Field-Programmable Custom Computing Machines, FCCM 2011

Other

Other19th IEEE International Symposium on Field-Programmable Custom Computing Machines, FCCM 2011
Country/TerritoryUnited States
CitySalt Lake City, UT
Period5/1/115/3/11

All Science Journal Classification (ASJC) codes

  • Hardware and Architecture

Fingerprint

Dive into the research topics of 'An FPGA implementation of information theoretic visual-saliency system and its optimization'. Together they form a unique fingerprint.

Cite this