TY - JOUR
T1 - Face recognition
T2 - A convolutional neural-network approach
AU - Lawrence, Steve
AU - Giles, C. Lee
AU - Tsoi, Ah Chung
AU - Back, Andrew D.
N1 - Funding Information:
Manuscript received January 1, 1996; revised June 13, 1996. This work was supported in part by the Australian Research Council (ACT) and the Australian Telecommunications and Electronics Research Board (SL). S. Lawrence is with the NEC Research Institute, Princeton, NJ 08540 USA. He is also with the Department of Electrical and Computer Engineering, University of Queensland, St. Lucia, Australia. C. L. Giles is with the NEC Research Institute, Princeton, NJ 08540 USA. He is also with the Institute for Advanced Computer Studies, University of Maryland, College Park, MD 20742 USA. A. C. Tsoi and A. D. Back are with the Department of Electrical and Computer Engineering, University of Queensland, St. Lucia, Australia. Publisher Item Identifier S 1045-9227(97)00234-8. 1Physiological or behavioral characteristics which uniquely identify us.
PY - 1997
Y1 - 1997
N2 - Faces represent complex multidimensional meaningful visual stimuli and developing a computational model for face recognition is difficult. We present a hybrid neural-network solution which compares favorably with other methods. The system combines local image sampling, a self-organizing map (SOM) neural network, and a convolutional neural network. The SOM provides a quantization of the image samples into a topological space where inputs that are nearby in the original space are also nearby in the output space, thereby providing dimensionality reduction and invariance to minor changes in the image sample, and the convolutional neural network provides for partial invariance to translation, rotation, scale, and deformation. The convolutional network extracts successively larger features in a hierarchical set of layers. We present results using the Karhunen-Loève (KL) transform in place of the SOM, and a multilayer perceptron (MLP) in place of the convolutional network. The KL transform performs almost as well (5.3% error versus 3.8%). The MLP performs very poorly (40% error versus 3.8%). The method is capable of rapid classification, requires only fast approximate normalization and preprocessing, and consistently exhibits better classification performance than the eigenfaces approach on the database considered as the number of images per person in the training database is varied from one to five. With five images per person the proposed method and eigenfaces result in 3.8% and 10.5% error, respectively. The recognizer provides a measure of confidence in its output and classification error approaches zero when rejecting as few as 10% of the examples. We use a database of 400 images of 40 individuals which contains quite a high degree of variability in expression, pose, and facial details. We analyze computational complexity and discuss how new classes could be added to the trained recognizer.
AB - Faces represent complex multidimensional meaningful visual stimuli and developing a computational model for face recognition is difficult. We present a hybrid neural-network solution which compares favorably with other methods. The system combines local image sampling, a self-organizing map (SOM) neural network, and a convolutional neural network. The SOM provides a quantization of the image samples into a topological space where inputs that are nearby in the original space are also nearby in the output space, thereby providing dimensionality reduction and invariance to minor changes in the image sample, and the convolutional neural network provides for partial invariance to translation, rotation, scale, and deformation. The convolutional network extracts successively larger features in a hierarchical set of layers. We present results using the Karhunen-Loève (KL) transform in place of the SOM, and a multilayer perceptron (MLP) in place of the convolutional network. The KL transform performs almost as well (5.3% error versus 3.8%). The MLP performs very poorly (40% error versus 3.8%). The method is capable of rapid classification, requires only fast approximate normalization and preprocessing, and consistently exhibits better classification performance than the eigenfaces approach on the database considered as the number of images per person in the training database is varied from one to five. With five images per person the proposed method and eigenfaces result in 3.8% and 10.5% error, respectively. The recognizer provides a measure of confidence in its output and classification error approaches zero when rejecting as few as 10% of the examples. We use a database of 400 images of 40 individuals which contains quite a high degree of variability in expression, pose, and facial details. We analyze computational complexity and discuss how new classes could be added to the trained recognizer.
UR - http://www.scopus.com/inward/record.url?scp=0030737097&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0030737097&partnerID=8YFLogxK
U2 - 10.1109/72.554195
DO - 10.1109/72.554195
M3 - Article
C2 - 18255614
AN - SCOPUS:0030737097
SN - 1045-9227
VL - 8
SP - 98
EP - 113
JO - IEEE Transactions on Neural Networks
JF - IEEE Transactions on Neural Networks
IS - 1
ER -