TY - JOUR
T1 - Automatic linguistic indexing of pictures by a statistical modeling approach
AU - Li, Jia
AU - Wang, James Z.
N1 - Funding Information:
The material about the SIMPLIcity system was based upon work supported by the US National Science Foundation under grant no. IIS-9817511 and, in part, by Stanford University. Work on the ALIP system is supported by the US National Science Foundation under grant no. IIS-0219272, the Pennsylvania State University, the PNC Foundation, and Sun Microsystems under grant EDUD-7824-010456-US.
PY - 2003/9
Y1 - 2003/9
N2 - Automatic linguistic indexing of pictures is an important but highly challenging problem for researchers in computer vision and content-based image retrieval. In this paper, we introduce a statistical modeling approach to this problem. Categorized images are used to train a dictionary of hundreds of statistical models each representing a concept. Images of any given concept are regarded as instances of a stochastic process that characterizes the concept. To measure the extent of association between an image and the textual description of a concept, the likelihood of the occurrence of the image based on the characterizing stochastic process is computed. A high likelihood indicates a strong association. In our experimental implementation, we focus on a particular group of stochastic processes, that is, the two-dimensional multiresolution hidden Markov models (2D MHMMs). We implemented and tested our ALIP (Automatic Linguistic Indexing of Pictures) system on a photographic image database of 600 different concepts, each with about 40 training images. The system is evaluated quantitatively using more than 4,600 images outside the training database and compared with a random annotation scheme. Experiments have demonstrated the good accuracy of the system and its high potential in linguistic indexing of photographic images.
AB - Automatic linguistic indexing of pictures is an important but highly challenging problem for researchers in computer vision and content-based image retrieval. In this paper, we introduce a statistical modeling approach to this problem. Categorized images are used to train a dictionary of hundreds of statistical models each representing a concept. Images of any given concept are regarded as instances of a stochastic process that characterizes the concept. To measure the extent of association between an image and the textual description of a concept, the likelihood of the occurrence of the image based on the characterizing stochastic process is computed. A high likelihood indicates a strong association. In our experimental implementation, we focus on a particular group of stochastic processes, that is, the two-dimensional multiresolution hidden Markov models (2D MHMMs). We implemented and tested our ALIP (Automatic Linguistic Indexing of Pictures) system on a photographic image database of 600 different concepts, each with about 40 training images. The system is evaluated quantitatively using more than 4,600 images outside the training database and compared with a random annotation scheme. Experiments have demonstrated the good accuracy of the system and its high potential in linguistic indexing of photographic images.
UR - https://www.scopus.com/pages/publications/0141836773
UR - https://www.scopus.com/inward/citedby.url?scp=0141836773&partnerID=8YFLogxK
U2 - 10.1109/TPAMI.2003.1227984
DO - 10.1109/TPAMI.2003.1227984
M3 - Article
AN - SCOPUS:0141836773
SN - 0162-8828
VL - 25
SP - 1075
EP - 1088
JO - IEEE Transactions on Pattern Analysis and Machine Intelligence
JF - IEEE Transactions on Pattern Analysis and Machine Intelligence
IS - 9
ER -