Journal of Multimedia, Vol 4, No 5 (2009), 284-297, Oct 2009
doi:10.4304/jmm.4.5.284-297

Semantic Restructuring of Natural Language Image Captions to Enhance Image Retrieval

Kraisak Kesorn, Stefan Poslad

Abstract


The rapid growth in the volume of visual information can make the task of finding and accessing visual information of interest, overwhelming for users. Semantic analysis of image captions can be used in conjunction with image retrieval systems (IMR) to retrieve selected images more precisely. To do this, we first exploit a Natural Language Processing (NLP) framework in order to extract concepts from image captions. Next, an ontology-based framework is deployed in order to resolve natural language ambiguities. The novelty of the proposed framework is that the combination of LSI with the Ontology framework enables the combined framework to tolerate ambiguities and variations in the Ontology. A key feature is that the system can find indirectly relevant concepts in image captions and thus leverage these to represent the semantics of images at a higher level. Experimental results show that the use of LSI based NLP combined with an ontological framework significantly enhances image retrieval.



Keywords


image retrieval, latent semantic indexing, natural language processing, knowledge base, semantic model, Ontology

References



Full Text: PDF


Journal of Multimedia (JMM, ISSN 1796-2048)

Copyright @ 2006-2011 by ACADEMY PUBLISHER – All rights reserved.