Multi-Modal Image Retrieval for Complex Queries Using Small Codes


Siddiquie, B., White, B., Sharma, A., & Davis, L. S. (2014, 1-4 April). Multi-modal image retrieval for complex queries using small codes. Paper presented at the International Conference on Multimedia Retrieval (ICMR’14), Glasgow, United Kingdom.


We propose a unified framework for image retrieval capable of handling complex and descriptive queries of multiple modalities in a scalable manner. A novel aspect of our approach is that it supports query specification in terms of objects, attributes and spatial relationships, thereby allowing for substantially more complex and descriptive queries. We allow these complex queries to be specified in three different modalities: images, sketches and structured textual descriptions. Furthermore, we propose a multi-modal hashing algorithm that maps queries of different modalities to the same binary representation, enabling efficient and scalable image retrieval based on multi-modal queries. Extensive experimental evaluation shows that our approach outperforms state-of-the-art image retrieval and hashing techniques on the MSRC and SUN09 datasets by about 100%, while its performance on a dataset of 1M images from Flickr demonstrates its scalability.
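To make the idea of retrieval with small binary codes concrete, the sketch below hashes feature vectors to short codes and ranks database items by Hamming distance. It is not the multi-modal hashing algorithm proposed in the paper: it substitutes plain random-hyperplane hashing, and it assumes that every query, whatever its modality, has already been converted to a feature vector in one shared space. All names (`make_hyperplanes`, `encode`, `hamming`, `retrieve`) and the toy data are hypothetical.

```python
import random


def make_hyperplanes(dim, n_bits, seed=0):
    """Draw n_bits random hyperplanes (assumed stand-in for a learned hash)."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(n_bits)]


def encode(vector, hyperplanes):
    """Pack one bit per hyperplane into an int: 1 if the projection is >= 0."""
    code = 0
    for plane in hyperplanes:
        dot = sum(w * x for w, x in zip(plane, vector))
        code = (code << 1) | (1 if dot >= 0 else 0)
    return code


def hamming(a, b):
    """Number of bit positions in which two codes differ."""
    return bin(a ^ b).count("1")


def retrieve(query_code, database_codes, k=3):
    """Return indices of the k database codes closest to the query code."""
    order = sorted(
        range(len(database_codes)),
        key=lambda i: hamming(query_code, database_codes[i]),
    )
    return order[:k]


# Toy data: three 4-dimensional feature vectors (purely illustrative).
planes = make_hyperplanes(dim=4, n_bits=16)
database = [
    [1.0, 0.0, 0.0, 0.0],
    [0.0, 1.0, 0.0, 0.0],
    [-1.0, 0.0, 0.0, 0.0],
]
db_codes = [encode(v, planes) for v in database]

# A query that, by assumption, was mapped into the same feature space.
query = [1.0, 0.0, 0.0, 0.0]
top = retrieve(encode(query, planes), db_codes, k=1)
```

Because the codes are small integers, comparing a query against a database entry costs one XOR and a bit count, which is what makes this style of lookup practical at the scale of millions of images.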
