Learning Binary Code Representations for Effective and Efficient Image Retrieval

Ozdemir, Bahadir

Learning Binary Code Representations for Effective and Efficient Image Retrieval

dc.contributor.advisor	Davis, Larry S	en_US
dc.contributor.author	Ozdemir, Bahadir	en_US
dc.contributor.department	Computer Science	en_US
dc.contributor.publisher	Digital Repository at the University of Maryland	en_US
dc.contributor.publisher	University of Maryland (College Park, Md.)	en_US
dc.date.accessioned	2016-09-03T05:31:50Z
dc.date.available	2016-09-03T05:31:50Z
dc.date.issued	2016	en_US
dc.description.abstract	The size of online image datasets is constantly increasing. Considering an image dataset with millions of images, image retrieval becomes a seemingly intractable problem for exhaustive similarity search algorithms. Hashing methods, which encodes high-dimensional descriptors into compact binary strings, have become very popular because of their high efficiency in search and storage capacity. In the first part, we propose a multimodal retrieval method based on latent feature models. The procedure consists of a nonparametric Bayesian framework for learning underlying semantically meaningful abstract features in a multimodal dataset, a probabilistic retrieval model that allows cross-modal queries and an extension model for relevance feedback. In the second part, we focus on supervised hashing with kernels. We describe a flexible hashing procedure that treats binary codes and pairwise semantic similarity as latent and observed variables, respectively, in a probabilistic model based on Gaussian processes for binary classification. We present a scalable inference algorithm with the sparse pseudo-input Gaussian process (SPGP) model and distributed computing. In the last part, we define an incremental hashing strategy for dynamic databases where new images are added to the databases frequently. The method is based on a two-stage classification framework using binary and multi-class SVMs. The proposed method also enforces balance in binary codes by an imbalance penalty to obtain higher quality binary codes. We learn hash functions by an efficient algorithm where the NP-hard problem of finding optimal binary codes is solved via cyclic coordinate descent and SVMs are trained in a parallelized incremental manner. For modifications like adding images from an unseen class, we propose an incremental procedure for effective and efficient updates to the previous hash functions. Experiments on three large-scale image datasets demonstrate that the incremental strategy is capable of efficiently updating hash functions to the same retrieval performance as hashing from scratch.	en_US
dc.identifier	https://doi.org/10.13016/M2C214
dc.identifier.uri	http://hdl.handle.net/1903/18533
dc.language.iso	en	en_US
dc.subject.pqcontrolled	Computer science	en_US
dc.subject.pquncontrolled	Binary Codes	en_US
dc.subject.pquncontrolled	Gaussian Process	en_US
dc.subject.pquncontrolled	Hashing	en_US
dc.subject.pquncontrolled	Image Retrieval	en_US
dc.subject.pquncontrolled	Indian Buffet Process	en_US
dc.subject.pquncontrolled	Online Learning	en_US
dc.title	Learning Binary Code Representations for Effective and Efficient Image Retrieval	en_US
dc.type	Dissertation	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Ozdemir_umd_0117E_17230.pdf
Size:: 19.27 MB
Format:: Adobe Portable Document Format

Download

Collections

UMD Theses and Dissertations
Computer Science Theses and Dissertations