Supervised Robust Discrete Multimodal Hashing for Cross-Media Retrieval

Submitted by grigby1 on Tue, 05/16/2017 - 11:30am

Title	Supervised Robust Discrete Multimodal Hashing for Cross-Media Retrieval
Publication Type	Conference Paper
Year of Publication	2016
Authors	Yan, Ting-Kun; Xu, Xin-Shun; Guo, Shanqing; Huang, Zi; Wang, Xiao-Lin
Conference Name	Proceedings of the 25th ACM International on Conference on Information and Knowledge Management
Publisher	ACM
Conference Location	New York, NY, USA
ISBN Number	978-1-4503-4073-1
Keywords	Algorithm, approximate nearest neighbor search, composability, cross-media retrieval, discrete hashing, hash algorithms, learning to hash, Metrics, multimodal hashing, nearest neighbor search, pubcrawl, Resiliency, Scalability
Abstract	Multimodal hashing techniques have recently received considerable attention because of their low storage cost and fast query speed in multimodal data retrieval. Many methods have been proposed, but several problems remain. Some methods use only a similarity matrix to learn hash functions, which discards useful information contained in the original data. Some relax the binary constraints, or split the learning of hash functions and binary codes into two independent stages, to avoid handling the discrete constraints on binary codes during optimization; this can produce large quantization error. Some are not robust to noise. All of these problems can degrade a model's performance. To address them, we propose a novel supervised hashing framework for cross-modal retrieval, Supervised Robust Discrete Multimodal Hashing (SRDMH). Specifically, SRDMH tries to make the final binary codes preserve the same label information as the original data, so that it can leverage more label information to supervise binary code learning. In addition, it learns hash functions and binary codes directly instead of relaxing the binary constraints, which avoids large quantization error. Moreover, to make the model robust and easy to solve, we further integrate a flexible l2,p loss with nonlinear kernel embedding and an intermediate representation of each instance. Finally, an alternating algorithm is proposed to solve the SRDMH optimization problem. Extensive experiments are conducted on three benchmark data sets. The results demonstrate that SRDMH outperforms or is comparable to several state-of-the-art methods on the cross-modal retrieval task.
URL	http://doi.acm.org/10.1145/2983323.2983743
DOI	10.1145/2983323.2983743
Citation Key	yan_supervised_2016
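
Two ideas named in the abstract can be illustrated with a small sketch: the l2,p loss that the abstract credits for robustness to noise, and the Hamming-distance ranking that gives binary codes their fast query speed. The sketch below is not the SRDMH objective or algorithm, which the abstract does not spell out. It assumes the common definition of the l2,p loss as the sum over rows of the row's Euclidean norm raised to the power p, and it assumes binary codes are stored as Python integers. The function names `l2p_loss` and `hamming_rank` and all numbers are hypothetical.

```python
import math


def l2p_loss(residual_rows, p):
    """Sum over rows of (Euclidean norm of the row) ** p.

    Assumed definition: this is ||E||_{2,p}^p for a residual matrix E
    given as a list of rows. With p = 2 it equals the squared Frobenius norm.
    """
    if p <= 0:
        raise ValueError("p must be positive")
    return sum(math.sqrt(sum(x * x for x in row)) ** p for row in residual_rows)


def hamming_rank(query_code, database_codes):
    """Return database indices sorted by Hamming distance to the query.

    Codes are assumed to be non-negative integers whose bits are the hash bits.
    """
    distances = [bin(query_code ^ code).count("1") for code in database_codes]
    return sorted(range(len(database_codes)), key=lambda i: distances[i])


# Hypothetical residuals: two small rows and one outlier row of norm 5.
residuals = [[0.1, 0.0], [0.0, 0.2], [3.0, 4.0]]
loss_p2 = l2p_loss(residuals, 2)  # outlier row contributes 5 ** 2 = 25
loss_p1 = l2p_loss(residuals, 1)  # outlier row contributes 5 ** 1 = 5

# Hypothetical 4-bit codes: rank three database items against one query.
ranking = hamming_rank(0b1011, [0b0100, 0b1011, 0b0011])
```

With a smaller p, a single noisy instance adds less to the total loss, which is the usual motivation for choosing p below 2. The ranking step uses only XOR and bit counting, so it stays cheap even for large databases.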
