Data driven multi-index hashing
ICIP(2013)
摘要
Binary representation for large scale nearest neighbor search received more and more concern recently. Although binary codes can be directly used as indices of the hash tables, correlations between the bits may lead to non-uniform codes distribution and reduce the performance of the hash table. In this paper, we propose a data driven multi-index hashing method for exact nearest neighbor search in Hamming space. By exploring the statistics properties of the dataset, we can separate the correlated bits into different segments during the process of building multiple hash tables, and thus make binary codes distributed as uniformly as possible in each hash table. Experiments conducted on a huge amount of binary codes extracted from the UK Bench dataset show that our method can achieve significant acceleration in searching speed for large scale dataset.
更多查看译文
关键词
clustering algorithms,image representation,binary codes,data driven multiindex hashing,statistical analysis,statistics properties,hash tables,searching speed,indexing,search problems,nearest neighbor search,binary representation,uk bench dataset,large scale nearest neighbor search,nonuniform codes distribution
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络