Shortlist Selection With Residual-Aware Distance Estimator For K-Nearest Neighbor Search
2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)(2016)
摘要
In this paper, we introduce a novel shortlist computation algorithm for approximate, high-dimensional nearest neighbor search. Our method relies on a novel distance estimator: the residual-aware distance estimator, that accounts for the residual distances of data points to their respective quantized centroids, and uses it for accurate shortlist computation. Furthermore, we perform the residual-aware distance estimation with little additional memory and computational cost through simple pre-computation methods for inverted index and multi-index schemes. Because it modifies the initial shortlist collection phase, our new algorithm is applicable to most inverted indexing methods that use vector quantization. We have tested the proposed method with the inverted index and multi-index on a diverse set of benchmarks including up to one billion data points with varying dimensions, and found that our method robustly improves the accuracy of shortlists (up to 127% relatively higher) over the state-of-the-art techniques with a comparable or even faster computational cost.
更多查看译文
关键词
shortlist selection,residual-aware distance estimator,k-nearest neighbor search,high-dimensional nearest neighbor search,quantized centroids,inverted index scheme,multiindex scheme
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络