I-LSH: I/O efficient c-approximate nearest neighbor search in high-dimensional space

Wanqi Liu, Hanchen Wang, Ying Zhang, Wei Wang, Lu Qin

Research output: Chapter in Book/Conference Proceeding/ReportConference Paper published in a bookpeer-review

42 Citations (Scopus)

Abstract

Nearest Neighbor search has been well solved in low-dimensional space, but is challenging in high-dimensional space due to the curse of dimensionality. As a trade-off between efficiency and result accuracy, a variety of c-approximate nearest neighbor (c-ANN) algorithms have been proposed to return a c-approximate NN with confident at least δ. We observe that existing c-ANN search algorithms have some limitations on I/O efficiency when their indexes are resided on the external memory, which is critical for handling large scale high-dimensional data. In this paper, we introduce an incremental search based c-ANN search algorithm, named I-LSH. Unlike the previous LSH methods, which expand the bucket width in an exponential way, I-LSH adopts a more natural search strategy to incrementally access the hash values of the objects. We provide rigorous theoretical analysis to underpin our incremental search strategy. Our comprehensive experiment results show that, compared with state-of-the-art I/O efficient c-ANN techniques, our algorithm can achieve much better I/O efficiency under the same theoretical guarantee.

Original languageEnglish
Title of host publicationProceedings - 2019 IEEE 35th International Conference on Data Engineering, ICDE 2019
PublisherIEEE Computer Society
Pages1670-1673
Number of pages4
ISBN (Electronic)9781538674741
DOIs
Publication statusPublished - Apr 2019
Externally publishedYes
Event35th IEEE International Conference on Data Engineering, ICDE 2019 - Macau, China
Duration: 8 Apr 201911 Apr 2019

Publication series

NameProceedings - International Conference on Data Engineering
Volume2019-April
ISSN (Print)1084-4627

Conference

Conference35th IEEE International Conference on Data Engineering, ICDE 2019
Country/TerritoryChina
CityMacau
Period8/04/1911/04/19

Bibliographical note

Publisher Copyright:
© 2019 IEEE.

Keywords

  • Cnn
  • Lsh
  • Nns

Fingerprint

Dive into the research topics of 'I-LSH: I/O efficient c-approximate nearest neighbor search in high-dimensional space'. Together they form a unique fingerprint.

Cite this