Abstract
Nearest Neighbor search has been well solved in low-dimensional space, but is challenging in high-dimensional space due to the curse of dimensionality. As a trade-off between efficiency and result accuracy, a variety of c-approximate nearest neighbor (c-ANN) algorithms have been proposed to return a c-approximate NN with confident at least δ. We observe that existing c-ANN search algorithms have some limitations on I/O efficiency when their indexes are resided on the external memory, which is critical for handling large scale high-dimensional data. In this paper, we introduce an incremental search based c-ANN search algorithm, named I-LSH. Unlike the previous LSH methods, which expand the bucket width in an exponential way, I-LSH adopts a more natural search strategy to incrementally access the hash values of the objects. We provide rigorous theoretical analysis to underpin our incremental search strategy. Our comprehensive experiment results show that, compared with state-of-the-art I/O efficient c-ANN techniques, our algorithm can achieve much better I/O efficiency under the same theoretical guarantee.
| Original language | English |
|---|---|
| Title of host publication | Proceedings - 2019 IEEE 35th International Conference on Data Engineering, ICDE 2019 |
| Publisher | IEEE Computer Society |
| Pages | 1670-1673 |
| Number of pages | 4 |
| ISBN (Electronic) | 9781538674741 |
| DOIs | |
| Publication status | Published - Apr 2019 |
| Externally published | Yes |
| Event | 35th IEEE International Conference on Data Engineering, ICDE 2019 - Macau, China Duration: 8 Apr 2019 → 11 Apr 2019 |
Publication series
| Name | Proceedings - International Conference on Data Engineering |
|---|---|
| Volume | 2019-April |
| ISSN (Print) | 1084-4627 |
Conference
| Conference | 35th IEEE International Conference on Data Engineering, ICDE 2019 |
|---|---|
| Country/Territory | China |
| City | Macau |
| Period | 8/04/19 → 11/04/19 |
Bibliographical note
Publisher Copyright:© 2019 IEEE.
Keywords
- Cnn
- Lsh
- Nns