Abstract
The k Nearest Neighbor (kNN) join operation associates each data object in one data set with its k nearest neighbors from the same or a different data set. The kNN join on high-dimensional data (high-dimensional kNN join) is a very expensive operation. Existing high-dimensional kNN join algorithms were designed for static data sets and therefore cannot handle updates efficiently. In this article, we propose a novel kNN join method, named kNNJoin+, which supports efficient incremental computation of kNN join results with updates on high-dimensional data. As a by-product, our method also provides answers for the reverse kNN queries with very little overhead. We have performed an extensive experimental study. The results show the effectiveness of kNNJoin+ for processing high-dimensional kNN joins in dynamic workloads.
Original language | English (US) |
---|---|
Pages (from-to) | 55-82 |
Number of pages | 28 |
Journal | GeoInformatica |
Volume | 14 |
Issue number | 1 |
DOIs | |
State | Published - 2010 |
All Science Journal Classification (ASJC) codes
- Geography, Planning and Development
- Information Systems
Keywords
- Optimization and performance
- Query optimization
- Storage & access