High-dimensional kNN joins with incremental updates

Cui Yu, Rui Zhang, Yaochun Huang, Hui Xiong

Research output: Contribution to journalArticlepeer-review

58 Scopus citations

Abstract

The k Nearest Neighbor (kNN) join operation associates each data object in one data set with its k nearest neighbors from the same or a different data set. The kNN join on high-dimensional data (high-dimensional kNN join) is a very expensive operation. Existing high-dimensional kNN join algorithms were designed for static data sets and therefore cannot handle updates efficiently. In this article, we propose a novel kNN join method, named kNNJoin+, which supports efficient incremental computation of kNN join results with updates on high-dimensional data. As a by-product, our method also provides answers for the reverse kNN queries with very little overhead. We have performed an extensive experimental study. The results show the effectiveness of kNNJoin+ for processing high-dimensional kNN joins in dynamic workloads.

Original languageEnglish (US)
Pages (from-to)55-82
Number of pages28
JournalGeoInformatica
Volume14
Issue number1
DOIs
StatePublished - 2010

All Science Journal Classification (ASJC) codes

  • Geography, Planning and Development
  • Information Systems

Keywords

  • Optimization and performance
  • Query optimization
  • Storage & access

Fingerprint

Dive into the research topics of 'High-dimensional kNN joins with incremental updates'. Together they form a unique fingerprint.

Cite this