Spatial neighborhood based anomaly detection in sensor datasets

Research output: Contribution to journalArticlepeer-review

16 Scopus citations

Abstract

Success of anomaly detection, similar to other spatial data mining techniques, relies on neighborhood definition. In this paper, we argue that the anomalous behavior of spatial objects in a neighborhood can be truly captured when both (a) spatial autocorrelation (similar behavior of nearby objects due to proximity) and (b) spatial heterogeneity (distinct behavior of nearby objects due to difference in the underlying processes in the region) are taken into consideration for the neighborhood definition. Our approach begins by generating micro neighborhoods around spatial objects encompassing all the information about a spatial object. We selectively merge these based on spatial relationships accounting for autocorrelation and inferential relationships accounting for heterogeneity, forming macro neighborhoods. In such neighborhoods, we then identify (i) spatio-temporal outliers, where individual sensor readings are anomalous, (ii) spatial outliers, where the entire sensor is an anomaly, and (iii) spatio-temporally coalesced outliers, where a group of spatio-temporal outliers in the macro neighborhood are separated by a small time lag indicating the traversal of the anomaly. We demonstrate the effectiveness of our approach in neighborhood formation and anomaly detection with experimental results in (i) water monitoring and (ii) highway traffic monitoring sensor datasets. We also compare the results of our approach with an existing approach for spatial anomaly detection.

Original languageEnglish (US)
Pages (from-to)221-258
Number of pages38
JournalData Mining and Knowledge Discovery
Volume20
Issue number2
DOIs
StatePublished - Mar 2010

All Science Journal Classification (ASJC) codes

  • Information Systems
  • Computer Science Applications
  • Computer Networks and Communications

Keywords

  • Outlier detection
  • Sensors
  • Spatial neighborhood

Fingerprint

Dive into the research topics of 'Spatial neighborhood based anomaly detection in sensor datasets'. Together they form a unique fingerprint.

Cite this