TY - JOUR
T1 - Spatial neighborhood based anomaly detection in sensor datasets
AU - Janeja, Vandana P.
AU - Adam, Nabil R.
AU - Atluri, Vijayalakshmi
AU - Vaidya, Jaideep
N1 - Funding Information:
This work is supported in part by the National Science Foundation under grants IIS-0306838 and CNS-0746943.
PY - 2010/3
Y1 - 2010/3
N2 - Success of anomaly detection, similar to other spatial data mining techniques, relies on neighborhood definition. In this paper, we argue that the anomalous behavior of spatial objects in a neighborhood can be truly captured when both (a) spatial autocorrelation (similar behavior of nearby objects due to proximity) and (b) spatial heterogeneity (distinct behavior of nearby objects due to difference in the underlying processes in the region) are taken into consideration for the neighborhood definition. Our approach begins by generating micro neighborhoods around spatial objects encompassing all the information about a spatial object. We selectively merge these based on spatial relationships accounting for autocorrelation and inferential relationships accounting for heterogeneity, forming macro neighborhoods. In such neighborhoods, we then identify (i) spatio-temporal outliers, where individual sensor readings are anomalous, (ii) spatial outliers, where the entire sensor is an anomaly, and (iii) spatio-temporally coalesced outliers, where a group of spatio-temporal outliers in the macro neighborhood are separated by a small time lag indicating the traversal of the anomaly. We demonstrate the effectiveness of our approach in neighborhood formation and anomaly detection with experimental results in (i) water monitoring and (ii) highway traffic monitoring sensor datasets. We also compare the results of our approach with an existing approach for spatial anomaly detection.
AB - Success of anomaly detection, similar to other spatial data mining techniques, relies on neighborhood definition. In this paper, we argue that the anomalous behavior of spatial objects in a neighborhood can be truly captured when both (a) spatial autocorrelation (similar behavior of nearby objects due to proximity) and (b) spatial heterogeneity (distinct behavior of nearby objects due to difference in the underlying processes in the region) are taken into consideration for the neighborhood definition. Our approach begins by generating micro neighborhoods around spatial objects encompassing all the information about a spatial object. We selectively merge these based on spatial relationships accounting for autocorrelation and inferential relationships accounting for heterogeneity, forming macro neighborhoods. In such neighborhoods, we then identify (i) spatio-temporal outliers, where individual sensor readings are anomalous, (ii) spatial outliers, where the entire sensor is an anomaly, and (iii) spatio-temporally coalesced outliers, where a group of spatio-temporal outliers in the macro neighborhood are separated by a small time lag indicating the traversal of the anomaly. We demonstrate the effectiveness of our approach in neighborhood formation and anomaly detection with experimental results in (i) water monitoring and (ii) highway traffic monitoring sensor datasets. We also compare the results of our approach with an existing approach for spatial anomaly detection.
KW - Outlier detection
KW - Sensors
KW - Spatial neighborhood
UR - http://www.scopus.com/inward/record.url?scp=77649274106&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=77649274106&partnerID=8YFLogxK
U2 - 10.1007/s10618-009-0147-0
DO - 10.1007/s10618-009-0147-0
M3 - Article
AN - SCOPUS:77649274106
SN - 1384-5810
VL - 20
SP - 221
EP - 258
JO - Data Mining and Knowledge Discovery
JF - Data Mining and Knowledge Discovery
IS - 2
ER -