Modified balanced iterative reducing and clustering using hierarchies (m-BIRCH) for visual clustering

Siddharth Madan, Kristin J. Dana

Research output: Contribution to journalArticlepeer-review

34 Scopus citations

Abstract

In modern visual clustering applications where datasets are large and updates with new data may be ongoing, methods of online clustering are extremely important. Online clustering algorithms incrementally cluster the data points, use a fraction of the dataset memory, and update the clustering decisions when new data comes in. In this paper we adapt a classic online clustering algorithm called balanced iterative reducing and clustering using hierarchies (BIRCH) to incrementally cluster large datasets of features commonly used in visual clustering, e.g., 840 K color SIFT descriptors, 1.09 million color patches, 60 K outlier corrupted grayscale patches, and 700 K grayscale SIFT descriptors. We use the algorithm to cluster datasets consisting of non-convex clusters, e.g., Hopkins 155 3D motion segmentation dataset. We call the adapted version modified-BIRCH (m-BIRCH). BIRCH was originally developed by the database management community, but has not been used in computer vision. Modifications made in m-BIRCH enable data-driven parameter selection and effectively handle varying density regions in the feature space. Data-driven parameter selection automatically controls the level of coarseness of the data summarization. Effective handling of varying density regions is necessary to well represent the different density regions in the data summarization. Our implementation of the algorithm provides a useful clustering tool and is made publicly available.

Original languageEnglish (US)
Pages (from-to)1023-1040
Number of pages18
JournalPattern Analysis and Applications
Volume19
Issue number4
DOIs
StatePublished - Nov 1 2016

All Science Journal Classification (ASJC) codes

  • Computer Vision and Pattern Recognition
  • Artificial Intelligence

Keywords

  • Automatic threshold selection
  • BIRCH
  • Computer vision
  • Online clustering
  • Outlier detection

Fingerprint

Dive into the research topics of 'Modified balanced iterative reducing and clustering using hierarchies (m-BIRCH) for visual clustering'. Together they form a unique fingerprint.

Cite this