M-BIRCH: An online clustering approach for computer vision applications

Siddharth K. Madan, Kristin J. Dana

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Scopus citations

Abstract

We adapt a classic online clustering algorithm called Balanced Iterative Reducing and Clustering using Hierarchies (BIRCH), to incrementally cluster large datasets of features commonly used in multimedia and computer vision. We call the adapted version modified-BIRCH (m-BIRCH). The algorithm uses only a fraction of the dataset memory to perform clustering, and updates the clustering decisions when new data comes in. Modifications made in m-BIRCH enable data driven parameter selection and effectively handle varying density regions in the feature space. Data driven parameter selection automatically controls the level of coarseness of the data summarization. Effective handling of varying density regions is necessary to well represent the different density regions in data summarization. We use m-BIRCH to cluster 840K color SIFT descriptors, and 60K outlier corrupted grayscale patches. We use the algorithm to cluster datasets consisting of challenging non-convex clustering patterns. Our implementation of the algorithm provides an useful clustering tool and is made publicly available.

Original languageEnglish (US)
Title of host publicationProceedings of SPIE-IS and T Electronic Imaging - Imaging and Multimedia Analytics in a Web and Mobile World 2015
EditorsJan P. Allebach, Zhigang Fan, Qian Lin
PublisherSPIE
ISBN (Electronic)9781628414981
DOIs
StatePublished - 2015
EventImaging and Multimedia Analytics in a Web and Mobile World 2015 - San Francisco, United States
Duration: Feb 11 2015Feb 12 2015

Publication series

NameProceedings of SPIE - The International Society for Optical Engineering
Volume9408
ISSN (Print)0277-786X
ISSN (Electronic)1996-756X

Other

OtherImaging and Multimedia Analytics in a Web and Mobile World 2015
Country/TerritoryUnited States
CitySan Francisco
Period2/11/152/12/15

All Science Journal Classification (ASJC) codes

  • Electronic, Optical and Magnetic Materials
  • Condensed Matter Physics
  • Computer Science Applications
  • Applied Mathematics
  • Electrical and Electronic Engineering

Keywords

  • Automatic Threshold Selection
  • BIRCH
  • Computer Vision
  • Online clustering
  • Outlier Detection

Fingerprint

Dive into the research topics of 'M-BIRCH: An online clustering approach for computer vision applications'. Together they form a unique fingerprint.

Cite this