Foundations for center-based clustering: Worst-case approximations and modern developments

Pranjal Awasthi, Maria Florina Balcan

Research output: Chapter in Book/Report/Conference proceedingChapter

3 Scopus citations

Abstract

In the first part of this chapter, we detail center-based clustering methods, namely methods based on finding a “best” set of center points and then assigning data points to their nearest center. In particular, we focus on k-means and k-median clustering which are two of the most widely used clustering objectives. We describe popular heuristics for these methods and theoretical guarantees associated with them.We also describe how to design worst case approximately optimal algorithms for these problems. In the second part of the chapter, we describe recent work on how to improve on these worst case algorithms even further by using insights from the nature of real world clustering problems and datasets. Finally, we also summarize theoretical work on clustering data generated from mixture models such as a mixture of Gaussians.

Original languageEnglish (US)
Title of host publicationHandbook of Cluster Analysis
PublisherCRC Press
Pages67-102
Number of pages36
ISBN (Electronic)9781466551893
ISBN (Print)9781466551886
DOIs
StatePublished - Jan 1 2015
Externally publishedYes

All Science Journal Classification (ASJC) codes

  • General Computer Science
  • General Economics, Econometrics and Finance
  • General Business, Management and Accounting
  • General Mathematics

Fingerprint

Dive into the research topics of 'Foundations for center-based clustering: Worst-case approximations and modern developments'. Together they form a unique fingerprint.

Cite this