TY - GEN
T1 - A new privacy-preserving distributed k-clustering algorithm
AU - Jagannathan, Geetha
AU - Pillaipakkamnatt, Krishnan
AU - Wright, Rebecca N.
PY - 2006
Y1 - 2006
N2 - We present a simple I/O-efficient k-clustering algorithm that was designed with the goal of enabling a privacy-preserving version of the algorithm. Our experiments show that this algorithm produces cluster centers that are, on average, more accurate than the ones produced by the well known iterative k-means algorithm. We use our new algorithm as the basis for a communication-efficient privacy-preserving k-clustering protocol for databases that are horizontally partitioned between two parties. Unlike existing privacy-preserving protocols based on the k-means algorithm, this protocol does not reveal intermediate candidate cluster centers.
AB - We present a simple I/O-efficient k-clustering algorithm that was designed with the goal of enabling a privacy-preserving version of the algorithm. Our experiments show that this algorithm produces cluster centers that are, on average, more accurate than the ones produced by the well known iterative k-means algorithm. We use our new algorithm as the basis for a communication-efficient privacy-preserving k-clustering protocol for databases that are horizontally partitioned between two parties. Unlike existing privacy-preserving protocols based on the k-means algorithm, this protocol does not reveal intermediate candidate cluster centers.
UR - http://www.scopus.com/inward/record.url?scp=33745439412&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=33745439412&partnerID=8YFLogxK
U2 - 10.1137/1.9781611972764.47
DO - 10.1137/1.9781611972764.47
M3 - Conference contribution
AN - SCOPUS:33745439412
SN - 089871611X
SN - 9780898716115
T3 - Proceedings of the Sixth SIAM International Conference on Data Mining
SP - 494
EP - 498
BT - Proceedings of the Sixth SIAM International Conference on Data Mining
PB - Society for Industrial and Applied Mathematics
T2 - Sixth SIAM International Conference on Data Mining
Y2 - 20 April 2006 through 22 April 2006
ER -