TY - GEN
T1 - Enhancing semi-supervised clustering
T2 - KDD-2007: 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
AU - Tang, W.
AU - Wu, J.
AU - Xiong, H.
AU - Zhong, S.
PY - 2007
Y1 - 2007
N2 - Semi-supervised clustering employs limited supervision in the form of labeled instances or pairwise instance constraints to aid unsupervised clustering and often significantly improves the clustering performance. Despite the vast amount of expert knowledge spent on this problem, most existing work is not designed for handling high-dimensional sparse data. This paper thus fills this crucial void by developing a Semi-supervised Clustering method based on spheRical K-mEans via fEature projectioN (SCREEN). Specifically, we formulate the problem of constraint-guided feature projection, which can be nicely integrated with semi-supervised clustering algorithms and has the ability to effectively reduce data dimension. Indeed, our experimental results on several real-world data sets show that the SCREEN method can effectively deal with high-dimensional data and provides an appealing clustering performance.
AB - Semi-supervised clustering employs limited supervision in the form of labeled instances or pairwise instance constraints to aid unsupervised clustering and often significantly improves the clustering performance. Despite the vast amount of expert knowledge spent on this problem, most existing work is not designed for handling high-dimensional sparse data. This paper thus fills this crucial void by developing a Semi-supervised Clustering method based on spheRical K-mEans via fEature projectioN (SCREEN). Specifically, we formulate the problem of constraint-guided feature projection, which can be nicely integrated with semi-supervised clustering algorithms and has the ability to effectively reduce data dimension. Indeed, our experimental results on several real-world data sets show that the SCREEN method can effectively deal with high-dimensional data and provides an appealing clustering performance.
KW - Feature projection
KW - Pairwise instance constraints
KW - Semi-supervised clustering
UR - http://www.scopus.com/inward/record.url?scp=36849025479&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=36849025479&partnerID=8YFLogxK
U2 - 10.1145/1281192.1281268
DO - 10.1145/1281192.1281268
M3 - Conference contribution
AN - SCOPUS:36849025479
SN - 1595936092
SN - 9781595936097
T3 - Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
SP - 707
EP - 716
BT - KDD-2007
Y2 - 12 August 2007 through 15 August 2007
ER -