TY - GEN
T1 - Mining quantitative maximal hyperclique patterns
T2 - 10th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining, PAKDD 2006
AU - Huang, Yaochun
AU - Xiong, Hui
AU - Wu, Weili
AU - Sung, Sam Y.
PY - 2006
Y1 - 2006
N2 - Hyperclique patterns are groups of objects which are strongly related to each other. Indeed, the objects in a hyperclique pattern have a guaranteed level of global pairwise similarity to one another as measured by uncentered Pearson's correlation coefficient. Recent literature has provided the approach to discovering hyperclique patterns over data sets with binary attributes. In this paper, we introduce algorithms for mining maximal hyperclique patterns in large data sets containing quantitative attributes. An intuitive and simple solution is to partition quantitative attributes into binary attributes. However, there is potential information loss due to partitioning. Instead, our approach is based on a normalization scheme and can directly work on quantitative attributes. In addition, we adopt the algorithm structures of three popular association pattern mining algorithms and add a critical clique pruning technique. Finally, we compare the performance of these algorithms for finding quantitative maximal hyperclique patterns using some real-world data sets.
AB - Hyperclique patterns are groups of objects which are strongly related to each other. Indeed, the objects in a hyperclique pattern have a guaranteed level of global pairwise similarity to one another as measured by uncentered Pearson's correlation coefficient. Recent literature has provided the approach to discovering hyperclique patterns over data sets with binary attributes. In this paper, we introduce algorithms for mining maximal hyperclique patterns in large data sets containing quantitative attributes. An intuitive and simple solution is to partition quantitative attributes into binary attributes. However, there is potential information loss due to partitioning. Instead, our approach is based on a normalization scheme and can directly work on quantitative attributes. In addition, we adopt the algorithm structures of three popular association pattern mining algorithms and add a critical clique pruning technique. Finally, we compare the performance of these algorithms for finding quantitative maximal hyperclique patterns using some real-world data sets.
UR - http://www.scopus.com/inward/record.url?scp=33745773300&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=33745773300&partnerID=8YFLogxK
U2 - 10.1007/11731139_65
DO - 10.1007/11731139_65
M3 - Conference contribution
AN - SCOPUS:33745773300
SN - 3540332065
SN - 9783540332060
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 552
EP - 556
BT - Advances in Knowledge Discovery and Data Mining - 10th Pacific-Asia Conference, PAKDD 2006, Proceedings
Y2 - 9 April 2006 through 12 April 2006
ER -