Emergence of Consensus and Shared Vocabularies in Collaborative Tagging Systems

Valentin Robu, Harry Halpin, Hana Shepherd

Research output: Contribution to journalArticlepeer-review

97 Scopus citations


This article uses data from the social bookmarking site del.icio.us to empirically examine the dynamics of collaborative tagging systems and to study how coherent categorization schemes emerge from unsupervised tagging by individual users. First, we study the formation of stable distributions in tagging systems, seen as an implicit form of “consensus” reached by the users of the system around the tags that best describe a resource. We show that final tag frequencies for most resources converge to power law distributions and we propose an empirical method to examine the dynamics of the convergence process, based on the Kullback-Leibler divergence measure. The convergence analysis is performed for both the most utilized tags at the top of tag distributions and the so-called long tail. Second, we study the information structures that emerge from collaborative tagging, namely tag correlation (or folksonomy) graphs.We show how community-based network techniques can be used to extract simple tag vocabularies from the tag correlation graphs by partitioning them into subsets of related tags. Furthermore, we also show, for a specialized domain, that shared vocabularies produced by collaborative tagging are richer than the vocabularies which can be extracted from large-scale query logs provided by a major search engine.

Original languageEnglish (US)
Pages (from-to)1-34
Number of pages34
JournalACM Transactions on the Web
Issue number4
StatePublished - Sep 1 2009
Externally publishedYes

All Science Journal Classification (ASJC) codes

  • Computer Networks and Communications


  • Algorithms
  • Collaborative tagging
  • Human Factors
  • Measurement
  • community identification algorithms
  • complex systems
  • emergent semantics
  • graphical models
  • knowledge extraction
  • power laws
  • search engines


Dive into the research topics of 'Emergence of Consensus and Shared Vocabularies in Collaborative Tagging Systems'. Together they form a unique fingerprint.

Cite this