Supporting research data collection from YouTube with TubeKit

Chirag Shah

Research output: Contribution to journal › Article › peer-review

6 Scopus citations


We present TubeKit, a query-based YouTube crawling toolkit. This software is a collection of tools that allows users to build their own crawlers, which crawl YouTube from a set of seed queries and collect up to 17 different attributes. TubeKit assists in all phases of this process, from database creation to providing access to the collected data through browsing and searching interfaces. We further demonstrate how we used this toolkit to collect elections-related data from YouTube for nearly two years. Some analysis of the collected elections-related data is also given.

Original language: English (US)
Pages (from-to): 226-240
Number of pages: 15
Journal: Journal of Information Technology and Politics
Issue number: 2-3
State: Published - Apr 2010

All Science Journal Classification (ASJC) codes

  • Computer Science(all)
  • Sociology and Political Science
  • Public Administration


Keywords

  • Presidential elections
  • Video data collection
  • YouTube crawling

