TY - JOUR
T1 - Supporting research data collection from YouTube with TubeKit
AU - Shah, Chirag
N1 - Funding Information:
This work was not possible without constant guidance and support of the other members of The VidArch Project team—Gary Marchionini, Rob Capra, Paul Jones, Sarah Jordan, Cal Lee, Terrell Russell, Laura Sheble, Yaxiao Song, and Helen Tibbo. The work reported here is supported by NSF grant # IIS 0455970.
PY - 2010/4
Y1 - 2010/4
N2 - We present TubeKit, a query-based YouTube crawling toolkit. This software is a collection of tools that allows users to build their own crawler that can crawl YouTube based on a set of seed queries and collect up to 17 different attributes. TubeKit assists in all the phases of this process, starting with database creation to finally giving access to the collected data with browsing and searching interfaces. We further demonstrate how we used this toolkit to collect elections-related data from YouTube for nearly two years. Some analysis of the collected data relating to the elections is also given.
AB - We present TubeKit, a query-based YouTube crawling toolkit. This software is a collection of tools that allows users to build their own crawler that can crawl YouTube based on a set of seed queries and collect up to 17 different attributes. TubeKit assists in all the phases of this process, starting with database creation to finally giving access to the collected data with browsing and searching interfaces. We further demonstrate how we used this toolkit to collect elections-related data from YouTube for nearly two years. Some analysis of the collected data relating to the elections is also given.
KW - Presidential elections
KW - Video data collection
KW - YouTube crawling
UR - http://www.scopus.com/inward/record.url?scp=78049341546&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=78049341546&partnerID=8YFLogxK
U2 - 10.1080/19331681003748875
DO - 10.1080/19331681003748875
M3 - Article
AN - SCOPUS:78049341546
SN - 1933-1681
VL - 7
SP - 226
EP - 240
JO - Journal of Information Technology and Politics
JF - Journal of Information Technology and Politics
IS - 2-3
ER -