Facilitating Twitter data analytics

Platform, language and functionality

Conference Paper (2015)
Author(s)

Ke Tao (TU Delft - Web Information Systems)

Claudia Hauff (TU Delft - Web Information Systems)

Geert Jan Houben (TU Delft - Web Information Systems)

Fabian Abel (XING AG)

Guido Wachsmuth (TU Delft - Web Information Systems, TU Delft - Programming Languages)

DOI related publication
https://doi.org/10.1109/BigData.2014.7004259 Final published version
Publication Year
2015
Language
English
Article number
7004259
Pages (from-to)
421-430
ISBN (electronic)
9781479956654

Abstract

Conducting analytics over data generated by Social Web portals such as Twitter is challenging due to the volume, variety and velocity of the data. Commonly, ad hoc pipelines are used, each solving one particular use case. In this paper, we generalize across a range of typical Twitter-data use cases and determine a set of common characteristics. Based on this investigation, we present our Twitter Analytical Platform (TAP), a generic platform for conducting analytical tasks with Twitter data. The platform provides a domain-specific Twitter Analysis Language (TAL) as the interface to its functionality stack. TAL includes a set of analysis tools ranging from data collection and semantic enrichment to machine learning. With these tools, it becomes possible to create and customize analytical workflows in TAL and build applications that make use of the analytics results. We showcase the applicability of our platform by building Twinder, a search engine for Twitter streams.