Identifying memes and their interactions in online communities

Master Thesis (2018)
Author(s)

J.P. Verdoorn (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

N. Tintarev – Mentor

Christoph Lofi – Mentor

Geert-Jan Houben – Graduation committee member

H. Wang – Graduation committee member

Faculty
Electrical Engineering, Mathematics and Computer Science
Copyright
© 2018 Joost Verdoorn
More Info
expand_more
Publication Year
2018
Language
English
Copyright
© 2018 Joost Verdoorn
Graduation Date
11-06-2018
Awarding Institution
Delft University of Technology
Programme
['Computer Science | Web Information Systems']
Faculty
Electrical Engineering, Mathematics and Computer Science
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Memes are theorized to be the building blocks of culture. Due to a lack of empirical validation, however, the theory of memes — memetics — remains in its infancy. We argue that one of the missing components for such empirical validation is a method for the large-scale identification of memes.

In this thesis, we develop a method for the identification of scientific memes — ngrams of length 1 through 4, denoting scientific concepts — propagating within online communities. With data extracted from science-oriented correspondence extracted from five communities on the online discussion platform Reddit, and five communities on the online question and answer platform StackExchange, we perform a large-scale automated evaluation in which we find that memes identified in these communities correspond to the titles of Wikipedia articles; and a small-scale human evaluation in which we find that the identified memes represent relevant concepts to the community’s scientific field.

Furthermore, we introduce a slight adaptation of this method to elucidate one of memetics’ predictions: the occurrence of interactions between memes, where the occurrence of one meme has a positive or negative influence on the propagation of another meme. To evaluate this method for the identification of meme interactions, we construct meme interaction networks, in which we find that the most central memes correspond to the most relevant scientific concepts.

We find that our methods are able to extract key concepts within online communities, identifying thousands of relevant concepts from millions of candidate ngrams. Thus, our method may contribute to contemporary text mining research, and could be used in place of, or in conjunction with current approaches, such as TF-IDF or LDA.

Files

License info not available