Big Data and Cognitive Computing (Jul 2019)

Archetype-Based Modeling and Search of Social Media

  • Brent D. Davis,
  • Kamran Sedig,
  • Daniel J. Lizotte

DOI
https://doi.org/10.3390/bdcc3030044
Journal volume & issue
Vol. 3, no. 3
p. 44

Abstract

Existing keyword-based search techniques suffer from limitations owing to unknown, mismatched, and obscure vocabulary. These challenges are particularly prevalent in social media, where slang, jargon, and memetics are abundant. We develop a new technique, Archetype-Based Modeling and Search, that can mitigate these challenges as they are encountered in social media. This technique learns to identify new relevant documents based on a specified set of archetypes, from which both vocabulary and relevance information are extracted. We present a case study on social media data from Reddit, using authors from /r/Opiates to characterize discourse around opioid use and to find additional relevant authors on this topic.

Keywords