Население и экономика (Dec 2020)

Database “Pro-family (pronatalist) communities in the social network VKontakte”

  • Irina E. Kalabikhina,
  • Evgeny P. Banin

DOI
https://doi.org/10.3897/popecon.4.e60915
Journal volume & issue
Vol. 4, no. 3
pp. 98 – 103

Abstract

Read online Read online Read online

The database contains uploading text comments from the social network VKontakte in .csv format (UTF-8 encoding). The comments are collected from communities discussing pregnancy, childhood, motherhood, etc. Uploading contains comments to posts with which the interaction took place. The absolute number of likes was used as a criterion (comments were collected where the number of likes is greater than or equal to 5). Text data was pre-processed (stemmization and lemmatization). The data is suitable for thematic analysis (e.g. LDA – Latent Dirichlet Allocation), for modelling the graph structure of communities (the link_comment variable contains a unique post identifier, link_author contains a unique user identifier), for analysis of tonalities of statements and formation of a dictionary of demographic connotation in Russian. Analysis of the tonalities of statements enables measuring the dynamics of “demographic temperature” in pro-family (pronatalist) communities.

Keywords