Demographic Research (Mar 2019)
Happy parents' tweets: An exploration of Italian Twitter data using sentiment analysis
Abstract
Background: Demographers are increasingly interested in connecting demographic behaviour and trends with 'soft' measures, i.e., complementary information on attitudes, values, feelings, and intentions. Objective: The aim of this paper is to demonstrate how computational linguistic techniques can be used to explore opinions and semantic orientations related to parenthood. Methods: In this article we scrutinize about three million filtered Italian tweets from 2014. First, we implement a methodological framework relying on Natural Language Processing techniques for text analysis, which is used to extract sentiments. We then run a supervised machine-learning experiment on the overall dataset, based on the annotated set of tweets from the previous stage. Consequently, we infer to what extent social media users report negative or positive affect on topics relevant to the fertility domain. Results: Parents express a generally positive attitude towards being and becoming parents, but they are also fearful, surprised, and sad. They also have quite negative sentiments about their children's future, politics, fertility, and parental behaviour. By exploiting geographical information from tweets we find a significant correlation between the prevalence of positive sentiments about parenthood and macro-regional indicators of both life satisfaction and fertility level. Contribution: We show how tweets can be used to represent soft measures such as attitudes, values, and feelings, and we establish how they relate to demographic features. Linguistic analysis of social media data provides a middle ground between qualitative studies and more standard quantitative approaches.
Keywords