Big Data and Cognitive Computing (Sep 2018)

Recreating the Relationship between Subjective Wellbeing and Personality Using Machine Learning: An Investigation into Facebook Online Behaviours

  • Alexandra Marinucci,
  • Jake Kraska,
  • Shane Costello

DOI
https://doi.org/10.3390/bdcc2030029
Journal volume & issue
Vol. 2, no. 3
p. 29

Abstract

The twenty-first century has delivered technological advances that allow researchers to use social media to predict personal traits and psychological constructs. This article aims to further our understanding of the relationship between subjective wellbeing (SWB) and the Five Factor Model (FFM) of personality by attempting to replicate the relationship using machine learning prediction models. Data from the myPersonality Project were used, with observed SWB scores derived from the Satisfaction With Life Scale (SWLS) and FFM personality profiles generated from responses to the 100-item IPIP proxy of the NEO-PI-R. After data cleaning, FFM personality traits and SWB scores were predicted by reducing Facebook Likes to 50 dimensions using singular value decomposition (SVD) and then fitting six multiple regressions (estimated by least squares, with the data split by k-fold cross-validation), with the Likes dimensions as predictors and each of the five FFM traits and the SWB score as response variables. Standard multiple regression analyses were then conducted separately for the observed and the machine learning-predicted variables, so that the two sets of relationships could be compared in the context of previous literature. In the observed model, high SWB was predicted by high extraversion, conscientiousness, and agreeableness, and by low openness to experience and neuroticism, consistent with previous research. In the machine learning model, high SWB was predicted by high extraversion, openness to experience, conscientiousness, and agreeableness, and by low neuroticism. The relationships between SWB and extraversion, neuroticism, and conscientiousness were successfully replicated in the machine learning model. Openness to experience changed direction in its relationship with SWB between the observed and machine learning-derived variables because the variable was not accurately recreated, and agreeableness was multicollinear with SWB in the machine learning model because identical digital behaviours were unknowingly used to replicate each construct. Implications of the results and directions for future research are discussed.
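
The prediction step described in the abstract (SVD reduction of Likes, then least-squares regression evaluated with k-fold splits) might be sketched roughly as follows. This is an illustrative Python/NumPy sketch and not the authors' code: the data are synthetic, the helper names `reduce_likes` and `kfold_predict` are hypothetical, and the fold count of 10, the random seeds, and the decision not to centre the Likes matrix are all assumptions that the abstract does not specify. Only one outcome is shown; the study describes six such regressions, one per FFM trait plus SWB.

```python
import numpy as np


def reduce_likes(likes, n_dims):
    """Project a user-by-Like matrix onto its top n_dims singular directions."""
    # Economy SVD: likes = u @ diag(s) @ vt
    u, s, _ = np.linalg.svd(likes, full_matrices=False)
    # Scale each retained left singular vector by its singular value.
    return u[:, :n_dims] * s[:n_dims]


def kfold_predict(x, y, n_folds=10, seed=0):
    """Return out-of-fold least-squares predictions of y from x."""
    rng = np.random.default_rng(seed)
    order = rng.permutation(len(y))
    preds = np.empty(len(y), dtype=float)
    for test_idx in np.array_split(order, n_folds):
        train_mask = np.ones(len(y), dtype=bool)
        train_mask[test_idx] = False
        # Prepend a column of ones so the model has an intercept.
        x_train = np.column_stack([np.ones(train_mask.sum()), x[train_mask]])
        x_test = np.column_stack([np.ones(len(test_idx)), x[test_idx]])
        # Fit on the training folds only, then predict the held-out fold.
        coef, *_ = np.linalg.lstsq(x_train, y[train_mask], rcond=None)
        preds[test_idx] = x_test @ coef
    return preds


# Synthetic stand-in data (assumption): 300 users, 200 Likes, 5 latent factors.
rng = np.random.default_rng(42)
latent = rng.normal(size=(300, 5))
loadings = rng.normal(size=(5, 200))
noise = rng.normal(size=(300, 200))
likes = (latent @ loadings + noise > 1.0).astype(float)  # binary Like matrix
trait = latent[:, 0] + 0.5 * rng.normal(size=300)        # one simulated outcome

dims = reduce_likes(likes, n_dims=50)             # 50 dimensions, as in the abstract
predicted = kfold_predict(dims, trait, n_folds=10)
r = np.corrcoef(trait, predicted)[0, 1]           # observed vs predicted correlation
print(f"out-of-fold correlation: {r:.2f}")
```

In this sketch the SVD is computed once on the full matrix before the folds are split, which mirrors the order of steps in the abstract; a stricter design would refit the reduction inside each training fold. The out-of-fold predictions would then play the role of the "machine learning predicted" variables that are compared with the observed scores.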

Keywords