Inferring social networks from unstructured text data: A proof of concept detection of hidden communities of interest

Christophe Malaterre; Francis Lareau

doi:10.1017/dap.2023.48

Data & Policy (Jan 2024)

Affiliations

Christophe Malaterre: Département de philosophie, Université du Québec à Montréal (UQAM), Montréal, Québec, Canada; Centre interuniversitaire de recherche sur la science et la technologie, Université du Québec à Montréal (UQAM), Montréal, Québec, Canada
Francis Lareau: Département d’informatique, Université du Québec à Montréal (UQAM), Montréal, Québec, Canada

DOI: https://doi.org/10.1017/dap.2023.48
Journal volume & issue: Vol. 6

Abstract

Social network analysis is known to provide a wealth of insights relevant to many aspects of policymaking. Yet, the social data needed to construct social networks are not always available. Furthermore, even when they are, interpreting such networks often relies on extraneous knowledge. Here, we propose an approach to infer social networks directly from the texts produced by actors and the terminological similarities that these texts exhibit. This approach relies on fitting a topic model to the texts produced by these actors and measuring topic profile correlations between actors. This reveals what can be called “hidden communities of interest,” that is, groups of actors sharing similar semantic contents but whose social relationships with one another may be unknown or underlying. Network interpretation follows from the topic model. Diachronic perspectives can also be built by modeling the networks over different time periods and mapping genealogical relationships between communities. As a case study, the approach is deployed over a working corpus of academic articles (domain of philosophy of science; N=16,917).
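
The pipeline summarized in the abstract (topic profiles per actor, correlations between profiles, communities read off the resulting network) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the document-topic weights and authorship lists are invented toy data standing in for the output of a fitted topic model, the 0.7 correlation threshold is arbitrary, and connected components are used as a crude stand-in for whatever community detection method the paper applies.

```python
from itertools import combinations
from math import sqrt

# Hypothetical document-topic weights, as a fitted topic model might output
# (one row per document, one column per topic). Values are invented.
doc_topics = [
    [0.8, 0.1, 0.1],
    [0.7, 0.2, 0.1],
    [0.6, 0.3, 0.1],
    [0.1, 0.1, 0.8],
    [0.2, 0.1, 0.7],
]
# Hypothetical authorship: the authors of each document, in the same order.
doc_authors = [["A"], ["A", "B"], ["B"], ["C"], ["C", "D"]]


def author_profiles(doc_topics, doc_authors):
    """Average the topic weights of every document an author signed."""
    sums, counts = {}, {}
    for weights, authors in zip(doc_topics, doc_authors):
        for author in authors:
            acc = sums.setdefault(author, [0.0] * len(weights))
            for k, w in enumerate(weights):
                acc[k] += w
            counts[author] = counts.get(author, 0) + 1
    return {a: [s / counts[a] for s in acc] for a, acc in sums.items()}


def pearson(x, y):
    """Pearson correlation between two equal-length topic profiles."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy) if sx and sy else 0.0


def similarity_edges(profiles, threshold=0.7):
    """Link two actors when their profile correlation reaches the threshold."""
    edges = []
    for a, b in combinations(sorted(profiles), 2):
        r = pearson(profiles[a], profiles[b])
        if r >= threshold:
            edges.append((a, b, r))
    return edges


def communities(nodes, edges):
    """Connected components (union-find), a stand-in for community detection."""
    parent = {n: n for n in nodes}

    def find(n):
        while parent[n] != n:
            parent[n] = parent[parent[n]]  # path compression
            n = parent[n]
        return n

    for a, b, _ in edges:
        parent[find(a)] = find(b)
    groups = {}
    for n in nodes:
        groups.setdefault(find(n), set()).add(n)
    return sorted(sorted(g) for g in groups.values())


profiles = author_profiles(doc_topics, doc_authors)
edges = similarity_edges(profiles)
print(communities(profiles, edges))  # [['A', 'B'], ['C', 'D']]
```

In this toy data, C and D share a document, but A and B would be linked even without their shared document, because the edge depends only on profile similarity. That is the sense in which such communities can be "hidden": the link reflects semantic proximity, not a recorded social tie.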

Published in Data & Policy

ISSN: 2632-3249 (Online)
Publisher: Cambridge University Press
Country of publisher: United Kingdom
LCC subjects: Technology: Technology (General): Industrial engineering. Management engineering: Information technology; Political science: Political institutions and public administration (General)
Website: https://www.cambridge.org/core/journals/data-and-policy
