Patent relatedness and velocity in the Chinese pharmaceutical industry: A dataset of Jaccard similarity indices
Charlotte Marie Vorreuther,
Thierry Warin
Affiliations
Charlotte Marie Vorreuther
CIRANO (Inter-University Research Center on the Analysis of Organizations), 1130, Sherbrooke West, Suite 1400, Montreal H3A 2M8, QC, Canada
Thierry Warin
CIRANO (Inter-University Research Center on the Analysis of Organizations), 1130, Sherbrooke West, Suite 1400, Montreal H3A 2M8, QC, Canada; Department of International Business, HEC Montréal, 3000, chemin de la Côte-Sainte-Catherine, Montreal H3T 2A7, QC, Canada; Corresponding author at: Department of International Business, HEC Montréal, 3000, chemin de la Côte-Sainte-Catherine, Montreal H3T 2A7, QC, Canada.
The dataset describes innovation dynamics in the Chinese pharmaceutical industry. Innovation dynamics are interpreted as knowledge transfer across technologies (relatedness) and through time (velocity). The dataset provides 143,916 Jaccard similarity indices. A Jaccard similarity index measures the similarity between two units as the size of the intersection of their sets divided by the size of their union, so it ranges from 0 (no overlap) to 1 (identical sets). Here, the indices proxy relatedness across technologies (classes) and through time (velocity). They are computed from a Natural Language Processing treatment of 69,923 patents in the pharmaceutical industry in China from 1990 to 2017.
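As an illustration of the measure only, the following minimal Python sketch computes a Jaccard similarity index between two sets of terms. The function name `jaccard_similarity`, the example term sets, and the convention of returning 0 for two empty sets are assumptions made for this sketch; the abstract does not specify the text processing pipeline used to build the dataset, so this should not be read as that pipeline.

```python
def jaccard_similarity(a, b):
    """Return |A intersect B| / |A union B| for two iterables treated as sets."""
    set_a, set_b = set(a), set(b)
    union = set_a | set_b
    if not union:
        # Assumed convention for this sketch: two empty sets have similarity 0.
        return 0.0
    return len(set_a & set_b) / len(union)


# Hypothetical term sets standing in for two technology classes
# (or for the same class in two different years, for velocity).
class_a_terms = {"compound", "tablet", "extract", "inhibitor"}
class_b_terms = {"compound", "capsule", "extract", "polymer"}

# 2 shared terms out of 6 distinct terms overall.
score = jaccard_similarity(class_a_terms, class_b_terms)
print(f"Jaccard similarity: {score:.3f}")
```

Under this reading, comparing term sets across classes corresponds to relatedness, and comparing term sets across years corresponds to velocity.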