New Genetics and Society (Jan 2021)
Data curation-research: practices of data standardization and exploration in a precision medicine database
Abstract
Key to precision medicine is the development of expert database projects that gather data, integrate them in the pre-existing database, and publish the product of their processing for others to make use of. Increasingly, it is required that data infrastructure managers and curators pursue and lead research projects on the data so as to learn about new ways data could be used or information that could be potentially generated from them. I call these efforts “data curation-research” and use the case study of COSMIC, the Catalogue of Somatic Mutations in Cancer, to analyze the contextual factors shaping the science of data curation-research. I build on March’s organizational learning categories of exploitation and exploration to place these factors within a theory of organizational change and innovation, and contribute to a richer picture of the social drivers of cancer genomics.
Keywords