Journal of eScience Librarianship (Mar 2019)

What’s in the Box? Assessing the potential usability of four decades of thesis and dissertation supplementary files

  • Steven Van Tuyl

DOI
https://doi.org/10.7191/jeslib.2019.1142
Journal volume & issue
Vol. 8, no. 1
p. e1142

Abstract

Read online

Objectives: The objective of this study is to evaluate the quality and usability of supplementary data files deposited, between 1971 and 2015, to our university institutional repository. Understanding the extent to which content historically deposited in digital repositories is usable by today’s researchers can help inform digital preservation and documentation practices for researchers today. Methods: I identified all graduate level theses and dissertations (GTDs) in the institutional repository with multiple files as a first pass at identifying documents that included supplementary data files. These GTDs were then individually examined, removing supplementary files that were artifacts of either the upload or digitization process. The remaining “true” supplementary files were then individually opened and evaluated following elements of the DATA rubric of Van Tuyl and Whitmire (2016). Results: Supplementary files were discovered in the repository dating back to 1971 in 116 GTD submissions totalling more than 25,000 files. Most GTD submissions included fewer than 30 files, though some submissions included thousands of individual data files. The most common file types submitted include imagery, tabular data, and databases, with a very large number of unknown file types. Overall, levels of documentation were poor while actionability of datasets was generally middling. Conclusions: The results presented in this study suggest that legacy data submitted to our institutional repository with GTDs is generally in poor shape with respect to Transparency and somewhat less so for Actionability. It is clear from this study and others that researchers have a long road ahead when it comes to sharing data in a way that makes it potentially useable by other researchers.

Keywords