Scientific Data (Aug 2023)

Digitization of the Australian Parliamentary Debates, 1998–2022

  • Lindsay Katz,
  • Rohan Alexander

DOI
https://doi.org/10.1038/s41597-023-02464-w
Journal volume & issue
Vol. 10, no. 1
pp. 1 – 14

Abstract

Read online

Abstract Public knowledge of what is said in parliament is a tenet of democracy, and a critical resource for political science research. In Australia, following the British tradition, the written record of what is said in parliament is known as Hansard. While the Australian Hansard has always been publicly available, it has been difficult to use for the purpose of large-scale macro- and micro-level text analysis because it has only been available as PDFs or XMLs. Following the lead of the Linked Parliamentary Data project which achieved this for Canada, we provide a new, comprehensive, high-quality, rectangular database that captures proceedings of the Australian parliamentary debates from 1998 to 2022. The database is publicly available and can be linked to other datasets such as election results. The creation and accessibility of this database enables the exploration of new questions and serves as a valuable resource for both researchers and policymakers.