Seismica (May 2023)

Curated Pacific Northwest AI-ready Seismic Dataset

  • Yiyu Ni,
  • Alexander Hutko,
  • Francesca Skene,
  • Marine Denolle,
  • Stephen Malone,
  • Paul Bodin,
  • Renate Hartog,
  • Amy Wright

DOI
https://doi.org/10.26443/seismica.v2i1.368
Journal volume & issue
Vol. 2, no. 1

Abstract

The curation of seismic datasets is the cornerstone of seismological research and the starting point of machine-learning applications in seismology. We present a 21-year-long AI-ready dataset of diverse seismic event parameters, instrumentation metadata, and waveforms, as curated by the Pacific Northwest Seismic Network and ourselves. The dataset contains about 190,000 three-component (3C) waveform traces from more than 65,000 earthquake and explosion events, and about 9,200 waveforms from 5,600 exotic events. Event magnitudes range from 0 to 6.4, the largest being the 20 December 2022 M6.4 Ferndale earthquake. We include waveforms from high-gain (EH, BH, and HH channels) and strong-motion (EN channels) seismometers, all resampled to 100 Hz. We describe the earthquake catalog and the temporal evolution of the data attributes (e.g., event magnitude type, channel type, waveform polarity, signal-to-noise ratio, and phase picks) as the network earthquake monitoring system evolved through time. We propose this AI-ready dataset as a new open-source benchmark dataset.