Data in Brief (Jun 2024)

A dataset of logistics sites in England and Wales: Location, size, type and loading bays

  • Christopher de Saxe,
  • Daniel Ainalis,
  • David Cebon

Journal volume & issue
Vol. 54
p. 110399

Abstract

Read online

Data on the location and size of logistics sites is essential for the accurate system-level modelling of transport and logistics operations. This is becoming increasingly important to support governments and industry transition to a net zero future which will feature new operating models and vehicle technologies, particularly for electric vehicle operations. In this work we present a dataset of logistics sites across England and Wales categorised into warehouses, retail sites, and factories. There are 47,683 rows of data in total, comprising 27,691 warehouse sites, 6,441 retail sites, and 13,551 factory sites. Each row contains the site's category, location (latitude and longitude), size (in square meters), and modelled number of heavy goods vehicle loading bays. Raw data on non-domestic properties in England and Wales were sourced from the UK's Valuation Agency Office database. Addresses were geocoded to determine the coordinates of each site, floor area was determined for each site via a web crawler script, and the type of site was derived using a keyword-based categorisation process. The size of the site gives an indication of the expected transport activity (i.e. volume of goods handled) and is a useful proxy to estimate the number of loading bays which, in turn, is a useful proxy for the number of electric heavy goods vehicle charging points the site may have to accommodate to support electric vehicle operations. Models relating the floor area to the number of loading bays were developed using satellite imagery of a sample of data from each category. Uncertainty in the geolocation, category and floor area data is deemed to be very low<1%), while the models to predict loading bay data are based on a sample of the overall dataset and subject to higher uncertainty (<20 %). Larger sample datasets and alternative models may be explored in future work to suit other applications.

Keywords