Data in Brief (Feb 2024)

Data on LEGO sets release dates and worldwide retail prices combined with aftermarket transaction prices in Poland between June 2018 and June 2023

  • Wiktor Oczkoś,
  • Bartosz Podgórski,
  • Wiktoria Szczepańska,
  • Tomasz Boiński

Journal volume & issue
Vol. 52
p. 110056

Abstract

Read online

The dataset contains LEGO bricks sets item count and pricing history for AI-based set pricing prediction. The data spans the timeframe from June 2018 to June 2023. The data was obtained from three sources: Brickset.com (LEGO sets retail prices, release dates, and IDs), Lego.com official web page (ID number of each set that was released by Lego, its retail prices, the current status of the set) and promoklocki.pl web page (the retail prices for Poland, prices from aftermarket transactions). The data was merged based on the official LEGO set ID. With high granularity of the data (averaged monthly prices per LEGO set) the dataset permits the computation of variables at the set level and could support both aggregate and time-series analyses whereas the sparseness of the data permits the analysis of collector behavior allowing pinpointing of expected qualities from the purchased products and their resale potential. This may be useful to a broad range of researchers and data scientists using statistical methods and machine-learning techniques for price prediction.

Keywords