Applied Sciences (Aug 2019)

Pricing Personal Data Based on Data Provenance

  • Yuncheng Shen,
  • Bing Guo,
  • Yan Shen,
  • Fan Wu,
  • Hong Zhang,
  • Xuliang Duan,
  • Xiangqian Dong

DOI
https://doi.org/10.3390/app9163388
Journal volume & issue
Vol. 9, no. 16
p. 3388

Abstract

Read online

Data have become an important asset. Mining the value contained in personal data, making personal data an exchangeable commodity, has become a hot spot of industry research. Then, how to price personal data reasonably becomes a problem we have to face. Based on previous research on data provenance, this paper proposes a novel minimum provenance pricing method, which is to price the minimum source tuple set that contributes to the query. Our pricing model first sets prices for source tuples according to their importance and then makes query pricing based on data provenance, which considers both the importance of the data itself and the relationships between the data. We design an exact algorithm that can calculate the exact price of a query in exponential complexity. Furthermore, we design an easy approximate algorithm, which can calculate the approximate price of the query in polynomial time. We instantiated our model with a select-joint query and a complex query and extensively evaluated its performances on two practical datasets. The experimental results show that our pricing model is feasible.

Keywords