Acta Informatica Pragensia (Dec 2020)

The Process of Unit Price Extraction from Public Sector Contracts

  • Tomáš Bruckner,
  • Filip Vencovský

DOI
https://doi.org/10.18267/j.aip.139
Journal volume & issue
Vol. 9, no. 2
pp. 170 – 183

Abstract

Read online

Czech government institutions commissioned a research on extracting usual unit prices from public IT contracts to aid future public tender sizing. The goal of the project is to obtain millions of contracts from the public register, convert them to full text, extract unit prices from the text and publish a pricelist of IT industry manday prices. This paper designs the process and method of price extraction, demonstrates and evaluates the result on five iterations of extraction and discusses the experience of two years of project performance. The process is designed as a set of repeatable workflows and specified activity and role description. The method is designed as a combination of automated and manual actions. Due to the format and content variability of involved documents and the low mistake tolerance, the possibility of automated extraction of unit prices from full text contract is limited, and human workforce for validation is crucial.

Keywords