IEEE Access (Jan 2024)
Optimizing the Execution of Product Data Models
Abstract
The Product Data Model (PDM) is an example of a declarative data-centric approach to modelling information-intensive business processes, which offers flexibility and facilitates process optimization. Declarative approaches are the de facto choice in all modern data-oriented workflows, but they require an optimizer to choose among multiple, alternative execution plans that can produce the desired end product. In PDM business processes, current optimization heuristics suffer from severe limitations regarding both their efficiency and applicability to realistic scenarios, stemming from a lack of consideration for the resource perspective of the processes being modelled and the advances in modern data flow optimizers. This work tackles both of these limitations with the proposal of rank-based operation ordering optimizations tailored to the specificities of PDM, which are also combined with the consideration of the resources available to execute the process operations and parallelism options. Through an extensive evaluation of the proposed solutions, it is showcased that there are significant performance gains from the advanced rank-based operation ordering techniques with the added support of parallel execution. The speedups observed were up to 5.5X compared to the state-of-the-art optimization heuristics.
Keywords