International Journal of Distributed Sensor Networks (Oct 2015)
MSSUTop-: Determining the Minimum Scan Scope for UTop- Query over Uncertain Data
Abstract
The semantics of UTop- k query is based on the possible world model, and the greatest challenge in processing UTop- k queries is the explosion of possible world space. In this direction, several optimized algorithms have been developed. However, uncertain databases are different in data distributions under different scoring functions, which has significant influence on the performance of the existing optimizing algorithms. In this paper, we propose two novel algorithms, MSSUTop- k and quick MSSUTop- k , for determining the minimum scan scope for UTop- k query processing. This work is important because before UTop- k query processing is started, users hope to know in advance how many and which tuples will be involved in UTop- k query processing. Then, they can make a balance between result precision and processing cost. So, it should be the prerequisite for answering UTop- k queries. MSSUTop- k can achieve accurate results but is relatively more costly in time complexity. Oppositely, quick MSSUTop- k can only achieve approximate results but performs better in time cost. We conduct comprehensive experiments to evaluate the performance of our proposed algorithms and analyze the relationship between the data distribution and the minimum scan scope of UTop- k queries.