Disjoint interval partitioning

VLDB J. (2017)

Abstract

In databases with time interval attributes, query processing techniques based on sort-merge or sort-aggregate deteriorate. This happens because no total order exists for intervals, so either the start or the end point is used for sorting. Doing so leads to inefficient solutions with many unproductive comparisons, i.e., comparisons that do not produce an output tuple. Even if just one tuple with a long interval is present in the data, the number of unproductive comparisons of sort-merge and sort-aggregate becomes quadratic. In this paper we propose disjoint interval partitioning (𝒟ℐ𝒫), a technique to efficiently perform sort-based operators on interval data. 𝒟ℐ𝒫 divides an input relation into the minimum number of partitions such that all tuples in a partition are non-overlapping. The absence of overlapping tuples guarantees efficient sort-merge computations without backtracking. With 𝒟ℐ𝒫 the number of unproductive comparisons is linear in the number of partitions. In contrast to current solutions with inefficient random accesses to the active tuples, 𝒟ℐ𝒫 fetches the tuples in a partition sequentially. We illustrate the generality and efficiency of 𝒟ℐ𝒫 by describing and evaluating three basic database operators over interval data: join, anti-join, and aggregation.
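
To make the partitioning idea in the abstract concrete, the following is a minimal sketch of the classic greedy interval-partitioning scheme, which splits a set of intervals into the minimum number of groups with no overlaps inside a group. It is an illustration under stated assumptions, not the paper's own algorithm: the function name `disjoint_partitions`, the half-open `[start, end)` interval convention, and the in-memory list representation are all hypothetical choices made for this example.

```python
import heapq


def disjoint_partitions(intervals):
    """Split intervals into the fewest groups of pairwise non-overlapping intervals.

    Assumption for this sketch: each interval is a (start, end) tuple that is
    half-open, [start, end), so (2, 3) and (3, 5) do not overlap.
    """
    partitions = []  # partitions[i] is a list of intervals, sorted by start
    heap = []        # (end of the last interval in a partition, partition index)

    # Process intervals in start order; reuse the partition that frees up first.
    for start, end in sorted(intervals):
        if heap and heap[0][0] <= start:
            # The earliest-ending partition is free again: append to it.
            _, idx = heapq.heappop(heap)
        else:
            # Every existing partition still overlaps: open a new one.
            idx = len(partitions)
            partitions.append([])
        partitions[idx].append((start, end))
        heapq.heappush(heap, (end, idx))
    return partitions


if __name__ == "__main__":
    data = [(1, 10), (2, 3), (3, 5), (6, 8)]
    for i, part in enumerate(disjoint_partitions(data)):
        print(i, part)
```

On the sample input, the single long interval `(1, 10)` ends up alone in one partition and the three short intervals share a second one, so each partition can be scanned sequentially in start order without revisiting earlier tuples.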

Keywords

Temporal data, Interval data, Query processing, Join, Anti-join, Aggregation
