Journal Home > Volume 33 , Issue 5

Time intervals are often associated with tuples to represent their valid time in temporal relations, where overlap join is crucial for various kinds of queries. Many existing overlap join algorithms use indices based on tree structures such as quad-tree, B+-tree and interval tree. These algorithms usually have high CPU cost since deep path traversals are unavoidable, which makes them not so competitive as data-partition or plane-sweep based algorithms. This paper proposes an efficient overlap join algorithm based on a new two-layer flat index named as Overlap Interval Inverted Index (i.e., O2i Index). It uses an array to record the end points of intervals and approximates the nesting structures of intervals via two functions in the first layer, and the second layer uses inverted lists to trace all intervals satisfying the approximated nesting structures. With the help of the new index, the join algorithm only visits the must-be-scanned lists and skips all others. Analyses and experiments on both real and synthetic datasets show that the proposed algorithm is as competitive as the state-of-the-art algorithms.

File
jcst-33-5-1023-Highlights.pdf (784.6 KB)
Publication history
Copyright
Acknowledgements

Publication history

Received: 21 June 2017
Revised: 18 June 2018
Published: 12 September 2018
Issue date: September 2018

Copyright

©2018 LLC & Science Press, China

Acknowledgements

Acknowledgment

We would like to appreciate Zeng-Long Wang, a Master student of Ji-Zhou Luo, for his work on early experiments.

Return