
TID Hash Joins


Robert Marek

University of Kaiserslautern, GERMANY
haerder@informatik.uni-kl.de

Erhard Rahm

University of Leipzig, GERMANY
rahm@informatik.uni-leipzig.de


Full paper (PostScript version, compressed by gzip, or PDF version)


Abstract

TID hash joins are a simple and memory-efficient method for processing large join queries. They are based on standard hash join algorithms but store only TID/key pairs in the hash table instead of entire tuples. This typically reduces memory requirements by more than an order of magnitude, which brings substantial benefits. In particular, the performance of joins on gigabyte-sized relations can be improved substantially because far less disk I/O is needed. Furthermore, efficient processing of mixed multi-user workloads consisting of both join queries and OLTP transactions is supported. We present a detailed simulation study to analyze the performance of TID hash joins. In particular, we identify the conditions under which TID hash joins are most beneficial. Furthermore, we compare TID hash joins with adaptive hash join algorithms that have been proposed to deal with mixed workloads.
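The core idea in the abstract, keeping only TID/key pairs in the hash table, can be illustrated with a minimal in-memory sketch. This is not the algorithm evaluated in the paper: the function name `tid_hash_join`, the use of list positions as TIDs, and the sort before fetching tuples are all assumptions made for illustration.

```python
from collections import defaultdict


def tid_hash_join(build, probe, build_key, probe_key):
    """Equi-join two relations given as lists of tuples.

    Assumption: a tuple's TID is simply its index in the list, standing in
    for a disk address (page number, slot) in a real system.
    """
    # Build phase: the hash table maps join key -> list of TIDs.
    # Whole tuples are never stored here, which is what saves memory.
    table = defaultdict(list)
    for tid, row in enumerate(build):
        table[row[build_key]].append(tid)

    # Probe phase: record matching (build TID, probe TID) pairs only.
    tid_pairs = []
    for probe_tid, row in enumerate(probe):
        for build_tid in table.get(row[probe_key], ()):
            tid_pairs.append((build_tid, probe_tid))

    # Materialization phase: fetch the full tuples by TID.
    # Sorting by build TID is an illustrative choice that mimics
    # fetching build tuples in storage order.
    tid_pairs.sort()
    return [build[b] + probe[p] for b, p in tid_pairs]


# Hypothetical sample data: departments (id, name) and employees (name, dept_id).
departments = [(1, "db"), (2, "os")]
employees = [("ann", 1), ("bob", 2), ("cy", 1)]
joined = tid_hash_join(departments, employees, build_key=0, probe_key=1)
```

The trade-off visible in the sketch is that the hash table stays small, but result tuples have to be fetched by TID in a separate step after probing.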


Published in: Proc. 3rd Int. Conf. on Information and Knowledge Management (CIKM'94), Gaithersburg, MD, Nov. 1994, pp. 42-49.