Performance of the pipelined hash-join algorithm in a heterogeneous distributed environment

1 Author(s)
Khan, Z.S. ; Dept. of Math. & Comput. Sci., Bloomsburg Univ., PA, USA

A pipelined distributed parallel hash-join algorithm is executed in a distributed heterogeneous supercomputing environment consisting of the Connection Machine CM2 and the Cray C90. The algorithm implements the computationally intensive join operation of relational databases. The hash and join phases are each executed on the architecture determined to be best suited for them: the hash phase runs on the Cray C90, and the join phase runs on the CM2. The hashed data sets of the first join relation are transmitted from the Cray to the CM2. A pipeline is then established between the two machines: the Cray continues to hash each page of the second join relation and transmits it to the CM2, where the join is performed. The limited improvements in performance of the pipelined algorithm are analyzed for different combinations of data sizes, data distributions, and join sizes, and the limitations of the distributed environment are discussed.
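The pipeline described in the abstract can be illustrated with a minimal single-process sketch. This is not the paper's implementation. It assumes a producer thread stands in for the machine that hashes pages (the Cray C90 in the abstract), the main thread stands in for the machine that joins (the CM2), and a bounded queue stands in for the network link. All function names, the page size, and the bucket count are hypothetical choices made for illustration.

```python
import queue
import threading
from collections import defaultdict


def pages(rows, page_size):
    """Split a relation (a list of tuples) into fixed-size pages."""
    for i in range(0, len(rows), page_size):
        yield rows[i:i + page_size]


def hash_page(page, key_index, num_buckets):
    """Hash phase: tag every tuple in a page with its bucket number."""
    return [(hash(row[key_index]) % num_buckets, row) for row in page]


def pipelined_hash_join(r, s, key_r=0, key_s=0, page_size=2, num_buckets=4):
    """Equi-join r and s on r[key_r] == s[key_s] (illustrative sketch)."""
    # Step 1: hash the whole first relation and load it into buckets
    # on the "join side" before the pipeline starts.
    table = defaultdict(list)
    for page in pages(r, page_size):
        for bucket, row in hash_page(page, key_r, num_buckets):
            table[bucket].append(row)

    # Step 2: the pipeline. The producer hashes one page of the second
    # relation at a time and sends it over a bounded queue (assumed
    # stand-in for the inter-machine link).
    channel = queue.Queue(maxsize=2)
    done = object()  # sentinel marking the end of the second relation

    def producer():
        for page in pages(s, page_size):
            channel.put(hash_page(page, key_s, num_buckets))
        channel.put(done)

    worker = threading.Thread(target=producer)
    worker.start()

    # Step 3: the consumer probes each arriving page against the buckets
    # of the first relation while the producer hashes the next page.
    results = []
    while True:
        hashed = channel.get()
        if hashed is done:
            break
        for bucket, s_row in hashed:
            for r_row in table.get(bucket, []):
                if r_row[key_r] == s_row[key_s]:
                    results.append(r_row + s_row)
    worker.join()
    return results


if __name__ == "__main__":
    r = [(1, "a"), (2, "b"), (3, "c")]
    s = [(2, "x"), (3, "y"), (3, "z"), (4, "w")]
    for row in pipelined_hash_join(r, s):
        print(row)
```

The bounded queue means the hashing side blocks when the joining side falls behind, which is one simple way to model the point that a slower stage or link limits how much the pipeline can gain.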

Published in:

Parallel and Distributed Processing, 1998. PDP '98. Proceedings of the Sixth Euromicro Workshop on

Date of Conference:

21-23 Jan 1998