This paper presents an efficient algorithm for processing distributed queries in the presence of partition dependencies. For a given query, the algorithm first partitions the referenced relations into a number of non-exclusive subsets such that the join operations among the relations in each subset can be processed locally without data transfer. Each subset is associated with a set of processing sites and can be used to generate an execution plan for the query. The algorithm then determines the set of referenced fragmented relations outside the subset for which only the fragments (instead of the whole relation) need to be replicated at the processing sites. The other referenced relations are duplicated at each of the processing sites. Among the alternative plans, the algorithm picks the one that gives the minimum response time for the query. Experimental results show that the proposed algorithm significantly improves the performance of distributed query processing.
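The plan-selection step described above can be illustrated with a small Python sketch. This is not the paper's algorithm or cost model: the names `Relation`, `Candidate`, `response_time`, and `pick_plan` are hypothetical, the candidate subsets are assumed to be given rather than derived from partition dependencies, and response time is approximated by the amount of data each processing site must receive.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Relation:
    name: str
    size: float          # total size in arbitrary units (assumed known)
    fragment_only: bool  # assumption: True if shipping fragments suffices


@dataclass(frozen=True)
class Candidate:
    local: frozenset     # names of relations joined locally, no transfer
    sites: int           # number of processing sites for this subset


def response_time(cand, relations):
    """Toy cost: data volume received by one processing site.

    Sites are assumed to work in parallel and receive equal shares,
    so the per-site volume stands in for the response time.
    """
    per_site = 0.0
    for rel in relations:
        if rel.name in cand.local:
            continue  # already co-located, nothing to ship
        if rel.fragment_only:
            # only the matching fragment goes to each site
            per_site += rel.size / cand.sites
        else:
            # the whole relation is duplicated at every site
            per_site += rel.size
    return per_site


def pick_plan(candidates, relations):
    """Return the candidate with the smallest toy response time."""
    return min(candidates, key=lambda c: response_time(c, relations))


relations = [
    Relation("R", 100.0, fragment_only=False),
    Relation("S", 80.0, fragment_only=True),
    Relation("T", 20.0, fragment_only=False),
]
candidates = [
    Candidate(frozenset({"R", "S"}), sites=4),
    Candidate(frozenset({"S", "T"}), sites=2),
    Candidate(frozenset({"R"}), sites=4),
]
best = pick_plan(candidates, relations)
```

In this made-up example the first candidate wins because only the small relation `T` has to be duplicated, whereas the second candidate would have to duplicate the large relation `R` at every site. A realistic cost model would also account for local join cost and network characteristics, which the sketch leaves out.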
Published in: Proceedings of the Seventh International Conference on Parallel and Distributed Systems, 2000
Date of Conference: 2000