Multiple Query Optimization For Data Analysis Applications on Clusters of SMPs

Loading...
Thumbnail Image

Files

CS-TR-4300.ps (578.82 KB)
No. of downloads: 545
CS-TR-4300.pdf (175.57 KB)
No. of downloads: 1183

Publication or External Link

Date

2001-11-21

Advisor

Citation

DRUM DOI

Abstract

This paper is concerned with the efficient execution of multiple query workloads on a cluster of SMPs. We target applications that access and manipulate large scientific datasets. Queries in these applications involve user-defined processing operations on data and distributed data structures to hold intermediate and final results. Our goal is to implement system components to leverage previously computed query results and to effectively utilize processing power and aggregated I/O bandwidth on SMP nodes so that both single queries and multi-query batches can be efficiently executed. (Also referenced as UMIACS-TR-2001-78)

Notes

Rights