Time and Space Optimization for Processing Groups of Multi-Dimensional Scientific Queries

Thumbnail Image
Files
CS-TR-4569.ps(473.79 KB)
No. of downloads: 259
CS-TR-4569.pdf(169.46 KB)
No. of downloads: 833
Publication or External Link
Date
2004-04-19
Authors
Aryangat, Suresh
Andrade, Henrique
Sussman, Alan
Advisor
Citation
DRUM DOI
Abstract
Data analysis applications in areas as diverse as remote sensing and telepathology require operating on and processing very large datasets. For such applications to execute efficiently, careful attention must be paid to the storage, retrieval, and manipulation of the datasets. This paper addresses the optimizations performed by a high performance database system that processes groups of data analysis requests for these applications, which we call queries. The system performs end-to-end processing of the requests, formulated as PostgreSQL declarative queries. The queries are converted into imperative descriptions, multiple imperative descriptions are merged into a single execution plan, the plan is optimized to decrease execution time via common compiler optimization techniques, and, finally, the plan is optimized to decrease memory consumption. The last two steps effectively reduce both the time and space to execute query groups, as shown in the experimental results. (UMIACS-TR-2004-14)
Notes
Rights