Time and Space Optimization for Processing Groups of Multi-Dimensional Scientific Queries

Loading...
Thumbnail Image

Files

CS-TR-4569.ps (473.79 KB)
No. of downloads: 274
CS-TR-4569.pdf (169.46 KB)
No. of downloads: 1098

Publication or External Link

Date

2004-04-19

Advisor

Citation

DRUM DOI

Abstract

Data analysis applications in areas as diverse as remote sensing and telepathology require operating on and processing very large datasets. For such applications to execute efficiently, careful attention must be paid to the storage, retrieval, and manipulation of the datasets. This paper addresses the optimizations performed by a high performance database system that processes groups of data analysis requests for these applications, which we call queries. The system performs end-to-end processing of the requests, formulated as PostgreSQL declarative queries. The queries are converted into imperative descriptions, multiple imperative descriptions are merged into a single execution plan, the plan is optimized to decrease execution time via common compiler optimization techniques, and, finally, the plan is optimized to decrease memory consumption. The last two steps effectively reduce both the time and
space to execute query groups, as shown in the experimental results.
(UMIACS-TR-2004-14)

Notes

Rights