Efficient Execution of Multi-Query Data Analysis Batches Using Compiler Optimization Strategies
dc.contributor.author | Andrade, Henrique | en_US |
dc.contributor.author | Aryangat, Suresh | en_US |
dc.contributor.author | Kurc, Tahsin | en_US |
dc.contributor.author | Saltz, Joel | en_US |
dc.contributor.author | Sussman, Alan | en_US |
dc.date.accessioned | 2004-05-31T23:30:50Z | |
dc.date.available | 2004-05-31T23:30:50Z | |
dc.date.created | 2003-07 | en_US |
dc.date.issued | 2003-08-01 | en_US |
dc.description.abstract | This work investigates the leverage that can be obtained from compiler optimization techniques for efficient execution of multi-query workloads in data analysis applications. Our approach is to address multi-query optimization at the algorithmic level by transforming a declarative specification of scientific data analysis queries into a high-level imperative program that can be made more efficient by applying compiler optimization techniques. These techniques -- including loop fusion, common subexpression elimination and dead code elimination -- are employed to allow data and computation reuse across queries. We describe a preliminary experimental analysis on a real remote sensing application that is used to analyze very large quantities of satellite data. The results show our techniques achieve sizable reduction in the amount of computation and I/O necessary for executing query batches and in average executing times for the individual queries in a given batch. (UMIACS-TR-2003-76) | en_US |
dc.format.extent | 550399 bytes | |
dc.format.mimetype | application/postscript | |
dc.identifier.uri | http://hdl.handle.net/1903/1300 | |
dc.language.iso | en_US | |
dc.relation.isAvailableAt | Digital Repository at the University of Maryland | en_US |
dc.relation.isAvailableAt | University of Maryland (College Park, Md.) | en_US |
dc.relation.isAvailableAt | Tech Reports in Computer Science and Engineering | en_US |
dc.relation.isAvailableAt | UMIACS Technical Reports | en_US |
dc.relation.ispartofseries | UM Computer Science Department; CS-TR-4507 | en_US |
dc.relation.ispartofseries | UMIACS; UMIACS-TR-2003-76 | en_US |
dc.title | Efficient Execution of Multi-Query Data Analysis Batches Using Compiler Optimization Strategies | en_US |
dc.type | Technical Report | en_US |