Optimizing Retrieval and Processing of Multi-dimensional Scientific Datasets

dc.contributor.authorChang, Chialinen_US
dc.contributor.authorKurc, Tahsinen_US
dc.contributor.authorSussman, Alanen_US
dc.contributor.authorSaltz, Joelen_US
dc.date.accessioned2004-05-31T23:01:46Z
dc.date.available2004-05-31T23:01:46Z
dc.date.created2000-02en_US
dc.date.issued2000-02-02en_US
dc.description.abstractExploring and analyzing large volumes of data plays an increasingly important role in many domains of scientific research. We have been developing the Active Data Repository (ADR), an infrastructure that integrates storage, retrieval, and processing of large multi-dimensional scientific datasets on distributed memory parallel machines with multiple disks attached to each node. In earlier work, we proposed three strategies for processing range queries within the ADR framework. Our experimental results show that the relative performance of the strategies changes under varying application characteristics and machine configurations. In this work we investigate approaches to guide and automate the selection of the best strategy for a given application and machine configuration. We describe analytical models to predict the relative performance of the strategies when input data elements are uniformly distributed in the attribute space of the output dataset, restricting the output dataset to be a regular $d$-dimensional array. We present an experimental evaluation of these models for various synthetic datasets and for several driving applications on a 128-node IBM SP. (Also cross-referenced as UMIACS-TR-2000-03)en_US
dc.format.extent971713 bytes
dc.format.mimetypeapplication/postscript
dc.identifier.urihttp://hdl.handle.net/1903/1053
dc.language.isoen_US
dc.relation.isAvailableAtDigital Repository at the University of Marylanden_US
dc.relation.isAvailableAtUniversity of Maryland (College Park, Md.)en_US
dc.relation.isAvailableAtTech Reports in Computer Science and Engineeringen_US
dc.relation.isAvailableAtUMIACS Technical Reportsen_US
dc.relation.ispartofseriesUM Computer Science Department; CS-TR-4101en_US
dc.relation.ispartofseriesUMIACS; UMIACS-TR-2000-03en_US
dc.titleOptimizing Retrieval and Processing of Multi-dimensional Scientific Datasetsen_US
dc.typeTechnical Reporten_US

Files

Original bundle
Now showing 1 - 2 of 2
No Thumbnail Available
Name:
CS-TR-4101.ps
Size:
948.94 KB
Format:
Postscript Files
Loading...
Thumbnail Image
Name:
CS-TR-4101.pdf
Size:
207.81 KB
Format:
Adobe Portable Document Format
Description:
Auto-generated copy of CS-TR-4101.ps