The Dwarf Data Cube Eliminates the Highy Dimensionality Curse
dc.contributor.author | Sismanis, Yannis | en_US |
dc.contributor.author | Roussopoulos, Nick | en_US |
dc.date.accessioned | 2004-05-31T23:35:07Z | |
dc.date.available | 2004-05-31T23:35:07Z | |
dc.date.created | 2003-12 | en_US |
dc.date.issued | 2003-12-18 | en_US |
dc.description.abstract | The data cube operator encapsulates all possible groupings of a data set and has proved to be an invaluable tool in analyzing vast amounts of data. However its apparent exponential complexity has significantly limited its applicability to low dimensional datasets. Recently the idea of the dwarf data cube model was introduced, and showed that high-dimensional ``dwarf data cubes'' are orders of magnitudes smaller in size than the original data cubes even when they calculate and store every possible aggregation with 100\% precision. In this paper we present a surprising analytical result proving that the size of dwarf cubes grows polynomially with the dimensionality of the data set and, therefore, a full data cube at 100% precision is not inherently cursed by high dimensionality. This striking result of polynomial complexity reformulates the context of cube management and redefines most of the problems associated with data-warehousing and On-Line Analytical Processing. We also develop an efficient algorithm for estimating the size of dwarf data cubes before actually computing them. Finally, we complement our analytical approach with an experimental evaluation using real and synthetic data sets, and demonstrate our results. UMIACS-TR-2003-120 | en_US |
dc.format.extent | 252169 bytes | |
dc.format.mimetype | application/pdf | |
dc.identifier.uri | http://hdl.handle.net/1903/1333 | |
dc.language.iso | en_US | |
dc.relation.isAvailableAt | Digital Repository at the University of Maryland | en_US |
dc.relation.isAvailableAt | University of Maryland (College Park, Md.) | en_US |
dc.relation.isAvailableAt | Tech Reports in Computer Science and Engineering | en_US |
dc.relation.isAvailableAt | UMIACS Technical Reports | en_US |
dc.relation.ispartofseries | UM Computer Science Department; CS-TR-4552 | en_US |
dc.relation.ispartofseries | UMIACS; UMIACS-TR-2003-120 | en_US |
dc.title | The Dwarf Data Cube Eliminates the Highy Dimensionality Curse | en_US |
dc.type | Technical Report | en_US |
Files
Original bundle
1 - 1 of 1