XJoin: Getting Fast Answers From Slow and Bursty Networks

dc.contributor.authorUrhan, Tolgaen_US
dc.contributor.authorFranklin, Michael J.en_US
dc.date.accessioned2004-05-31T22:56:12Z
dc.date.available2004-05-31T22:56:12Z
dc.date.created1999-02en_US
dc.date.issued1999-02-26en_US
dc.description.abstractThe combination of increasingly ubiquitous Internet connectivity and advances in heterogeneous and semi-structured databases has the potential to enable database-style querying over data from sources distributed around the world. Traditional query processing techniques, however, fail to deliver acceptable performance in such a scenario for two main reasons: First, they optimize for delivery of the entire query result, while on-line users would typically benefit from receiving initial results as quickly as possible. Second, slow or bursty delivery of data from remote sources can stall query execution, making the already inadequate batch-like behavior even worse. Both of these problems can be addressed using fully pipelined query execution. The symmetric hash join operator supports such pipelining, but it requires all base data and intermediate results to be memory-resident, which is unacceptable for complex queries over large datasets. In this paper we present a multi-threaded extension of the symmetric hash join, called XJoin, that can execute effectively with far less memory. By reactively scheduling background processing, XJoin hides intermittent delays in data arrival to produce more tuples earlier. XJoin includes a very efficient, on-the-fly algorithm for preventing duplicates from being created by its independently running threads. We have implemented the XJoin operator and added it to the PREDATOR Object-Relational DBMS. Using this implementation along with traces obtained by monitoring Internet data delivery, we show that XJoin is an effective solution for providing fast query responses to users even in the presence of slow and bursty remote sources. (Also cross-referenced as UMIACS-TR-99-13)en_US
dc.format.extent398309 bytes
dc.format.mimetypeapplication/postscript
dc.identifier.urihttp://hdl.handle.net/1903/997
dc.language.isoen_US
dc.relation.isAvailableAtDigital Repository at the University of Marylanden_US
dc.relation.isAvailableAtUniversity of Maryland (College Park, Md.)en_US
dc.relation.isAvailableAtTech Reports in Computer Science and Engineeringen_US
dc.relation.isAvailableAtUMIACS Technical Reportsen_US
dc.relation.ispartofseriesUM Computer Science Department; CS-TR-3994en_US
dc.relation.ispartofseriesUMIACS; UMIACS-TR-99-13en_US
dc.titleXJoin: Getting Fast Answers From Slow and Bursty Networksen_US
dc.typeTechnical Reporten_US

Files

Original bundle
Now showing 1 - 2 of 2
No Thumbnail Available
Name:
CS-TR-3994.ps
Size:
388.97 KB
Format:
Postscript Files
Loading...
Thumbnail Image
Name:
CS-TR-3994.pdf
Size:
271.19 KB
Format:
Adobe Portable Document Format
Description:
Auto-generated copy of CS-TR-3994.ps