Learning Response Time for WebSources using Query Feedback and
Application in Query Optimization

Gruser, Jean-Robert; Raschid, Louiqa; Zadorozhny, Vladimir

Learning Response Time for WebSources using Query Feedback and Application in Query Optimization

dc.contributor.author	Gruser, Jean-Robert	en_US
dc.contributor.author	Raschid, Louiqa	en_US
dc.contributor.author	Zadorozhny, Vladimir	en_US
dc.date.accessioned	2004-05-31T21:08:06Z
dc.date.available	2004-05-31T21:08:06Z
dc.date.created	1998-11	en_US
dc.date.issued	1998-11-12	en_US
dc.description.abstract	The rapid growth of the Internet and support for interoperability protocols has increased the number of Web accessible sources, WebSources. Current optimization technology for wrapper mediator architectures needs to be extended to estimate the response time (delays) to access WebSources and to use this delay in query optimization. In this paper, we present a Multi-Dimensional Table (MDT), a tool that is based on learning using query feedback from WebSources. We describe the MDT learning algorithms, and report on the MDT learning for WebSources. The MDT uses dimensions Time of day, Day, and Quantity of data, to learn response times from a particular WebSource, and to predict the expected response time (delay), and a confidence in this prediction, for some query. Experiment data was collected from several WebSources and analyzed, to determine those dimensions that were significant in estimating the response time for particular WebSources. Our research shows that we can improve the quality of learning by tuning the MDT features, e.g., including significant dimensions in the MDT, or changing the ordering of dimensions. We then demonstrate how the MDT prediction of delay may be used by a scrambling enabled optimizer. A scrambling algorithm identifies some critical points of delay, where it makes a decision to scramble (modify) a plan, to attempt to hide the expected delay by computing some other part of the plan that is unaffected by the delay. We explore the space of real delay at a WebSource, versus the MDT prediction of this delay, with respect to critical points of delay in specific plans. We identify those cases where MDT overestimation or underestimation of the real delay results in a penalty in the scrambling enabled optimizer, and those cases where there is no penalty. Using the experimental data and MDT learning, we test how good the MDT is in minimizing these penalties. Also cross-referenced as UMIACS TR #98-64	en_US
dc.format.extent	1156410 bytes
dc.format.mimetype	application/postscript
dc.identifier.uri	http://hdl.handle.net/1903/501
dc.language.iso	en_US
dc.relation.isAvailableAt	Digital Repository at the University of Maryland	en_US
dc.relation.isAvailableAt	University of Maryland (College Park, Md.)	en_US
dc.relation.isAvailableAt	Tech Reports in Computer Science and Engineering	en_US
dc.relation.isAvailableAt	Computer Science Department Technical Reports	en_US
dc.relation.ispartofseries	UM Computer Science Department; CS-TR-3952	en_US
dc.title	Learning Response Time for WebSources using Query Feedback and Application in Query Optimization	en_US
dc.type	Technical Report	en_US

Files

Original bundle

Now showing 1 - 2 of 2

Name:: CS-TR-3952.ps
Size:: 1.1 MB
Format:: Postscript Files

Download

Name:: CS-TR-3952.pdf
Size:: 484.55 KB
Format:: Adobe Portable Document Format
Description:: Auto-generated copy of CS-TR-3952.ps

Download

Collections

Technical Reports of the Computer Science Department