Update Propagation Strategies for Improving the Quality of Data on the Web

dc.contributor.authorLabrinidis, Alexandrosen_US
dc.contributor.authorRoussopoulos, Nicken_US
dc.contributor.departmentISRen_US
dc.contributor.departmentCSHCNen_US
dc.date.accessioned2007-05-23T10:11:23Z
dc.date.available2007-05-23T10:11:23Z
dc.date.issued2001en_US
dc.description.abstractDynamically generated web pages are ubiquitous today but their high demand for resources creates a huge scalability problem at the servers. Traditional web caching is not able to solve this problem since it cannot provide any guarantees as to the freshness of the cached data. A robust solution to the problem is web materialization, where pages are cached at the web server and constantly updated in the background, resulting in fresh data accesses on cache hits. In this work, we define Quality of Data metrics to evaluate how fresh the data served to the users is. We then focus on the update scheduling problem: given a set of views that are materialized, find the best order to refresh them, in the presence of continuous updates, so that the overall Quality of Data (QoD) is maximized. We present a QoD-aware Update Scheduling algorithm that is adaptive and tolerant to surges in the incoming update stream. We performed extensive experiments using real traces and synthetic ones, which show that our algorithm consistently outperforms FIFO scheduling by up to two orders of magnitude.en_US
dc.format.extent835994 bytes
dc.format.mimetypeapplication/pdf
dc.identifier.urihttp://hdl.handle.net/1903/6236
dc.language.isoen_USen_US
dc.relation.ispartofseriesISR; TR 2001-23en_US
dc.relation.ispartofseriesCSHCN; TR 2001-15en_US
dc.subjectGlobal Communication Systemsen_US
dc.titleUpdate Propagation Strategies for Improving the Quality of Data on the Weben_US
dc.typeTechnical Reporten_US

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
TR_2001-23.pdf
Size:
816.4 KB
Format:
Adobe Portable Document Format