100 likes | 123 Views
This study compares ESI and Class-Based Delta Encoding solutions for dynamic web content delivery, addressing server load and network traffic issues. Simulation results and performance metrics are discussed for various page types.
E N D
Evaluation of Delivery Techniques for Dynamic Web Content Mor Naaman, Hector Garcia-Molina, Andreas Paepcke Department of Computer Science Stanford University {mor, hector, paepcke}@cs.stanford.edu http://www-db.stanford.edu/
Problems with Dynamic Pages • Generation of pages is resource-intensive • Pages are too dynamic, or too personalized, to be cached • Higher load on servers (page generation and delivery) • More network traffic
We Evaluate Two Competing Solutions (Both address at least the network load) • ESI(Oracle, Akamai) • Enables assembly of pages from small fragments • Fragments can be cached on specialized network caches (edge servers) • Fragments are assembled on the edge server • Class Based Delta Encoding • Computes delta of generated page from a chosen base file • Base files can be cached on network caches • Client receives delta from the server and base file from cache; applies delta to base file to get final page
A Page Content Model • Page composed from groups; groups include items. • Page construction modeled as two-phase selection (groups, then items) Groups Items
Our Simulation Test-case web pages: Book pages in Amazon-style website MyYahoo-type personalized pages Personalized stock portfolio pages A simple personalized weather page
=Arrival rate; TTL = item time-to-live; = constant Simulation of ESI • Assuming Zipf-like distribution for groups and items (popularityi=k/i) • Performance highly dependant on (ranging from 0.7-1.5 in our simulations) • Hit rate estimates for items: Sample simulation results (bookstore-type resource, With “backend” servers) Hit-rate vs. value of Zipfian parameter Traffic vs. TTL
Class-Based Delta Encoding Simulation • For some pages, client likely to be able to re-use base files Traffic vs. number of base files • For other pages, client-cache link traffic is higher than before. To minimize client traffic, use same base file owned by client if delta is larger than threshold Traffic vs. Same-Base threshold
Sample Comparison Numbers MyYahoo-type pages Amazon-style Book pages
Conclusions Excellent *, Good +, Bad -, Sometimes ~ All the details: http://dbpubs.stanford.edu/pub/2003-7