Web Server QoS Management by Adaptive Content Delivery

Web Server QoS Management by Adaptive Content Delivery Tarek F. Abdelzaher and Nina Bhatti Quality of Service, 1999. IWQoS '99. 1999 Seventh International Workshop on , 1999 September 26 2000 Chul Lee

Introduction (1/2) • Today’s web servers • Offer poor performance under overload • Have no means for prioritizing requests • Have no mechanism for pre-allocating end-system capacity to a particular site or hosted service

Introduction (2/2) • Overload protection • Load balancing and admission control • Multicast • Content adaptation • The compression of Images(65% of the total bytes, e-commerce site) • Reducing # of embedded objects per page • Reducing local links • Multiple content trees • /full_contentand/degraded_content • Static and dynamic

QoS Adaptation Architecture (1/4) • Content adaptation layer • Decides on “right” content tree • Prevent underutilization or overload • Server load monitoring & server utilization control Web Server Process Request with Modified URL Content Adaptation Layer Load Monitor & Utilization Cotrol Response Request Communication subsystem

QoS Adaptation Architecture (2/4) • Load Monitoring • To quantify server utilization • The request service time (a URL of size x) • T(x) = a + bx (a : fixed overhead comp. b : data-size dep. comp.) • System utilization • U = aR + bW (R : request rate, W : delivered BW) • Determine a and b off-line

QoS Adaptation Architecture (3/4) • Utilization Control • The Content Adaptor • M content trees • G : the severity of the adaptation action required from the adaptor • G = M : all requests served the highest quality content • G = 0 : all requests must be rejected • H() : hashing function, maps a given client id to the same number every time

QoS Adaptation Architecture (4/4) • The Utilization Controller • A good value : 85% • Use well-known integral controller

QoS Management (1/3) • Performance Isolation • Service Differentiation • Excess Capacity Sharing

QoS Management (2/3) • Performance Isolation • A virtual server • A web server can host multiple independent sites • Associate a virtual server with each hosted site • Capacity planning • Load Classification • Utilization control

QoS Management (3/3) • Service Differentiation • Support client prioritization lower priority clients are degraded first • The capacity should be made available to clients in priority order • Sharing Excess Capacity • The excess capacity is made available to other virtual servers

Evaluation (1/5) • Environments • Testing tool : httperf • Clients : 4 WSs, connected to 100M switched ethernet • Estimating Service Time • T(x) = a + bx, a = 1.604, b= 0.063

Evaluation (2/5) • Request Rejection Overhead • To quantify the rejection overhead • The server rejects all requests by closing the connection as soon as the request is read off the server socket • The maximum rate was found : 900 reqs/s  1.1ms/req (cf. a = 1.604) • Rejecting a set of requests will consume almost 70% of the resources it would take to serve them a short URL

Evaluation (3/5) • Performance Isolation • Non-guaranteed background best-effort traffic • To overload the machine • 300req/s(for 32KB URLs) • Server V1 • Guaranteed BW : 13Mb/s • Maximum guaranteed rate : 50req/s • Server V2 • Guaranteed BW : 27Mb/s • Maximum guaranteed rate : 100req/s

Evaluation (4/5) • Service Differentiation • 2 classes • B : basic class • increasing • P : premium class • 100 req/s

Evaluation (5/5) • Excess Capacity Sharing • V1 • 13 Mb/s, 100req/s • allowed to overrun its capacity • Increased gradually 0-250req/s • V2 • 27 Mb/s, 100req/s • Held constant 100req/s

Conclusion • Conclusion • Content adaptation enables a server to provide a smooth range of client degradation • Performance isolation • Service differentiation • Sharing excess capacity • Future Work • Handling and adapting dynamic content : unpredictability of CGI • HTTP 1.1 : persistent connection • VOD server • Scalable video encoding schemes  to avoid multiple copies • Appropriate content authoring and management tools to preprocess web contents

Web Server QoS Management by Adaptive Content Delivery

Web Server QoS Management by Adaptive Content Delivery

Presentation Transcript

Web Content Management System

Web Content Delivery Networks

HTTP and Web Content Delivery

Web Content Management

Adaptive Personalization for Mobile Content Delivery

Web Content Management Update

Informed Content Delivery Across Adaptive Overlay Networks

On the Effect of Server Adaptation for Web Content Delivery

Web Content Management

A formal approach to adaptive content delivery

Web Content Management

Content Management Server powered by Semantic Web Search

Web content management Services

Web Caching and Content Delivery

Informed Content Delivery Across Adaptive Overlay Networks

Adaptive Personalization for Mobile Content Delivery

Web Content Management Market

Web Content Management System

Web Content Management System