520 likes | 694 Views
Wayne State University. Quality Assurance and Adaptation: A Key to Next Generation of Stress-Resilient Internet Services. Cheng-Zhong Xu Cluster & Internet Computing Lab Dept of Electrical/Computer Engineering Wayne State University. http://www.cic.eng.wayne.edu. Overview of Research.
E N D
Wayne State University Quality Assurance and Adaptation: A Key to Next Generation of Stress-Resilient Internet Services Cheng-Zhong Xu Cluster & Internet Computing Lab Dept of Electrical/Computer Engineering Wayne State University http://www.cic.eng.wayne.edu
Overview of Research Cluster & Internet Computing Lab http://cic.eng.wayne.edu
Ubiquitous Access Pervasive Internet Services • New communication services • Email, Chat, Instant Message • Voice, Telephony, Video conf. • New information services • News, stock, weather, etc • Location-aware: ATM, restaurant, parking • Mobility-aware: banking, ticketing, etc • Services accessible anytime and anywhere QoS Assurance
Characteristics • Diversity • Diverse Access Networks: • PSTN, Bluetooth, Cellular, DSL, Cable, LAN, Satellite, etc • Diverse Access Devices • PDA, phone, computer, “Dick Tracy” watch, etc • Resource-constrained • Info processing capacity: cpu, memory • Storage, networking, • Battery power, etc • Mobile • Mobility is an inherent nature of human being, moving toward resource or away from scarcity. • User (device) and computation QoS Assurance
MAPS Solution @ CIC Group • MAPS: System Support for Mobility and Adaptation in Pervasive Services • Desgin Goals: • Scalable and Secure Service Arch. • Rapid development/deployment of new services • Mobility Support: access on-the-move • User/Device (physical) vs Computation (logical) • Adaptation: proactive in response to change • user requirements, preferences, • available resources and operation conditions QoS Assurance
service overlay network MAPS Ongoing Projects servers clients Cluster-based Internet services Connection migration in mobile comp. P2P file sharing and load balancing Client-aware streaming service adaptation Mobile codes for network appl Energy-aware RM in mobile & embedded sys Service quality assurance and adaptation Service migration for adaptive grid QoS Assurance
Quality Assurance and Adaptation: A Key to Next Generation of Stress-Resilient Internet Services
Outline • User-Perceived Quality of Service • The Problem and Related work • Approach I: Model Predictive Control • Approach II: Model-Free Self-tuning Fuzzy Control • Performance Evaluation • Summary QoS Assurance
User-Perceived QoS • Client-perceived response time includes network transfer time and server delay and processing time • Network alone is not sufficient to support end-to-end QoS assurance www.wayne.edu delay processing time QoS Assurance
Critical Path Analysis • Early studies (Barford and Croella, 2001) showed • For large files (>500K), user-perceived delay mostly came from network delay • For small files (~50K), server-side delay constituted up to 80% latency • Network/Systems trends • Over-provisioning of network bandwidth makes QoS failure rare in network core • Servers are more vulnerable to congestion and perf. loss. • Due to open access nature of Internet services • Caused by flash crowd-like DDoS attacks QoS Assurance
Our Experience on PlanetLab • Run Apache server at Wayne State with various load • Access from clients in North America and Europe • Server-side delay becomes the dominant factor when the system utilization reaches 50% QoS Assurance
Objectives • QoS Assurance and Adaptation on Servers • QoS-aware resource management to achieve guaranteed perf. and resilience even in the face of system stress. • Observe and respond to per-class traffic change • Graceful performance degradation • In contrast to best-effort, same service to all model • Perspectives for QoS assurance • On an indiscriminate Web site • Control behaviors of aggressive clients for fairness • Protect servers from flash-crowd like DDoS attack • On an e-commerce site • Give higher priority to sessions of buyers than visitors, without over-compromising the needs of occasional visitors • Guarantee the perf of purchase requests when the server is stressed. QoS Assurance
Q1 IP Network Q2 IP Network Dispatcher … … Central queue QN Queueing delay Problem Statement • QoS control over requests in different classes • Schedule requests for processing so as to provide predictable and controllable fair-sharing (PCF) services • Predictability: schedules must be consistent, independent of variations of the class workloads • Controllability: controllable parameters to adjust quality factors between classes • Fairness: lower classes not be over-compromised, especially when workload is high QoS Assurance
Related work • QoS-aware admission control • Early random dropping (Chen & Mohapoatra, 1999) • Feedback control to bound utilization (Abdelzaher et al. 02) • Session-based AC (Cherkasova & Phaal, 2002) • On/off AC model doesn’t support performance graceful degradation • Priority-based request scheduling • Differentiate QoS between different classes of requests by setting priorities (Almeida et al, 98, Eggert, et al 99) • No guarantee of absolute/relative QoS • Processing rate allocation • Queueing-model based: calculate resource amount based on a queueing model w.r.t. processing delay (Cardellini01, Zhu01, Pradhan02, Zhou04) • However, it relies on an accurate server model: • Mean-value analysis provides control over average quality of requests in a long run, but unable to control their QoS variance • Model predictive feedback control QoS Assurance
QoS Assurance • Client-Perceived QoS Assurance • Related work • Approach I: Model Predictive Control • Approach II: Model-Free Self-tuning Fuzzy Control • Performance Evaluation • Summary QoS Assurance
Model Predictive Feedback Control • MPFC = queuing model + feedback control • Queueing model to estimate a processing rate • Feedback control to deal with the impact of traffic self-similarity and bustiness • Performance metric: Slowdown • Slowdown = Queuing delay/Service time • Requests have different service time; users tend to tolerate long delays for “large” requests QoS Assurance
MPFC Resource Allocation • Classifier determines requests’ classes • Scheduler dispatches requests to server based on classes’ allocated processing rate • QoS controller adjusts a class’s rate according to measured system conditions QoS Assurance
Queueing Analysis of Slowdown • Performance Metric: Slowdown • Slowdown = Queuing delay (W) /Service time (X) • For general M/G/1 FCFS, with bounded Pareto service-time distribution • Expected slowdown S is QoS Assurance
Subject to : processing rate of class i : differentiation parameter of class i Proportional Slowdown Differentiation Determine processing rate Ci for each class so that the slowdown Si is proportional to its target quality factor δi: QoS Assurance
Queueing Model-based Estimates Processing rate of class i is First term: baseline rate of class i • prevents the class from being overloaded Second term: portion of surplus rate • determined by its normalized arrival rate • controls quality differences between classes QoS Assurance
Properties of the Solution • [Controllability] Differential weight of a class increases, its quality factor increases • [Self-adaptability]Quality factor of a class drops with the increase of its arrival rate • Resilience to flash crowd-like DDoS attacks, load surge, etc • Guarantee good, block bad, and slowdown suspicious ones • [Self-management] Load decrease of a higher-weighted class causes a big quality increase of others. Per-class quality factor: QoS Assurance
Simulation Results 95th-5th = 25 Target = 8 • Simulation setting: expo arrival, bounded Pareto service distribution for each traffic class • Targets are achieved on average • Large variance unstable quality QoS Assurance
Why large variance? • Web traffic is dynamic in nature • Processing rate is calculated based on estimated arrival rate using history • Estimation is inaccurate Sum of errors ≈ 0, achieve target ratio on average QoS Assurance
Basic Ideas of MPFC • Adjust a class’s processing rate according to errors (feedback) and estimated arrival rate (queueing) • Classical integral feedback control • Adjust service rate proportional to the errors integrated over time • No steady-state error and insensitive to measurement noises • A long process delay poses a severe instability issue • From the perspective of feedback control, a model-based estimate tackles the instability issue. QoS Assurance
Structure of MPFC • Rate predictor: estimates a class’s processing rate using queueing theory • Feedback controller: adjusts the rate allocation according to errors using integral control QoS Assurance
Definition of Control Loop • Control loop includes • Reference input r(k), output y(k), and error e(k) • Class 1 is the base class • A control loop is associated with every other class Reference input: Loop output: Error: QoS Assurance
Processing Rate using MPFC (queueing theory) Predictor output: Controller output: (integral control) MPFC output: Rate of class i: QoS Assurance
Simulation Results Small variance Target = 8 • MPFC achieves the target consistently in both small and large time scales • It assumes M/Gp/1 server model on requests for single object pages, and aims at retaining slowdown ratio QoS Assurance
18 objects Challenges in QoS Assurance • Dynamics of Internet traffic • No accurate models for requests • Multi-object Web pages • Pageview quality vs request response time • Non-deterministic process delay • Long delay between the resource allocation time and the time when QoS is measured (observed). QoS Assurance
Client-Experienced Pageview QoS • Current queuing models are limited to requests to single objects; no models available for multi-object Web pages • Multi-phase handshaking of HTTP protocol makes it possible to take into account network conditions in resource alloc request-based QoS connection close server last object waiting for new requests object 2 object 1 base page client client-perceived pageview QoS Setup connection QoS Assurance
Presentation Outline • Client-Perceived QoS Assurance • Related work • Approach I: Model Predictive Control • Approach II: Model-Free Self-tuning Fuzzy Control • Performance Evaluation • Summary QoS Assurance
eQoS: Model-Free Self-Tuning Control • It monitors and controls client-perceived end-to-end pageview response time in Web servers • It is a middleware, residing between operating systems and web server software Fuzzy control provides a model-free way to translate heuristic control knowledge into a set of control rules QoS Assurance
Self-tuning fuzzy controller Service rate u(k+1) of a class in sampling period k+1 is adjusted according to its error e(k) and change of error ∆e(k) in previous sampling period k Second level is a fuzzy scaling-factor controller to compensate the effect of process delay First level is a fuzzy resource controller to address the issue of lacking accurate server model QoS Assurance
Resource controller Rule base contains quantified control knowledge about how to adjust a class’s service rate according to the e(k) and ∆e(k). QoS Assurance
Experimental Setting • Implemented as a plugin of Apache http/1.1 on Linux • Testbeds • PlanetLab, world wide distributed testbed • Server in Detroit, Michigan • Clients in Boston (RTT: 45ms) • Clients in San Diego (RTT: 70 ms) • Clients in UK (RTT: 130 ms) • Network simulator (Dummynet) • Random xmission time (RTT, packet loss) • RTT: 40, 80, and 180 ms • Benchmark • Surge workload generator • Maximum number of embedded objects: 150 • Base: 30%, Embedded objects 38%, Loner: 32% • World Cup 98 Trace • Requests replayed by clients from PlanetLab to objects in trace QoS Assurance
Input Traffic Profile • Workload is measured in terms of page requests • Page requests from a class is stochastic and changes frequently QoS Assurance
Transient Behavior of eQoS on PlanetLab (World Cup Trace) Statistical guarantee of the target response time on PlanetLab (Surge) QoS Assurance
Robustness of eQoS Self-adaptive to load change Self-adaptive to net condition QoS Assurance
Performance Comparison • Fuzzy controller without self-tuning • Tradition proportional integral (PI) controller, based on M/G/1 model • Adaptive PI controller (Kamra et al. IWQoS’04) • All controllers are carefully tuned for RTT = 180 ms and load = 700 clients QoS Assurance
Performance Relative to eQoS • eQoS outperforms others in most of test cases • eQoS is slightly worse than static controller only in the case when the latter was best tuned. QoS Assurance
Summary • QoS assurance on Internet Servers • Web server, e-commerce server, streaming servers • User-perceived performance • Slowdown: normalized response time • Response time for multi-object web pages • Model predictive feedback control approach for queueing delays of individual requests, relative to their processing time. • Model-free self-tuning control approach for pageview response time • Robustness in both short and long time scales • Self-adaptive to change of server load • Self-adaptive to network conditions QoS Assurance
Related Publications • Robust processing rate allocation for proportional slowdown diff. on Internet servers, IEEE Trans. on Computers, 2005 • Resource allocation for session-based 2D service differentiation on e-commerce servers, IEEE Trans. on Parallel and Distrib. Systems. 2005. • Harmonic bandwidth allocation for QoS control on streaming servers, IEEE Trans. on Parallel and Distrib. Systems, 2004 • eQoS: Provisioning of client-perceived end-to-end QoS guarantees in Web servers, Proc. of IWQoS’05 • Modeling and analysis of 2-d service differentiation on e-commerce servers, Proc. of IEEE ICDCS 2004 • Processing rate allocation for proportional slowdown differentiation on Internet Servers, Proc. of IPDPS'04 QoS Assurance
Other MAPS Publications • Energy-aware resource management “Energy-aware modeling scheduling of real-time tasks for dynamic voltage scaling”, IEEE RTSS’05 “Delay-constrained energy-efficient wireless packet scheduling”, Globecom’05 • Intelligent personalized info agent and prefetching “Keywords-based semantic prefetching to tolerate Web access latecny”, IEEE TKDE’04 • Continuous media adaptation for service differentiation on steaming servers “Harmonic bandwidth allocation for qos control on streaming servers”, IEEE TPDS’04 • Mobility support for network-centric, data-intensive applications “Naplet: A flexible and reliable mobile agent framework”, IPDPS’02 “Mobile codes and Security”, Handbook of Info Security, John Wiley & Sons, 2005 • Load balancing in a cluster of servers and overlay network “Cycloid: A scalable and constant-degree lookup-efficient P2P overlay network”, Perf. Eval.’06 “Locality-aware randomized load balancing on DHT networks”, ICPP’05, and IPDPS’06 • Service migration for adaptive grid computing “service migration in distributed virtual machines for adaptive grid comp.”, ICPP’04, ICPP’05 • Transparent connection migration in mobile computing A reliable connection migration mechanism for synchronous transient communication between mobile objects. ICPP’04 Scalable and Secure Internet Services and Architecture, Chapman & Hall/CRC Press, June 2005 QoS Assurance
MAPS Project in CIC@WSU • MAPS: System support for mobility and adaptation in pervasive services • Team • C. Xu, Principal Investigator • Visiting/Guest Faculty (3) • X. Zhou, G. Chen, Y.-S. Jeong • PhD Students (7) • J. Wei, H. Shen, X. Zhong, S. Fu, B. Liu, M. Xu, B. Wims, • M.Sc. Thesis Students (5) • A. Brodie, W. Chen, R. Sudhindra, E. Henne, S. Shashidhara, • Funded by • U.S. NSF: ACI-0303592, NASA: 03-OBPR-01-0049 • WSU Research Enhanced Program, Career Development Chair Award http://cic.eng.wayne.edu QoS Assurance
Thanks. Cluster and Internet Computing Laboratory Wayne State University, Detroit, Michigan HTTP://www.cic.eng.wayne.edu QoS Assurance
Rule-base design e(k) < 0 and ∆e(k) < 0 e(k) < 0 and ∆e(k) > 0 2 3 5 4 1 e(k) > 0 and ∆e(k) > 0 e(k) > 0 and ∆e(k) < 0 • Zone 1 and Zone 3: Self-correcting, slowdown/speedup current trend • Zone 2 and Zone 4: Moving away, reverse current trend • Zone 5: small e and ∆e, maintain current trend QoS Assurance
Rule-base design (cont.) • Rules are described as IF-THEN statements using linguistic values • Linguistic values QoS Assurance
Rule-base design (cont.) IF error is NM and change of error is NL, THEN change of service rate is PL QoS Assurance
Scaling factor controller • e(k) is large • e(k) and ∆e(k) have the same sign • Far away from target and moving farther away: large change of resource allocation • Different sign • Moving closer: small change of resource • e(k) is small • Resource change to prevent overshoot or undershoot according to transient states QoS Assurance