460 likes | 482 Views
This research focuses on effective scheduling and workload balancing strategies for cloud computing systems, emphasizing Complex Information Systems (CIS). Explore innovative solutions for resource allocation, management, and system reliability in the face of network attacks and malfunctions. Leveraging optimization techniques, the study delves into discrete resource project scheduling and multi-criteria workload balancing. Discover insights into balancing computational loads and reconfiguring CIS architectures to enhance system efficiency and dependability. Uncover the latest methods for addressing challenges in cloud service delivery and maintaining service quality through reconfiguration scenarios
E N D
Scheduling and Workload Balancing for Clouds Czesław Smutnicki Wrocław University of Science and Technology, Wrocław, Poland czeslaw.smutnicki@pwr.edu.pl www.pwr.edu.pl ICA CON, 2-3 June 2016
Wrocław University of Science and Technology, 100+ years of tradition
Wrocław University of Science and Technology. Technopolis (ICT), 2014. Cloud for education
Wroclaw University of Science and Technology. Virtual digital library, 2014. Cloud for education and repository
Presentation Schedule • Clouds and CIS • Resources • Optimization • Off-line models • On-line models • Criteria evaluation • Calculations • Proposals • Conclusion
Computational Cloud (CC) • infrastructure (hardware, operational platform, software, services, broadband, communication links, etc.), • accessible transparently, remotely and virtually through the network, • eliminates the need for maintaining single-handedly expensive computing resources, • sharing the strong power physical resources ensured by a CC provider among large number of users, • offers reduced cost of service, • ensures rational management of computing goods in case of solving very hard problems derived from science and practice CC offers: for scientists - a cheaper alternative to clusters, grids, and supercomputers to solve hard problems computationally expensive; for education – convenient tool providing e-platform; for mass users - the powerful tool to solve quickly applicative problems for the commercial and business usage depending on the client profile.
Complex/Distributed Information Systems (CIS) Efficient distribution of various types of ICT resources is crucial for the efficiency of management of CC systems, as well as for their business justification. CC allows on direct application of Complex Information System (CIS) in a broad spectrum of this meaning. We understand here CIS as special class of computer system composed of huge number of workstations (clients), distributed servers of contents or services and the net linking all these players. Generally, it works in the mode question-and-answer, although some deputed task directed to a server can be split and sub-ordered to next servers.
CC and CIS resources and their distribution One can consider CIS as CC resources dispersed among nodes in the net, called sometimes SaaS cloud computing model with centralized (balanced) management or web-based service. It is clear that events in this system have discrete character, with high dynamics of changes, no-predictive (random) set of coming events and huge size of dimension. From modeling point of view CIS can be perceived as the non-stationary mixed open/closed queening system with queues of limited length, each of which has set its own service policy. Such complex system cannot be analyzed analytically, because of insufficient power of theoretical methods. Then, many researchers consider simulation as thesuitable tool for describing and analyzing behavior of CIS.
Resource allocation and management Managing the system resources lead us to several problems considered commonly in the literature as discrete resource project constrained scheduling and workload balancing optimization. Depending on the character of introduced model, destination of the cloud (public, private, educational, etc.), a lot of formal models and solution algorithms were proposed. It is based on, among others, discrete optimization technology, stochastic processes, on-line algorithms and multi-criteria analysis. A large amount of approaches were proposed and analyzed in recent years.
CC and CIS. Constraints. Scenarios. Beside mentioned aspects, a lot of attacks and malfunctions have been observed in the network, influencing on availability/unavailability of resources of some kind, thus on system dependability. In order to save the viability of service quality after the malfunction, reconfiguration of the system architecture has been recommended. Usually several scenarios of the changes (called sometimes in the literature choreography) are possible and can be considered. Each scenario has been evaluated from several points of view. Our aim is to select the best target scenario. This leads to a problem of discrete optimization (because of finished set of possible scenarios) and multi-criteria (because of the number of evaluation criteria taken into account) with unusual technology of goal function evaluation (simulation). Such view rarely exist.
Solution Approach Starting from a critical survey of models and methods used in scheduling and work-loading in CC systems we direct toward an approach using the best approaches recommended in modern optimization for particularly hard discrete single- or multi-criteria problems, with some proposals for using these tools to solve the problem of optimal balancing in CC and reconfiguring of CIS. Note that either balancing in CC as well as flow of task in CIS can be considered as an discrete optimization problem. It is commonly known, that combinatorial optimization (CO) problems derived from practice, due to features of the domain, require especially increased calculation power in order to find a solution satisfactory for the user. Application in real-time systems formulates quite sharp requirements.
Cloud management. Identical resources. cloud + workload balancer(s) Communicationlinks + network balancer Communicationlinks + network balancer
Cloud management. Uniform resources. cloud + workload balancer Communication + network balancer Communication + network balancer
Cloud management. Unrelated resources. cloud + workload balancer Communication + network balancer Communication + network balancer
CIS management. Inhomogeneous resources. Dispersed cloud + dispersed workload balancer Communication + network balancer Users
Off-line balancing. Deterministic model I. Parallel identical mprocessors, task preemptions, makespan Packing m bins of the calculated size, splitting them if necessary. RR
Off-line balancing. Deterministic model II. Parallel identical mprocessors; no task preemption; makespan Schedule: LPT rule; greedy Experimentally: 1.0 ... 1.1
Off-line balancing. Deterministic model III. Parallel uniformm processors, task preemptions, makespan • Find assignment of task to processors by using processor share • Find the schedule
Off-line balancing. Deterministic model IV. Parallel unrelated m processors, task preemptions, makespan; task j, processor i • Find assignment of tasks to processors • Find the schedule LP program
Off-line balancing. Deterministic model V. Parallel uniform m processors, task preemptions; makespan unit processing times Transportation problem: Transportation network: n sources and m*n targets; arc (j,(i,k)) has cost which corresponds to the completion time of task j processed on processor i as k-th in order; variable y
Off-line balancing. Deterministic model VI. Parallel uniform mprocessors; no task preemptions; makespan Schedule: modified LPT rule
Off-line balancing. Deterministic model VII. Parallel uniform m processors, task preemptions; flowtime unit processing times Transportation problem: Transportation network: n sources and m*n targets; arc (j,(i,k)) has cost which corresponds to the completion time of task j processed on processor i as k-th in order; variable y
On-line balancing. Model I. Parallel identical mprocessors; no task preemption; makespan (Greedy) Assign new task to the least loaded processor. Competitive ratio 2-1/m; 3/2 and 5/3 are the best for m=2 and m=3 respectively Improved to 1.945, 1.923 for small m>=4 (LB is 1.852) Randomized: 4/3 for m=2 and (LB=1.582) for m>2
On-line balancing. Model II. Parallel uniform mprocessors; no task preemption; makespan
On-line balancing. Model III. Limited assignment Parallel identical m processors; no task preemption; makespan
On-line balancing. Model IV Parallel unrelated m processors; no task preemption; makespan
MULTICRITERIA OPTIMIZATION. USER PREFERENCES • Known a priori • Known a posteriori • No user preferences • Iteratively extracted • Pareto frontier
After the analysis we propose the following taxonomy: • x is deterministic, function K(x) is given by a formula (clear, the most frequent case), • x is deterministic, function K(x) is given by a deterministic polynomial-time algorithm (e.g. longest path in the graph defined by x), • x is deterministic, function K(x) is given by a deterministic exponential-time algorithm (e.g. TSP for given set of cities x), • x is deterministic, function K(x) is given by a deterministic algorithm provided in form of pseudocode or program code, • x is random variable, function K(x) represents certain measure on x (e.g. moments, probability), • x is random variable, function K(x) is given by an algorithm (e.g. RR in simple queening systems), • x is fuzzy variable, function K(x) represents certain defuzzified measure on x • x is any variable, function K(x) is given as the result of running program code (especially the result of a simulation), • x is any variable, function K(x) is given as the result of sensor measurement.
Proposals • Set the balancer as the algorithm with some measured values and some control parameters • Measure necessary values, make estimation • Adjust control parameters by solving the optimization problem • Store solution in repository • Evaluate solution by simulation • Extend repository by simulating „if else” cases • Take best solution from the repository
CO Troubles • Inability of finding feasible solution • High calculation cost • Slow convergence • Premature convergence • Search stagnation • Imprecise data
OPTIMIZATION. TIME/COST OF CALCULATIONS CURSE OF DIMENSIONALITY Pleasewait. Calculationswilllast 3 289 years NP-HARDNESS LAB INSTANCE 5..20 VARIABLES ! ! ? NONLINEAR FUNCTION OF 1980 VARIABLES !!! INSTANCE FROM PRACTICE
Approximate Approaches • constructive/improvement • priority rules • random search • greedy randomized adaptive • simulated annealing • simulated jumping • estimation of distribution • tabu search • adaptive memory search • variable neighborhood search • evolutionary, genetic search • differential evolution • biochemistry methods • immunological methods • ant colony optimization • particle swarm optimization • neural networks • threshold accepting • path search • beam search • scatter search • harmony search • path relinking • adaptive search • constraint satisfaction • descending, hill climbing • multi-agent • memetic search • bee search • intelligent water drops * * * * * METHODS RESISTANT TO LOCAL EXTREMES
Balancing as a configuration Referring to the taxonomy provided one can say that the form of optimization task for the case of CIS collapse depends on the style of CIS description.
Balancing with data estimation Taking into account fundamental features of the CIS architecture and activity, the simulation seems to be the most adequate method of criteria calculation. Effects of such approach are manifold. Solution corresponds to configuration of services/balancing in CIS, i.e. their distribution among nodes. The service located at the node using contrary policy of queue or resources is treated as different solution. Requirements coming to CIS from workstations are treated as the noise from statistical point of view. Single simulation provides a measurement of some parameter(s) representing criteria treated as observed realization of the random variable. Statistically important sample is necessary
CONCLUSIONS • Universal method ? • Data collection/estimation • Adaptation • Repository of workload methods • Repository of scenarios/user profile • Approximate methods • Open repository • Task splitting • Unreliable search • Cost reduction • Green calculations • Multi-criteria analysis • Educational, scientific and business app
Thank you for your attention Czesław Smutnicki