Database Replication Policies for Dynamic Content Applications

Database Replication Policies for Dynamic Content Applications Gokul Soundararajan, Cristiana Amza, Ashvin Goel University of Toronto EuroSys 2006: Leuven, Belgium April 19, 2006

Dynamic Content Web Server

Today’s Server Farms • Data centers can run multiple applications • E.g., IBM/HP • Service providers can multiplex resources • E.g., applications have peaks at different times • Challenge: database server becomes the bottleneck

Motivation • Scale the database backend on clusters • Handle more clients • Run multiple applications • Handle failures in the backend • Our approach: • Database replication • Dynamic replica allocation • Adapt to changing load or failures

Database Replication • Read-one, write-all • Plattner & Alonso, MW 04 • Lin et. al, SIGMOD 05 • Amza et. al, ICDE 05 Scaling for E-Commerce (TPC-W)

Dynamic Replication • Assume a cluster hosts 2 applications • App1 (Red) using 2 machines • App2 (Blue) using 2 machines • Assume App1 has a load spike

Dynamic Replication • Choose nr. of replicas to allocate to App1 • Say, we adapt by allocating one more replica • Then, two options • App2 still uses two replicas (overlap replica sets) • App2 loses one replica (disjoint replica sets)

Challenges • Adding a replica can take time • Bring replica up-to-date • Warm-up memory • Can avoid adaptation with fully-overlapped replica sets

Challenges • However, overlapping applications compete for memory causing interference • Can avoid interference with disjoint replica sets

Challenges • However, overlapping applications compete for memory causing interference • Can avoid interference with disjoint replica sets Tradeoff between adaptation delay and interference

Insight for Dynamic Content Apps • Database reads are much heavier than writes • Reads are multi-table joins • Writes are single row updates • Overlapping reads – high interference • Overlapping writes – little interference

Insight for Dynamic Content Apps • Database reads are much heavier than writes • Reads are multi-table joins • Writes are single row updates • Overlapping reads – high interference • Overlapping writes – little interference Solution: Separate reads and overlap writes

Our Solution – Partial Overlap • Reads of applications sent to disjoint replica sets • Avoids interference • Read-Set • Set of replicas where reads are sent

Our Solution – Partial Overlap • Writes of apps sent to overlapping replica sets • Reduces replica addition time • Write-Set • Set of replicas where writes are sent

Optimization • For a given application, • Replicas in Write-Set – Fully Up-to-Date • Other Replicas – Periodic Batch Updates

When do we adapt? • Add when application’s requirements not met • Due to either load spikes or failures • Remove when replica not needed • Application requirements defined through a Service Level Agreement (SLA)

Resource Manager Feedback Loop Global Resource Manager Request Add/Remove Analyze Execute Monitor

Resource Manager Feedback Loop Global Resource Manager Request Add/Remove Analyze Execute When does the feedback loop end? Monitor

Request Add/Remove Analyze Execute Monitor Possible Oscillations • Change not seen immediately • Replica addition takes time • Bring replica fully up-to-date, warm-up memory • May trigger more adds • Oscillations cause interference between applications Global Resource Manager

Request Add/Remove Analyze Execute Monitor Avoiding Oscillations • Delay-Awareness • Use load-balance as heuristic for stabilization after replica addition • Removes are conservative • Tentative removes Global Resource Manager

Cluster Architecture

Experimental Setup • Hardware • AMD Athlon 2600+ running at 2.1 Ghz • 512 MB of RAM • 60 GB Hard Drive • Software • RedHat Fedora Core 2 Linux • Apache 1.3.31 with PHP 4.0 • MySQL 4.0.16 with InnoDB tables • Benchmarks • TPC-W: E-Commerce Retail Store • RUBIS: Online Bidding

Outline of Results • Defined SLA in terms of query latency bound • Query latency < 600 ms • Cluster Size • Up to 8 database replicas • 10 web/application servers • Experiments • Interference between Workloads • Adapting to Load Changes • Adapting to Faults

Disjoint

Partial Overlap

Full Overlap

Interference

Adaptation to Load Changes

Adapting to Load Changes • Three schemes • Disjoint – 4/4 • Dynamic allocation using Partial overlap • Full Overlap – 8/8

Disjoint TPC-W RUBIS

Full Overlap TPC-W RUBIS

Partial Overlap TPC-W RUBIS

Adaptation to Faults

More Results - In the Paper • More complex load scenarios • Including overload • Effect of delay-awareness • Avoiding oscillations

Conclusion • Database replication • Handle more clients • Dynamic replica allocation • Handle multiple workloads with different peaks • Handle faults

Thanks!

Database Replication Policies for Dynamic Content Applications

Database Replication Policies for Dynamic Content Applications

Presentation Transcript

Dynamic Content

Distributed Database and Replication

Replication solutions for Oracle database 11g

Dynamic content

Database Replication in Tashkent

Database Replication in WAN

Dynamic Content

Replication Policies for Federated Digital Repositories

Database replication policies for dynamic content applications

Dynamic languages for dynamic applications

Clustering Web Content for Efficient Replication

Cisco CallManager Database Replication

Database Replication

Dynamic Content

Clustering Web Content for Efficient Replication

Database Replication - Distribution

Cisco CallManager Database Replication

Cisco CallManager Database Replication

Dynamic Restart Policies

Oracle Database Replication

Database Replication Tool