240 likes | 247 Views
SCAN: A Dynamic, Scalable, and Efficient Content Distribution Network. Yan Chen, Randy H. Katz, John D. Kubiatowicz {yanchen, randy, kubitron}@CS.Berkeley.EDU EECS Department UC Berkeley. Outlines. Motivation Goal and Challenges Previous Work SCAN Architecture and Components
E N D
SCAN: A Dynamic, Scalable, and Efficient Content Distribution Network Yan Chen, Randy H. Katz, John D. Kubiatowicz {yanchen, randy, kubitron}@CS.Berkeley.EDU EECS Department UC Berkeley
Outlines • Motivation • Goal and Challenges • Previous Work • SCAN Architecture and Components • Evaluation Methodology • Results • Conclusions
Goal and Challenges Provide content distribution to clients with good latency and staleness, while retaining efficient and balanced resource consumption of the underlying infrastructure • Dynamicchoice ofnumber and location of replicas • Clients’ QoS constraints: latency, staleness • Servers’ capacity constraints • Efficient resource consumption • Small delay, bandwidth consumption for replica update • Small replica management cost • Scalability: millions of objects, clients and servers • No global network topology knowledge
Previous Work • Replica Placement • Research communities: optimal static replica placement • Assume clients’ distributions, access patterns & IP topology • No consideration for clients’ QoS or servers’ capacity constraints • CDN operators: un-cooperative, ad hoc placement • Centralized CDN name server cannot record replica locations – place many more than necessary (ICNP ’02) • Update Multicast • No inter-domain IP multicast • Most application-level multicast (ALM) unscalable • Split root as common solution, suffers consistency overhead
replica cache always update adaptive coherence CDN server client DOLR mesh SCAN: Scalable Content Access Network Dynamic replica placement & d-tree construction data source data plane Web content server network plane DOLR with locality
Components of SCAN • Decentralized Object Location & Routing (DOLR) • Properties needed • Scalable location with guaranteed success • Search with locality • Improve the scalability of d-tree: each member only maintains states for its parent and direct children • Simultaneous Dynamic Replica Placement and d-tree Construction • Replica search: Singular, Localized or Exhaustive • Replica placement on DOLR path: Lazy or Eager
parent candidate proxy DOLR path Replica Search • Singular Search data plane s c network plane DOLR mesh
Greedy load distribution parent candidates DOLR path Replica Search • Localized search data plane client child s parent proxy sibling c server child DOLR mesh network plane
first placement choice Replica Placement: Eager data plane s proxy c network plane DOLR mesh DOLR path
first placement choice DOLR path Replica Placement: Lazy data plane client child s proxy c network plane DOLR mesh
Evaluation of Alternatives • Two dynamic overlay approaches • Overlay_naïve: Singular search + Eager placement • Overlay_smart: Localized search + Lazy placement • Compared with static placement + IP multicast • Overlay_static: With global overlay topology • IP_static: With global IP topology (ideal) • Metrics • Number of replicas deployed, load distribution • Multicast performance: Relative Delay Penalty (RDP) and bandwidth consumption • Tree construction traffic (packets and bandwidth)
Methodology • Network Topology • 5000-node network with GT-ITM transit-stub model • SCAN nodes placed randomly or on transit nodes • NS-like Packet-level Network Simulations • Workloads • Synthetic flash crowd: all clients access a hot object in random order • Real Web server traces: NASA and MSNBC
Methodology: Sensitivity Analysis • Various Client/Server Ratio • Various Server Density • Various Latency & Capacity Constraints • Various Network Topologies • Average over 5 topologies with different setup • All Have Similar Trend of Results • Overlay_smart has close-to-optimal (IP_static) number of replicas, load distribution, multicast performance with reasonable amount of tree construction traffic
Number of Replicas Deployed and Load Distribution • Overlay_smart uses only 30-60% of replicas than overlay_naïve and very close to IP_static • Overlay_smart has two times better load distribution than od_naïve, overlay_static and very close to IP_static
Multicast Performance • 85% of overlay_smart Relative Delay Penalty (RDP) less than 4 • Bandwidth consumed by overlay_smart is very close to IP_static, and is only 1/3 of bandwidth by overlay_naive
Tree Construction Traffic Including “join” requests, “ping” messages, replica placement and parent/child registration • Overlay_smart consumes 3 - 4 times of traffic than overlay_naïve, and the traffic of overlay_naïve is quite close to IP_static • Far less frequent event than access & update dissemination
Conclusions • P2P networks can be used to construct CDNs • SCAN: Scalable Content Access Network with good QoS, efficiency and load balancing • Simultaneous dynamic replica placement & d-tree construction • Leverage DOLR to improve scalabilityandlocality • In particular, overlay_smart recommended • Localized search + Lazy placement • Close to optimal number of replicas, good load distribution, low multicast delay and bandwidth penalty at the price of reasonable construction traffic
Results on Web Server Traces • Limited simulations, most URLs have very few requests • Overlay_smart uses only one third to half replicas than overlay_naïve for hot objects
replica cache always update adaptive coherence CDN server client DOLR mesh SCAN: Scalable Content Access Network Dynamic replica placement & d-tree construction data source data plane Web content server network plane DOLR with locality
parent candidate proxy DOLR path Replica Search • Singular Search data plane s c network plane DOLR mesh
Localized search • Greedy load distribution parent candidates DOLR path Replica Search data plane client child s parent proxy sibling c server child network plane
first placement choice Dynamic Replica Placement: naïve • Singular Search • Eager Placement data plane parent candidate s proxy c network plane Tapestry mesh Tapestry overlay path
first placement choice Tapestry overlay path Dynamic Replica Placement: smart • Localized search • Lazy placement • Greedy load distribution data plane parent candidates client child s parent proxy sibling c server child network plane