1 / 25

Non-Transitive Connectivity and DHTs

Non-Transitive Connectivity and DHTs. Mike Freedman Karthik Lakshminarayanan Sean Rhea Ion Stoica WORLDS 2005. Distributed Hash Tables…. k. System assigns keys to nodes All nodes agree on assignment Chord assigns keys as integers modulo 2 160 Assigns keys via successor relationship

Download Presentation

Non-Transitive Connectivity and DHTs

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Non-Transitive Connectivity and DHTs Mike Freedman Karthik Lakshminarayanan Sean Rhea Ion Stoica WORLDS 2005

  2. Distributed Hash Tables… k • System assigns keys to nodes • All nodes agree on assignment • Chord assigns keys as integers modulo 2160 • Assigns keys via successor relationship • Each node must know predecessor R

  3. Distributed Hash Tables… k • Used to store and retrieve (key, value) pairs • Any node can discover key’s successor, yet without full knowledge of network • Implies some form of routing R

  4. Distributed Hash Tables… • All have implicit assumption: full connectivity

  5. Distributed Hash Tables… • All have implicit assumption: full connectivity • Non-transitive connectivity (NTC)not uncommon B ↔ C , C ↔ A , A ↔ B • A thinks C is its successor! X A B C k

  6. Does non-transitivity exist? • Gerding/Stribling PlanetLab study • 9% of all node triples exhibit NTC • Attributed high extent to Internet-2 • Yet NTC is also transient • One 3 hour PlanetLab all-pair-pings trace • 2.9% have persistent NTC • 2.3% have intermittent NTC • 1.3% fail only for a single 15-minute snapshot • Level3 ↔ Cogent, but Level3 ↔ X ↔ Cogent • NTC motivates RON, Detour, and SOSR!

  7. Our contributions • We have built and run Bamboo (OpenDHT), Chord (i3), Kademlia (Coral) for > 1 year • Vanilla DHT algorithms break under NTC • Identify four main algorithmic problems and present our solutions

  8. Our goals • Short-term • Inform other developers about NTC solutions • Important: DHTs are being widely deployed in Overnet, Morpheus, and BitTorrent • Long-term • Encourage new designs to directly handle NTC • (This topic is far from solved)

  9. A B k R S DHTs 101: Routing • Key space defines an identifier distance • Routing ideally proceeds by halving distance to destination per overlay hop Iterative

  10. A B k R S A B k R S DHTs 101: Routing Iterative Recursive

  11. DHTs 101: Routing tables • successors / leaf set: ensure correctness • fingers / routing table: efficient routing • O ( log (n) ) hops, generally k R

  12. Problems we identify • Invisible nodes • Routing loops • Broken return paths • Inconsistent roots

  13. NTC problem fundamental? S A B C R Traditional routing

  14. NTC problem fundamental? S A B C R • DHTs implement greedy routing for scalability • Sender might not use path, even though exists: finds local minima when id-distance routing Greedy routing Traditional routing

  15. Problems we identify • Invisible nodes • Routing loops • Broken return paths • Inconsistent roots (First discuss how problems apply to iterative routing, then consider recursive routing.)

  16. Iterative routing: Invisible nodes B C k A R X S • Invisible nodes cause lookup to halt

  17. Iterative routing: Invisible nodes X B C D k A R X S • Invisible nodes cause lookup to halt • Enable lookup to continue • Tighter timeouts via network coordinates • Lookup RPCs in parallel • Unreachable node cache

  18. Routing table pollution B C k A R S • Many proposals for maintaining routing tables • E.g., replace nodes with larger RTT • Must first prevent routing table pollution • Only add new nodes upon contacting directly • Do not immediately remove nodes from hearsay

  19. Inconsistent roots • Nodes do not agree where key is assigned: inconsistent views of root • Can be caused by membership changes • Also due to non-transitive connectivity • May persist indefinitely k R S’ ? X S R’

  20. Inconsistent roots • No solution when network partitions • If non-transitivity is limited: • Consensus among leaf set? • [Etna, Rosebud] • Expensive in messages and bandwidth • Link-state routing among leaf set? • [Pastry 1.4.1] • Can use application-level solutions!

  21. Inconsistent roots • Root replicates (key,value) among leaf set • Leafs periodically synchronize • Get gathers results from multiple leafs • [OpenDHT, DHash] • Not applicable when require fast update (i3) k R X S R’ M N

  22. Recursive routing • Invisible nodes • Must also prevent routing table pollution • Easier to achieve accurate timeouts • Harder to perform concurrent RPCs • Inconsistent Roots • Similar solutions • (Routing Loops) • One new problem…

  23. Broken return paths • Direct path back from R to S fails • Source-route reverse path • Use single intermediate hop • RON, Detour, SOSR… k R X S T

  24. Summary • Non-transitive connectivity exists • DHTs must deal with it • Discovered problems the “hard way” • OpenDHT / Bamboo, i3 / Chord, Coral / Kademlia • Presented our “from the trenches” fixes • NTC should be considered during design phase

  25. Thanks… Watch Our Real, Large Distributed Systems… coralcdn.org opendht.org i3.cs.berkeley.edu

More Related