1 / 23

Toward Better Geolocation: Improving Internet Distance Estimates Using Route Traces

Toward Better Geolocation: Improving Internet Distance Estimates Using Route Traces. Chandrika Jayant Ethan Katz-Bassett. Outline. Motivations for geolocation Constraint-Based Geolocation Problems with CBG Our Approach PlanetLab Experiments Conclusion/ Future Work. Geolocation?.

darryl
Download Presentation

Toward Better Geolocation: Improving Internet Distance Estimates Using Route Traces

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Toward Better Geolocation:Improving Internet Distance Estimates Using Route Traces Chandrika Jayant Ethan Katz-Bassett

  2. Outline • Motivations for geolocation • Constraint-Based Geolocation • Problems with CBG • Our Approach • PlanetLab Experiments • Conclusion/ Future Work

  3. Geolocation? • Infer the geographic location of an Internet host • Many applications would benefit from this information • Advertising, EBS, location sensitive info • Different levels of granularity

  4. Constraint-Based Geolocation • Landmarks: Set of hosts with known locations • Each landmark estimates distance to target • Set performs multilateration using these distances -Gueye, Ziviani, Crovella, Fdida (2004)

  5. CBG Multilateration

  6. CBG Multilateration

  7. CBG Multilateration

  8. CBG Multilateration

  9. CBG Bestline Distance Estimates

  10. CBG Breakdowns (CBGB’s) • Estimates are not tight and vary widely  large confidence regions, need many probes to get a few tight ones • No better at estimating training set vs. other hosts (in general) • More data trained on, worse accuracy (in general) • Still underestimate some distances

  11. Our Approach • Intuition: Targets that have similar routes have similar delay  distance conversions • Use route info to achieve more accurate estimates • Want to fit into CBG framework • 2 main techniques, still using bestline fit: • Path-Based • Router-Based

  12. Path-Based Estimation • Landmark learns routes to its training set • Traceroute target up to TTL = x • Find longest partial path shared with a subset of • training hosts • Calculate bestline using only this subset

  13. Router-Based Estimation • Landmark learns routes to its training set • Send packet to target with TTL = x • Find subset of training hosts with paths through this router • Calculate bestline using only this subset • In practice, use x1,x2,…,xn

  14. PlanetLab Experiments • 110 PlanetLab hosts in North America • Lat/long available for each • Used Scriptroute to gather delay and routes between hosts • 26 landmarks (after munging) • Path-Based used TTL up to 12 • Router-Based used TTLs (12,9,6)

  15. Map of Landmark Hosts (26)

  16. Path Length vs. Accuracy

  17. Router TTL vs. Accuracy

  18. Overall Accuracy of Estimations

  19. Estimations to Collocated Targets

  20. Effects of Training Set Size

  21. Conclusions • Geolocation has powerful potential • CBG is interesting, but needs improvement • Route information improves accuracy of distance estimates • With more accurate estimations, likely need fewer landmarks to locate targets

  22. Future Work • Modify CBG to better handle underestimates/ use other line fits • Test on larger data sets • Numerical analysis • Extend to hosts w/ slower connections • Use delay distribution to “normalize” measurements

  23. Questions?

More Related