300 likes | 427 Views
R outing S tate D istance: A Path-based Metric for Network Analysis. Natali Ruchansky Gonca Gürsun , Evimaria Terzi , and Mark Crovella. Distance Metrics for Analyzing Routing. Shortest Path Distance. Similarly Routed. A New Metric.
E N D
Routing State Distance:A Path-based Metric forNetwork Analysis NataliRuchansky GoncaGürsun, EvimariaTerzi, and Mark Crovella
Distance Metrics for Analyzing Routing Shortest Path Distance Similarly Routed
A New Metric Based on this distance intuition we develop a newmetric based on pathsand show it is good for: • Visualizationof networks and routes • Characterizingroutes • Detecting significant patterns • Gaining insightabout routing
We call this path-based distance metric: Routing State Distance
Conceptually… • Imagine capturing the entire interdomain routing state of the internet in a matrix the next hop on path from to • Each rowis the routing table of a single AS • Now consider the columns… Destinations Sources
Routing State Distance We define between two prefixes and as the numberof entries that differ in their columns of • i.e. the number of ASes • that disagree about the next-hop to and .
More Formally Givena universe of prefixes define: A next-hop matrix : the next-hop on the path to As well as :
RSD to BGP In order to apply to measured BGP pathswe define to have ASeson rows and prefixeson columns. the next-hop from AS to prefix • A few issues arise… • Missing Values • Multiple next hops • Solution Key: • is defined on a set • of paths NOT a graph
Our Data From 48 million AS paths consisting of: 359 unique monitors 450K destination prefixes We end up with: 243 sources ASes 130K prefixes Thus our is
Why is appealing? Let’s take a look at its properties…
RSD versus Hop Distance No relation between RSD and hop distance
Finer Grained Measure Varies smoothlyand has a gradualslope. Allows fine granularity Increase of 1 encompasses many prefixes
Highly structured Allows 2D visualization From compute, our distance matrix where:
Wow! Highly structured This happens with anyrandomsample Internet-wide
Yeah, but a cluster of what!?! First think matrix-wise (): • A cluster corresponds to a set of columns • Columns being close in means they are similar in some positions • is highly coherent Now in routingterms: • Any row in must have the same next hop in nearly each cell • The set of ASes make similar routing decisions w.r.t destinations • We call such a pair a local atom
Small cluster “C” Large Cluster Small cluster “C” Large cluster
A local atom is a set of prefixes that are routed similarly in some region of the internet. So the smaller cluster is a local atom of certain prefixes that are routed similarly by a large set of ASes
Why these specific prefixes? For this investigate … • Prefer a specific AS for transit to these prefixes. Hurricane Electric (HE) • If anypath passes through HE : • Source ASes prefer that path • Prefix appears in the smaller cluster . Hurricane Electric Level3 Sprint
But why do sources alwaysroute through HE if the option exists? ….HE has a relatively unique peering policy. Offer peering to ANYAS with presence in the same exchange point. HE’s peers prefer using HE for ANYcustomer of HE And hence consists of networks that peer with HE, and consists of HE’s customers
Can We Find More Clusters? Analysis with uncovered a macroscopic atom Can we formulate a systematic study to uncover other smaller atoms? Intuitivelywe would like a partitioning of the prefixes such that : • In the same group is minimized • Between different groups is maximized
RS-Clustering Problem Intuition: A partitioning of the prefixes such that : • In the same group is minimized • Between different groups is maximized For a partition : Key Advantage: Parameter Free!
Optimal is Hard Finding the optimal solution to the Problem is NP-hard We propose two approaches: Pivot Clustering Overlap Clustering
Pivot Clustering Algorithm Given a set of prefixes , their values, and a threshold parameter : • Start from a random prefix (the pivot) • Find all that fall within distance to and form a cluster • Remove cluster from and repeat Advantages: • The algorithm is fast : O(|E|) • Provable approximation guarantee
5 largest clusters Clusters show a clear separation Each cluster corresponds to a local atom
We ask ourselves if • apartitionis really best? To address this we propose a formalism called Overlap Clustering and show that it is capable of extracting such clusters. Seek a clustering that captures overlap
Related Work • Reported that BGPtables provide an incomplete view of the AS graph. [Roughanet. al. ‘11] • Visualizationbased on AS degree and geo-location. [Huffakerandk. claffy‘10] • Small scale visualizationthrough BGPlay and bgpviz • Clusteringon the inferred AS graph. [Gkantsidiset. al. ‘03] • Grouping prefixes that share the same BGP paths into policy atoms. [Broidoandk. claffy‘01] • Methods for calculating policy atoms and characteristics. [Afeket. al. ‘02]
Take-Away Analysis with typical distance metrics is hard We introduce a new one -- Routing State Distance – that is simpleand based only on paths Overcome BGP hurdles and show it can be used for: • In-depth analysisof BGP • Capturing closenessuseful for visualization • Uncovering surprising patterns • General setting Developed a new set of tools for extracting insight from BGP measurements
Code, data, and more informationis available on our website at: csr.bu.edu/rsd Code • Pivot Clustering • Overlap Clustering • RSD Computation Data • Prefix List • Pairwise RSD
Thank you!! NataliRuchansky GoncaGürsun, EvimariaTerzi, and Mark Crovella