1 / 35

NetProfiler: Profiling Wide-Area Networks Using Peer Cooperation

NetProfiler: Profiling Wide-Area Networks Using Peer Cooperation Venkata N. Padmanabhan (Microsoft Research) Sriram Ramabadhrah (UCSD) Jitendra Padhye (Microsoft Research) IPTPS 2005. University of Cincinnati, 2005. Motivation. ISP2. ISP3. Internet. ping. traceroute.

Download Presentation

NetProfiler: Profiling Wide-Area Networks Using Peer Cooperation

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. NetProfiler: Profiling Wide-Area Networks Using Peer Cooperation Venkata N. Padmanabhan (Microsoft Research) Sriram Ramabadhrah (UCSD) Jitendra Padhye (Microsoft Research) IPTPS 2005 University of Cincinnati, 2005

  2. Motivation ISP2 ISP3 Internet ping traceroute Conventional tools do not allow us to view the network from multiple vantage points. ISP1 University of Cincinnati, 2005

  3. Problems • Operators/ISPs: cant really see what kind of network experience there users are having. • Users: cant do much if there are network problems. • Current tools like ping and traceroute capture information only from the perspective of one user • These tools focus only on the IP-level path. Problems maybe with higher layer entities like proxies and servers. University of Cincinnati, 2005

  4. More problems and a solution • Tools like ping and traceroute rely on “active probing”. • Active probing creates overheads especially if too many hosts start probing simultaneously. • Active probing doesn’t always disambiguate the cause of the problem. • Proposed solution: • Hosts accumulate performance information passively, from multiple vantage points, across all layers (not only the IP level path), and the information is aggregated using a p2p network. University of Cincinnati, 2005

  5. Tasks • To acquire performance data from end-user perspective. • To aggregate this data in meaningful ways and share it among cooperating peers. • To use this data for fault analysis and diagnosis. University of Cincinnati, 2005

  6. Outline • NetProfiler • Architecture • Data Acquisition • TcpScope • WebScope • Normalization • Data Aggregation • Attributes • Aggregation Tree • Filtering • Existing Systems • Analysis and Diagnosis • Distributed Blame Allocation • Comparative Analysis • N/W Engg. and Analysis • N/W Health Reporting • Experimental Results • Discussion • Conclusion Task 1: Acquire data

  7. NetProfiler • NetProfiler is the p2p system that is suggested which addresses the issues and problems outlined. University of Cincinnati, 2005

  8. NetProfiler - Architecture • The architecture and design of NetProfiler can be described with respect to the following activities: • Data Acquisition • Normalization • Data Aggregation • Analysis and Diagnosis University of Cincinnati, 2005

  9. NetProfiler – Architecture - Data Acquisition • Data Acquisition • Performed by ‘sensors’. • Sensors are software modules residing on end hosts. • Primarily perform passive observation. • Sensors could be developed for various protocols and applications. • Two sensors were developed • TcpScope (for tcp) • WebScope (for http) University of Cincinnati, 2005

  10. NetProfiler – Architecture - Data Acquisition- TcpScope • TcpScope • Designed to work at the transport layer. • Passive sensor which listens for TCP transfers. • Parameters that are measured include (but are not limited to): • Round Trip Time (RTT) • Congestion Window of sender (cwnd) • Bottleneck bandwidth University of Cincinnati, 2005

  11. NetProfiler – Architecture - Data Acquisition- WebScope • WebScope • If a caching proxy is used, the user loses visibility at the transport layer beyond the proxy. To get around this problem, another sensor called WebScope was developed. • This gives visibility into an end-to-end web transaction. • Cache busting techniques are employed to make sure content is not served from the cache University of Cincinnati, 2005

  12. NetProfiler – Architecture - Data Acquisition • Data Acquisition - Summary • Data is acquired at the hosts by the use of sensors (software modules) • They do not actively probe but listen passively. • Two have been implemented – TcpScope, WebScope. • Measures include – RTT, cwnd, bottleneck bandwidth University of Cincinnati, 2005

  13. Outline • NetProfiler • Architecture • Data Acquisition • TcpScope • WebScope • Normalization • Data Aggregation • Attributes • Aggregation Tree • Filtering • Existing Systems • Analysis and Diagnosis • Distributed Blame Allocation • Comparative Analysis • N/W Engg. and Analysis • N/W Health Reporting • Experimental Results • Discussion • Conclusion Task 2: Aggregate and share data

  14. NetProfiler – Architecture - Normalization • Normalization • Once the data is acquired through sensors, it needs to be aggregated and shared with peers. • But, before that, it must be normalized. • This is required because every host is operating with different parameters such as bandwidth and data transfer rate. University of Cincinnati, 2005

  15. NetProfiler – Architecture – Data Aggregation • Data Aggregation • Sensors are used to gather performance data at each host. • For the system to be meaningful, an efficient aggregation scheme must be devised. • The authors have proposed an aggregation based on a “set of client attributes”. University of Cincinnati, 2005

  16. NetProfiler – Architecture – Data Aggregation - Attributes • Attributes • Hierarchical – impose a natural hierarchy on the hosts along which data can be aggregated. • Example - Geographical Location • Neighborhood -> City -> Region -> Country • Non – hierarchical – used for filtering • Example – Bandwidth class University of Cincinnati, 2005

  17. NetProfiler – Architecture – Data Aggregation – Aggregt’n Tree • Aggregation Tree • A tree which mirrors the hierarchy imposed by the attribute. • Data can be aggregated along this tree. University of Cincinnati, 2005

  18. NetProfiler – Architecture – Data Aggregation – Filtering • Filtering • Hierarchical attributes are used to logically organize the data aggregation. • But as data travels from node to node, it is separated into classes / types based on non-hierarchical attributes of the host. University of Cincinnati, 2005

  19. NetProfiler – Architecture – Data Aggregation – Existing Systems • Existing Systems for distributed monitoring, aggregating and querying on the Internet • SDIMS • Ganglia • Slicestat • IrisNet • PIER • Sophia • Astrolabe The idea here is toleverage these systemsto aggregate data. Thegoalis to develop a system for fault-diagnosisand analysis rather than to design a new data aggregation system. University of Cincinnati, 2005

  20. Outline • NetProfiler • Architecture • Data Acquisition • TcpScope • WebScope • Normalization • Data Aggregation • Attributes • Aggregation Tree • Filtering • Existing Systems • Analysis and Diagnosis • Distributed Blame Allocation • Comparative Analysis • N/W Engg. and Analysis • N/W Health Reporting • Experimental Results • Discussion • Conclusion Task 3: Use data for analysis and diagnosis

  21. NetProfiler – Architecture – Analysis and Diagnosis • Analysis and Diagnosis • So far, the acquisition and aggregation of data has been discussed. • How is the aggregated data used to analyze the system and diagnose commonly occurring problems ? University of Cincinnati, 2005

  22. NetProfiler – Architecture – Analysis and Diagnosis – Dist. Blame Allocation • Distributed Blame Allocation • A client experiencing bad performance will tend to blame the entities involved in the end-to-end transaction. • The idea behind distributed blame allocation is to use data supplied by sensors and other nodes to locate trouble spots. University of Cincinnati, 2005

  23. NetProfiler – Architecture – Analysis and Diagnosis – Comparative Analysis • Comparative Analysis • This is an attractive feature of the profiling system. Clients have information about the performance experienced by neighbors. • This is used to make decisions such as switching to a higher type of service or switching ISPs. University of Cincinnati, 2005

  24. NetProfiler – Architecture – Analysis and Diagnosis – Comparative Analysis • Comparative Analysis - Examples • Am I being treated fairly ? • A client continues to experience worse performance than its neighbors even though they are served by the same ISP…time to change ISPs. Without the aid of NetProfiler, this kind of information cannot be made available. • Should I get a better connection ? • A client has information about the performance experienced by neighbors who are in a higher bandwidth class. This aids in the decision making process. University of Cincinnati, 2005

  25. NetProfiler – Architecture – Analysis and Diagnosis – N/W Engg. & Analysis • Network Engineering and Analysis • Network engineering and analysis is facilitated by NetProfiler performance data. • Example • Network operators may want to choose from a set of alternatives for a company network. Like – leased lines or VPN • Usually, performance data is based on link and router monitoring. • NetProfiler provides and end-user perspective of network performance from multiple vantage points. University of Cincinnati, 2005

  26. NetProfiler – Architecture – Analysis and Diagnosis – N/W Health Reporting • Network Health Reporting • The information gathered by NetProfiler can be used to generate reports about the general health of a wide-area-network. • These reports are available through other means but the advantage of using NetProfiler are greater coverage, lower cost, flexibility and contribution based operation. University of Cincinnati, 2005

  27. Outline • NetProfiler • Architecture • Data Acquisition • TcpScope • WebScope • Normalization • Data Aggregation • Attributes • Aggregation Tree • Filtering • Existing Systems • Analysis and Diagnosis • Distributed Blame Allocation • Comparative Analysis • N/W Engg. and Analysis • N/W Health Reporting • Experimental Results • Discussion • Conclusion

  28. NetProfiler – Experimental Results - Setup • Experimental Results – Setup • Heterogeneous set of clients • 147 PlanetLab nodes, dialup hosts, connected to 26 Pops on MSN network and 5 connected to the Microsoft Corporate Network. • Repeatedly downloading from a diverse set of 70 websites University of Cincinnati, 2005

  29. NetProfiler – Architecture – Experimental Results - Findings • Experimental Results – Findings • Access to a website failed at almost every client. Suggests a server-side problem. • Differences in failure rates among seemingly “equivalent” clients. • A group of nodes sharing an attribute had higher failure rates than all others. This suggests a common cause of the problem affecting these nodes. • [details] University of Cincinnati, 2005

  30. NetProfiler – Experimental Results - Findings • Experimental Results – Findings • Sometimes, problems occurred between a client-server pair. This suggests that there is some problem in the path between these two nodes. [details] University of Cincinnati, 2005

  31. NetProfiler – Discussion • Discussion • Deployment Models • Coordinated • Organic University of Cincinnati, 2005

  32. NetProfiler – Discussion • Organic Deployment - Challenges • Security • Clients may not want to reveal IP or other info. • They may be allowed to report at a “coarser” level, eg. At ISP • Data integrity – malicious users sending incorrect performance data • Bootstrapping – chicken-and-egg • Proposed solution is to allow some active probing in the initial stages University of Cincinnati, 2005

  33. Conclusions • NetProfiler is a system that enables monitoring and diagnosis of • network problems. • It is meant for wide area networks and relies on heavy participation. • It can potentially benefit both ISPs and end users. • Ongoing work includes detection of surreptitious activity to predict • attacks.

  34. End http://geocities.com/alisaif_uc/ppts.htm Saif Ali ECECS University of Cincinnati April 2005

  35. Details of experimental findings System wide failures Most nodes Could not reach the sites www.technion.ac.il and www.hku.hk Discrepancy among seemingly equivalent nodes Among MSN nodes connected to PoPs, those using ICG as the upstream provider experienced lower failure rates than those using other providers like QWEST or UUNET. Group of clients sharing an attribute All 5 hosts on the Microsoft Corporate Network experience high failure rates in accessing a site www.royal.gov.uk whereas failure for other hosts is negligible. Particular Client-Server Pair A Microsoft Corporate Network node in China can never access the site www.nmt.edu whereas others do not have this problem. Back

More Related