380 likes | 549 Views
KEK Network. Qi Fazhi 2004.8. Super Sinet Router. 10 GbE. KEK GW. 2 x GbE. 2 x GbE. KEK SW L2/L3 Switch for outside connections. KEK SW L2/L3 Switch for outside connections. 10 GbE. 2 x GbE. 2 x GbE. IDS GbE. Netscreen Firewall. Netscreen Firewall. IDS GbE. 2 x GbE.
E N D
KEK Network Qi Fazhi 2004.8
Super Sinet Router 10GbE KEK GW 2 x GbE 2 x GbE KEK SW L2/L3 Switch for outside connections KEK SW L2/L3 Switch for outside connections 10GbE 2 x GbE 2 x GbE IDS GbE Netscreen Firewall Netscreen Firewall IDS GbE 2 x GbE 2 x GbE 4 x GbE Central L2/L3 Switch A Central L2/L3 Switch B 4 x GbE 4 x GbE 4 x GbE NOOD Switch to other places NOOD Switches to Buildings 10GbE GbE up to: 14 Vlan, Based MAC Static & DHCP(for wireless) Tskuba Center Other Univ. HEP Lab.
KEK’s firewall architecture 10GbE High Throughput Access Route L2/L3 Switch for outside connections 10GbE Regular flows 2 x GbE IDS GbE Firewall 2 x GbE Central L2/L3 Switch Securitymonitor 10GbE/GbE
SLAC, FNAL, BNL, CERN, DESY, SDSC, ..... universities in Japan SuperSINET SuperSINET SuperSINET KEK APAN universities/institutes in Asia-Pacific APAN/Transpac
SuperSINET’s links to US/EU IEEAF Atlantic GbE x 3 Since 2004.1 GbE GbE OC48 x 4 GbE POS Channel NII’s router at NewYork GbE SuperSINET (10Gbps) ISP GbE OC48 OC48 10GbE • SLAC is on ESnet 10GbE MAN LAN10GbE Switch Gèant CERN is on GEANT 10GbE Abilene 10GbE ESnet
WAN Performance Measurement and Monitor-- Study and Work at KEK Qi Fazhi 2004-07
Study • WAN Performance Terminology • WAN Performance Measurement Tools • How to get larger throughput
Terminology / Elements be relationship to WAN Performance • Bandwidth • Physical bandwidth, or capacity (C) • Available bandwidth (A) • Throughput • Maximum throughput • Achievable throughput • Latency / RTT
TERMINOLOGY • Latency • Packet Loss • Bandwidth:the speed that a network element can forward traffic. • (is independent of end hosts and protocol type) • Physical/capacity:maximum number of bits per second a network element can transfer(is determined by the slowest network element along the path) • Available:the capacity minus utilization over a given time interval. • Throughput:amount of data that is successfully sent from one host to another via a network(may be limited by every component along the path from source host to destination host) • Maximum throughput • Achievable throughput
Network Performance Measurement Tools • Iperf/Netperf:tools for measuring end-to-end TCP/UDP performance • pipechar: hop-by-hop bottleneck analysis tool • pchar: hop-by-hop performance measurement tool • traceroute: lists all routers from current host to remote host
How to get larger throughput Larger Throughput Decrease Latency / RTT Packet Loss Choose the best path Routingcoordination Latency / RTT Packet Loss Parallel stream TCP ? TCP Tuning
TCP Tuning • Why? • Bandwidth increases • TCP shortage • How? • Modify the TCP Windows Size • Modify the System Buffer Size • …… • Some New Protocol: FAST
TCP • TCP is adaptive • It is constantly trying to go faster • It slows down when it detects a loss • How much it sends is controlled by windows • When it sends is controlled by received ACK’s(or timeouts)
Why Tuning? Summary of techniques to maximize TCP WAN throughput • congestion window (CWND):TCP uses it to determine how many packets can be sent at one time, The larger the congestion window size, the higher the throughput. • TCP “slow start” and “congestion avoidance” algorithms determine the size of the congestion window • The maximum congestion window is related to the amount of buffer space that the kernel allocates for each socket (System) • For each socket, there is a default value for the buffer size, which can be changed by the program using a system library call just before opening the socket. • Kernel enforced maximum buffer size.
Bandwidth*Delay Product • Bandwidth * Delay = number of bytes in flight to fill path • The optimal buffer size is twice the bandwidth*delay product of the link(recover from errors) • buffer size = 2 * bandwidth * delay= bandwidth * RTT • bandwidth :get it from pipechar or pchar • RTT:get it from Ping • Example:ping time is 50 ms ; the end-to-end network consists of all 100 BT Ethernet and OC3 (155 Mbps).then TCP buffers should be .05 sec * (100 Mbits / 8 bits) = 625 KBytes • most of today UNIX OS's by default have a maximum TCP buffer size of only 256 KB! So we should do some tuning…… BW*Delay buffer Windows throughput
Summary Important Points About TCP • Throughput • congestion window :determine how many packets can be sent at one time ,the larger the congestion window size, the higher the throughput. • the size of the congestion window :slow start (constantly trying to go faster)and congestion avoidance (slows down when it detects a loss) • maximum congestion window :is related to the amount of buffer space that the kernel allocates for each socket • buffer size : • 2*Bandwidth*Delay Products(BDP)= bandwidth * RTT
Routing coordinationKEK<->IHEP • Before Change Status • Go through Russia and USA • KEK->IHEP: KEK->Sinet->Abilene->RBNet->CSTnet->IHEP • IHEP->KEK: IHEP->CSTnet-> RBNet ->Abilene ->Sinet ->KEK • RTT is very large • After Change Status • KEK->IHEP:KEK->SINet->APAN-JP->CERnet->NSFC->CSTnet->IHEP • IHEP->KEK(not changed):IHEP->CSTnet-> RBNet ->Abilene ->Sinet ->KEK • For the reason of not changing IHEP->KEK route,the RTT is still large,but there are some changes in the throughput
AS9405 KEK2505 IHEP3460 Sinet2907 Abilene11537 APAN_JP7660 Dargon_tap9407 Cernet4538 NSFC9406 CSTNet7497 APAN_JP&APAN_CN Link RBnet5568 AS3356 AS1239 Level 3 Sprint KEK-IHEP Route Tuning
Parallel TCP Stream KEK->IHEP One TCP Stream Parallel TCP Streams
WORK WAN Performance Measurement/Monitor • Target • Packet loss & latency • Throughput • Tools • Mechanism
Mechanism (Base on www) WWW HTTP Ping/iperf/netperf Result CMD Traceroute/Ping iperf netperf Data Test PC Remote Remote Remote
System Structure WAN Performance Measurement and Monitor WAN Performance Measurement WAN Performance Monitor Performance measurement command Performance measurement result Performance data collect Performance data analysis Data File
README • INSTALL • creat a user account: monitor • cd ~monitor • tar xvf monitor.tar • edit /usr/local/apache/conf/httpd.conf • edit the ~monitor/ping_list.txt and ~monitor/iperf_list.txt • edit the /etc/crontab • http://serveraddress/monitor/
Useful links • http://www.apan.net • http://ws.arin.net/cgi-bin/whois.pl