220 likes | 302 Views
Maintaining Packet Order in Two-Stage Switches. Isaac Keslassy, Nick McKeown Stanford University. OQ routers: + work-conserving - memory bandwidth = (N+1)R. R. R. R. R. R. R. IQ routers: + memory bandwidth = 2R - arbitration complexity. Bipartite Matching.
E N D
Maintaining Packet Order in Two-Stage Switches Isaac Keslassy, Nick McKeown Stanford University
OQ routers: • + work-conserving • - memory bandwidth = (N+1)R R R R R R R • IQ routers: • + memory bandwidth = 2R • - arbitration complexity Bipartite Matching Two Successive Scaling Problems
Today: 64 ports at 10Gbps, 64-byte cells. • Arbitration Time = = 51.2ns • Request/Grant Communication BW = 17.5Gbps 64bytes 10Gbps IQ Arbitration Complexity • Scaling to 160Gbps: • Arbitration Time = 3.2ns • Request/Grant Communication BW = 280Gbps • Two main alternatives for scaling: • Increase cell size (Kar et al., 2000) • Eliminate arbitration (Chang et al., 2001)
Desirable Characteristics for Router Architecture • Ideal: OQ • 100% throughput • Minimum delay • Maintains packet order • Necessary: able to regularly connect any input to any output • What if the world was perfect? Assume Bernoulli iid uniform arrival traffic...
1 1 N N Cyclic Shift? Uniform Bernoulli iid traffic: 100% throughput
1 1 N N Problem: real traffic is non-uniform Cyclic Shift? q(t) b(t) (t) Long-term service opportunities exceed arrivals:
1 2 2 1 1 1 1 N N N Two-Stage Switch External Inputs Internal Inputs External Outputs Load-balancing cyclic shift Switching cyclic shift 100% throughput for broad range of traffic types (C.S. Chang et al., 2001)
External Inputs Internal Inputs External Outputs q(t) 2(t) 1(t) 1 a(t) b(t) 1 1 N • First cyclic shift: N N • Traffic rate: • Long-term service opportunities exceed arrivals: Two-Stage Switch • (2 = 1 possible)
Two-Stage Switch Characteristics • Eliminates arbitration • 100% throughput • Conventional router packaging
1 1 1 N N N Using a Single Stage Twice Linecards Lookup Phase 1 Buffer 1 Lookup Buffer 2 Phase 2 Lookup Buffer 3
Optical Switch Fabric Two-Stage Switch Characteristics Optical links Racks of linecards
1 2 2 1 1 1 1 N N N Two-Stage Switch Characteristics External Inputs Internal Inputs External Outputs Cyclic Shift Cyclic Shift Problem: unbounded mis-sequencing
t External Inputs Internal Inputs External Outputs 2 1 1 1 2 2 1 1 1 N N N Cyclic Shift Cyclic Shift Full Frames First (FFF):Intuitive Idea • Idea: • Spread cells evenly across all linecards
External Inputs Internal Inputs External Outputs 2 1 2 3 1 1 1 1 N N N Cyclic Shift Cyclic Shift Full Frames First (FFF):Intuitive Idea • Idea: • Spread cells evenly across all linecards • Read them in order
External Inputs Internal Inputs External Outputs 1 1 2 1 1 1 2 3 N N N Cyclic Shift Cyclic Shift Full Frames First (FFF):Intuitive Idea • Idea: • Spread cells evenly across all linecards • Read them in order
t External Inputs Internal Inputs External Outputs 2 1 1 2 2 1 3 3 1 1 1 N N N Cyclic Shift Cyclic Shift First Problem Problem: if two packets don’t arrive consecutively, there may be a hole in the reading sequence
t 3 1 2 Flow Load Balancing Coordination Buffer (VOQ) 1 1 2 1 1 1 2 3 N N N Cyclic Shift Cyclic Shift Coordination Buffer Solution: collect cells from a flow in a coordination buffer, and load-balance them among linecards
t Input 1: Input 2: a b t Flow Load Balancing Coordination Buffer (VOQ) 2 3 1 b a 1 3 1 2 1 1 1 2 N N N Cyclic Shift Cyclic Shift Second Problem Problem: No access to cell 2 because of head-of-line blocking
a b 1 3 2 Expanding VOQ Structure Solution: expand VOQ structure by distinguishing among switch inputs
FFF: Guarantees Theorem 1: for any arrival process for which OQ has 100% throughput, so does FFF Theorem 2: for any arrival process, Davg (FFF) Davg(OQ) + (4N2 - 2) Theorem 3: FFF maintains packet order
R R 1 1 1 1 1 1 2 2 2 2 2 2 Cyclic Shift Cyclic Shift R/N R/N R/N 3 3 3 3 3 3 Passive mesh 2R/N Passive mesh Two-Stage Switch in Optics
Summary • FFF: practical algorithm that solves mis-sequencing • Same throughput as OQ, and average delay within a bound • New approach to optical switching