Opensample: A Low-latency, Sampling-based Measurement Platform for Software Defined Data Center

OpenSample
A Low-latency, Sampling-based Measurement
Platform for Software Defined Data Center
Junho Suh†, Ted “Taekyoung” Kwon†,
Colin Dixon‡, Wes Felter‡, and John Carter‡
†Seoul National University
‡IBM Research Austin
ICDCS'14@Madrid 1

Software Defined Networking (SDN)
OS
routing
VPN
…
monitoring
Control /
management
functions
Embedded OS
Switching ASIC
Open Interface
Network OS
Open Interface
routing
VPN
…
monitoring
CISCO Juniper
2
Legacy SDN
SDN
Measurement
Control
Decision
ICDCS'14@Madrid

X86 64bits High-performance
mainframe
3
Control Loop in
Software Defined Networking
• Control loop
SDN
Decision
(100us)
Measurement
Control
(100ms ~ 1sec+)
(10ms)
• Measurement is a bottleneck
– High latency of control loopDC
PerformanceApp performance
ICDCS'14@Madrid
Open Interface
Network OS
Open Interface
routing
VPN
…
monitoring
IBM RackSwitch G8264

Control Loop in
Software Defined Networking
• Control loop
(100us)
• For high-speed networks
– E.g., 10/40Gbps
X86 64bits High-performance
mainframe
4
SDN
Measurement
Control
Decision
ICDCS'14@Madrid
Open Interface
Network OS
Open Interface
routing
VPN
…
monitoring
IBM RackSwitch G8264
(100ms ~ 1sec+)
(10ms)

How Fast should Measurement do
for SDDC?
5
CDF of Flow Duration
Univ. DC
Flow Duration@1Gbps (ms)
*source: T. Benson, et al., “Network traffic characteristics of data
centers in the wild,” IMC`10
Production DC
Flow Duration@1Gbps (ms)
*source: Alizadeh et al., Background TCP flows, Microsoft data center
DCTCP, Sigcomm`10
• The situation is getting worse toward high-speed data center
networks (e.g., 1Gbps  10/40Gbps)
ICDCS'14@Madrid

Why are Measurements so Slow?
• Traditionally, this didn’t need to be fast
• Control plane of switch’s CPUs are wimpy
– Over taxing on switch’s CPU  increasing as flow table increases
– Ex) Polling flow counter and sampling
• Although faster CPU could help, but still a big gap between CPUs and ASICs
Kernel Driver
CPU
CPU
ASIC
PCI-E
ICDCS'14@Madrid 6
ASIC
Kernel Driver
CPU
PCI-E
XAUI, Aurora …

Is Packet Sampling a viable Solution?
• Estimating flow statistics from a packet sampling
1,000 pkts classified as A
Hmm…roughly
400,000 pkts of
class A are there~
– Maximum likelihood estimation (MLE)
• #packets ≈ #packets sampled X sampling ratio
2,500 pkts sampled
… …
… … …
1,000,000 pkts transiting
in a given measurement time
Sampling probability = 0.25%
Flow A
Flow B
Flow C
ICDCS'14@Madrid 7

Is Packet Sampling a Viable Solution?
• Theory behind this inference…
– Large number theory
– Estimation accuracy ∝ sqrt of # samples
2,500 pkts sampled
… …
… … …
1,000,000 pkts transiting
in a given measurement time
Sampling probability = 0.25%
1,000 pkts classified as A
Hmm…roughly
40% of pkts of
class A are there~
The number of pkts of
class A are in
[381,000, 419,000]
Flow A
Flow B
Flow C
ICDCS'14@Madrid 8

How many Samples really we can get?
1-in-N Peak at 350 pkts/sec
9
collector
client server
ICDCS'14@Madrid
• Micro benchmark
– IBM RackSwitch G8264
– Single TCP connection @10Gbps w/ TCP perf

How many Samples really we can get?
In 100ms, only 3,000 pkts
in avg. are arrived,
as well as 60 flows
*Dataset source: Benson, T., Network traffic characteristics of data centers in the wild, IMC 2010
• With a limit of 350 samples/sec, the situation is worse
ICDCS'14@Madrid 10

Can we gather Samples more?
• Two approaches to increase an accuracy
(= increase # samples)
– ↑ sampling probability
• Over taxing on switches’ CPU
– ↑ an measurement interval
• Violating OpenSample’s goal of low-latency
measurements
ICDCS'14@Madrid 11

Our Solution: Protocol-aware
Flow Statistics Detection Algorithm
• Fact: 99% of total traffic in data centers are
TCP flows
– Captures two distinct packet headers
• a timestamp and a TCP sequence number are exploited
– Ex) TCP packet A with seq# SA at time tA and B with seq# SB at
time tB such that tA < tB
ICDCS'14@Madrid 12

• Ex) Estimating flow statistics
13
… … …
S1 Throughput of flowS = (S2-S1)/(t3-t1)
Throughput of flowT = (T2-T1)/(t3-t1)
ICDCS'14@Madrid
S2
T2 T1
t4 t3 t2 t1
Streaming algorithm

• Ex) Estimating port statistics
– Exploiting MLE
• regards packets passing through a specific port as super
flow
14
measurement
interval
t2 t1
… … …
UtilportA = #samples * sampling rate
ICDCS'14@Madrid

• Benefits
– Streaming algorithm
• Near-real-time analysis
– High accuracy even with low sampling probability
– Independent with the sampling theory
• don’t need to know a sampling probability
• can capture samples at multiple points in a given network
• measurement delay depends only on latency between two
different samples
ICDCS'14@Madrid 15

How much a number of flows can be
detected?
• Probability of flow statistics detection in single switch
model
Disjoint events
– Pr{2+ samples} = 1 – Pr{get 0 or 1 sample}
– = 1 – Pr{0 sample} – Pr{1 sample}
– =1 - (1-p)n - np(1-p)n-1
Bernoulli trial
n: # of packets in a given flow
p: probability of packet sampled (0 ≤ p ≤ 1)
ICDCS'14@Madrid 16

detected?
17
• Probability of flow statisitcs detection w/ multiple
switches
– Pr{2+ samples} = 1 – (1-kp)n – nkp(1-kp)n-1
n: # of packets in the flow
k: # of switches
ICDCS'14@Madrid

detected?
18
• Probability of flow detection w/ multiple switches
n: # of packets in the flow
k: # of switches
ICDCS'14@Madrid

Protocol-aware Flow Statistics
Detection Algorithm (4/4)
• Flow detection delay
– E[D] = E[X1 + X2]
– = E[X1] + E[X2] = 2/λkp
D: delaying to acquire two samples from a given flow
X1, X2: the arrival time of the first and second sampled
packets
λ: inter-arrival rate
ICDCS'14@Madrid 19

Protocol-aware Flow Statistics
Detection Algorithm (4/4)
• Flow detection delay
ICDCS'14@Madrid 20

Implementation
• OpenSample collector
– Java-based collector with Netty NIO
framework
– sFlow v5.0 standard
– Reconstruct flow/port statistics
• Protocol-aware flow statistics detection
• Maximum Likelihood Estimation
• Floodlight SDN controller
– Traffic engineering application
ICDCS'14@Madrid 21

Benchmark Tests (Emulation)
• Mininet v2.0
– SDN emulator running in a single host
– Real traffic characteristics
– Results can be reproducible
ICDCS'14@Madrid 22

Benchmark Tests (Configuration)
• Topology
– FatTree (k=4) vs. non-blocking @ 10 Mbps
– 16 hosts and 3-levels
• Workloads
– Spatial locality of traffic patterns
• random (benign), staggered, stride (adversarial)
– Flow size following an exponential distribution
• with avg. 1MB and 1GB
• Benchmarks
– Polling-based flow scheduler
• Polling interval = 1 second
– MLE vs. Protocol-aware flow statistics detection algorithm
• Sampling rate: N=50 (High), N=200 (Low)
ICDCS'14@Madrid 23

Results: 1GB Long Flow
• Normalized aggregate throughput
ICDCS'14@Madrid 24

Results: 1MB Short Flow
• Normalized aggregate throughput
ICDCS'14@Madrid 25

• CDF of bytes left at the time
of detection
• CDF of bytes left at the time
of routing
ICDCS'14@Madrid 26

• The total bytes sent in 30s and the percent of those bytes
scheduled by traffic engineering for the STRIDE8 workload
ICDCS'14@Madrid 27

Conclusion
• OpenSample
– A working prototype of a low-latency, sampling-based measurement platform
– Reducing control loop latency from 1-5 seconds to 100 milliseconds
• Further reducing control loop as fast as 100us with hardware supports
– See Planck: Millisecond-scale Monitoring and Control for Commodity
Networks, Sigcomm`14
ICDCS'14@Madrid 28

Q&A
Email: jhsuh@mmlab.snu.ac.kr
ICDCS'14@Madrid 29

Opensample: A Low-latency, Sampling-based Measurement Platform for Software Defined Data Center

Recommended

More Related Content

What's hot (20)

Viewers also liked (8)

Similar to Opensample: A Low-latency, Sampling-based Measurement Platform for Software Defined Data Center (20)

Recently uploaded (20)

Opensample: A Low-latency, Sampling-based Measurement Platform for Software Defined Data Center

Editor's Notes