Exploration of Large Scale Virtual Networks. Open Network Summit 2016

Similar documents
Linux NIC and iscsi Performance over 40GbE

Performance of Host Identity Protocol on Nokia Internet Tablet

ICS 351: Today's plan. IP addresses Network Address Translation Dynamic Host Configuration Protocol Small Office / Home Office configuration

How To Compare Performance Of A Router On A Hypervisor On A Linux Virtualbox 2.5 (Xen) To A Virtualbox (Xeen) Xen-Virtualization (X

Performance Evaluation of VMXNET3 Virtual Network Device VMware vsphere 4 build

Performance Analysis of IPv4 v/s IPv6 in Virtual Environment Using UBUNTU

1000Mbps Ethernet Performance Test Report

Quantifying the Performance Degradation of IPv6 for TCP in Windows and Linux Networking

Programmable Networking with Open vswitch

How To Monitor And Test An Ethernet Network On A Computer Or Network Card

Adobe LiveCycle Data Services 3 Performance Brief

Assessing the Performance of Virtualization Technologies for NFV: a Preliminary Benchmarking

Globus Striped GridFTP Framework and Server. Raj Kettimuthu, ANL and U. Chicago

Linux Virtual Server Tutorial

Network performance in virtual infrastructures

Dell PowerVault MD Series Storage Arrays: IP SAN Best Practices

D1.2 Network Load Balancing

Introduction to LAN/WAN. Network Layer (part II)

Performance Evaluation of Linux Bridge

TCP Labs. WACREN Network Monitoring and Measurement Workshop Antoine Delvaux perfsonar developer

ZEN LOAD BALANCER EE v3.04 DATASHEET The Load Balancing made easy

Where IT perceptions are reality. Test Report. OCe14000 Performance. Featuring Emulex OCe14102 Network Adapters Emulex XE100 Offload Engine

Question: 3 When using Application Intelligence, Server Time may be defined as.

Zeus Traffic Manager VA Performance on vsphere 4

Distributed File System Performance. Milind Saraph / Rich Sudlow Office of Information Technologies University of Notre Dame

TCP/IP Support Enhancements

Microsoft SQL Server 2012 on Cisco UCS with iscsi-based Storage Access in VMware ESX Virtualization Environment: Performance Study

Sockets vs. RDMA Interface over 10-Gigabit Networks: An In-depth Analysis of the Memory Traffic Bottleneck

Chapter 11. User Datagram Protocol (UDP)

Network Layers. CSC358 - Introduction to Computer Networks

VMWARE WHITE PAPER 1

CS335 Sample Questions for Exam #2

Wave Relay System and General Project Details

z/os V1R11 Communications Server system management and monitoring

MEASURING WORKLOAD PERFORMANCE IS THE INFRASTRUCTURE A PROBLEM?

VXLAN: Scaling Data Center Capacity. White Paper

Introduction. Application Performance in the QLinux Multimedia Operating System. Solution: QLinux. Introduction. Outline. QLinux Design Principles

Hardware Performance Optimization and Tuning. Presenter: Tom Arakelian Assistant: Guy Ingalls

50. DFN Betriebstagung

How Solace Message Routers Reduce the Cost of IT Infrastructure

Network Security TCP/IP Refresher

The IP Transmission Process. V1.4: Geoff Bennett

Control of Multiple Packet Schedulers for Improving QoS on OpenFlow/SDN Networking

Windows Server Performance Monitoring

IBM Tivoli Composite Application Manager for Microsoft Applications: Microsoft Hyper-V Server Agent Version Fix Pack 2.

JP1 Version 11: Example Configurations

ZEN LOAD BALANCER EE v3.02 DATASHEET The Load Balancing made easy

IP SAN Best Practices

Windows Server 2012 R2 Hyper-V: Designing for the Real World

Oracle Database Scalability in VMware ESX VMware ESX 3.5

1 Data information is sent onto the network cable using which of the following? A Communication protocol B Data packet

Open Source in Network Administration: the ntop Project

IP videoconferencing solution with ProCurve switches and Tandberg terminals

Scalable Linux Clusters with LVS

Removing Performance Bottlenecks in Databases with Red Hat Enterprise Linux and Violin Memory Flash Storage Arrays. Red Hat Performance Engineering

PCI Express High Speed Networks. Complete Solution for High Speed Networking

Quantum StorNext. Product Brief: Distributed LAN Client

A Dell Technical White Paper Dell Storage Engineering

Ethernet-based Storage Configuration

Communications Server for Linux

Network Virtualization Technologies and their Effect on Performance

OpenDaylight Performance Stress Tests Report

EVALUATING NETWORK BUFFER SIZE REQUIREMENTS

Course Overview: Learn the essential skills needed to set up, configure, support, and troubleshoot your TCP/IP-based network.

One Server Per City: C Using TCP for Very Large SIP Servers. Kumiko Ono Henning Schulzrinne {kumiko, hgs}@cs.columbia.edu

Iperf Tutorial. Jon Dugan Summer JointTechs 2010, Columbus, OH

IOS110. Virtualization 5/27/2014 1

Quantifying TCP Performance for IPv6 in Linux- Based Server Operating Systems

co Characterizing and Tracing Packet Floods Using Cisco R

L-Series LAN Provisioning Best Practices for Local Area Network Deployment. Introduction. L-Series Network Provisioning

Performance Tuning Guidelines for PowerExchange for Microsoft Dynamics CRM

Chapter 3. Internet Applications and Network Programming

Industrial Communication Whitepaper. Principles of EtherNet/IP Communication

Monitoring high-speed networks using ntop. Luca Deri

1. The Web: HTTP; file transfer: FTP; remote login: Telnet; Network News: NNTP; SMTP.

Application. Transport. Network. Data Link. Physical. Network Layers. Goal

Transport and Network Layer

A Comparative Study on Vega-HTTP & Popular Open-source Web-servers

Network Load Balancing

SAS deployment on IBM Power servers with IBM PowerVM dedicated-donating LPARs

SIDN Server Measurements

Network Technologies for Next-generation Data Centers

Strategies. Addressing and Routing

Resource Utilization of Middleware Components in Embedded Systems

SO_REUSEPORT Scaling Techniques for Servers with High Connection Rates. Ying Cai

Hands on Workshop. Network Performance Monitoring and Multicast Routing. Yasuichi Kitamura NICT Jin Tanaka KDDI/NICT APAN-JP NOC

Riverbed Stingray Traffic Manager VA Performance on vsphere 4 WHITE PAPER

Performance of Software Switching

Linux KVM Virtual Traffic Monitoring

Big Data Technologies for Ultra-High-Speed Data Transfer and Processing

- An Essential Building Block for Stable and Reliable Compute Clusters

Overview of Computer Networks

DIABLO TECHNOLOGIES MEMORY CHANNEL STORAGE AND VMWARE VIRTUAL SAN : VDI ACCELERATION

VXLAN Performance Evaluation on VMware vsphere 5.1

New Products and New Features May, 2015

ADVANCED NETWORK CONFIGURATION GUIDE

Active-Active Servers and Connection Synchronisation for LVS

ReadyNAS Remote White Paper. NETGEAR May 2010

Transcription:

Exploration of Large Scale Virtual Networks Open Network Summit 2016 David Wilder wilder@us.ibm.com

A Network of Containers Docker containers Virtual network More containers.. 1 5001 2 4 OpenVswitch or Linux Bridge 5002 5003 5000 Each container has a unique network name-space consisting of: Virtual network interfaces Routing and arp tables Network statistics and more containers.. 10000 2

Project Objectives Build and evaluate large virtual networks on Linux using: Docker, Linux Bridge, OpenVswitch. Learn how virtual networks: scale. perform. break. 3

What I Measured: Aggregate bandwidth TCP connection rate Latency broadcast and unicast Up to 1600 docker nodes IPV4 4

Some details.. 80 processor power8 (SMT=8) 132 GB Linux kernel version: 4.4.0 rc5+ (ppc64le) OpenVswitch V2.5.90 Docker 1.5.0 5

Throughput Measurements ovs-vswitchd iperf clients iperf servers 1 801 2 4 OpenVswitch In kernel flow table 802 803 800 network streams 800 1600 Docker containers Docker containers 6

Predictions Iperf is not performing any real I/O and will become cpu bound. CPU load average ~= Number of iperf streams. Throughput will level out when the number of stream equals the number of processors. 7

CPU Utilization: 8

Aggregate Bandwidth 9

Linux Bridge compared to OpenVswitch 10

Issues Linux Bridge Limited to a maximum of 1024 ports. Linux/net/bridge/br_private.h #define BR_PORT_BITS 10 #define BR_PORT_BITS 14 /* (16k ports) */ OpenVswitch At over 400 streams iperf sporadically failed to establish a connection returning the error No route to host. 11

Measuring TCP Connection Rate ovs-vswitchd (user) 112,010 netlink sockets Ovs-benchmark 1 801 OpenVswitch (kernel) 2 4 Flow Table Connection requests 802 803 800 1600 Docker containers Docker containers 12

TCP Connection Rate 13

Measuring latency of unicast and brodcast packets. Receivers 2 1 Sender OpenVswitch / Linux Bridge Brodcast and Unicast UDP packets with timestamps n Idle containers n+1 1600 14

Unicast Latency 15

Packet Flooding. Sender 1 OpenVswitch / Linux Bridge flood Receiver 2 1600 sysctl -w net.core.netdev_max_backlog=10000 16

Broadcast Packet Latency 17

Multiple Bridge Configuration - Eleven Bridges 1 Sender Receiver Same bridge Receiver Different bridge Bridge br1 Bridge br10 Bridge br2 Bridge br0 Bridge br3 Bridge br4 Bridge br5 Up to 1600 Containers were evenly distributed between br1 through br10 Bridge br9 Bridge br8 Bridge br7 Bridge br6 18

Broadcast Latency single vs. multiple bridges 19

Tuning Prevent broadcast packet loss due to overflowing per-cpu queue: sysctl -w net.core.netdev_max_backlog=10000 Prevent Neighbor (ARP) table overflows sysctl -w net.ipv4.neigh.default.gc_thresh1=1024 sysctl -w net.ipv4.neigh.default.gc_thresh2=2048 sysctl -w net.ipv4.neigh.default.gc_thresh3=4096 dmesg: neighbour: arp_cache: neighbor table overflow! 20

Increasing the nofiles limit for ovs-vswitchd Ovs-vswitchd creates many netlink sockets to communicate with the kernel. 112K sockets in our set-up. Each netlink socket requires a file descriptor. The value of nofiles for the ovs-vswitchd must be at least: (Number of cpus * Number of switch ports). $ prlimit p \ `cat /var/run/openvswitch/ovs vswitchd.pid` \ nofile=200000 21

In Summary... Throughput becomes a function of CPU (loadaverage). Consider how contention for CPU will affect throughput. Massive layer 2 domains will have a large broadcast latency. I saw a decreased latency when using multiple bridges. 22

Legal Statement This work represents the view of the authors and does not necessarily represent the view of IBM. IBM is a registered trademark of International Business Machines Corporation in the United States and/or other countries. Linux is a registered trademark of Linus Torvalds. Microsoft and Windows are trademarks of Microsoft Corporation in the United States, other countries, or both. Other company, product, and service names may be trademarks or service marks of others. 23