Large-Scale Passive Network Monitoring using Ordinary Switches



Similar documents
Large-Scale Passive Monitoring using SDN

Microsoft s Demon Datacenter Scale Distributed Ethernet Monitoring Appliance

Scalable Network Monitoring with SDN-Based Ethernet Fabrics

SDN AND SECURITY: Why Take Over the Hosts When You Can Take Over the Network

How To Make A Vpc More Secure With A Cloud Network Overlay (Network) On A Vlan) On An Openstack Vlan On A Server On A Network On A 2D (Vlan) (Vpn) On Your Vlan

Packet Optimization & Visibility with Wireshark and PCAPs. Gordon Beith Director of Product Management VSS Monitoring

How To Orchestrate The Clouddusing Network With Andn

OpenFlow and Software Defined Networking presented by Greg Ferro. OpenFlow Functions and Flow Tables

Open SDN for Network Visibility

Wireshark Developer and User Conference

Cloud Networking Disruption with Software Defined Network Virtualization. Ali Khayam

VXLAN: Scaling Data Center Capacity. White Paper

OVERLAYING VIRTUALIZED LAYER 2 NETWORKS OVER LAYER 3 NETWORKS

VXLAN Bridging & Routing

Network Virtualization Based on Flows

NfSen Plugin Supporting The Virtual Network Monitoring

Ethernet-based Software Defined Network (SDN) Cloud Computing Research Center for Mobile Applications (CCMA), ITRI 雲 端 運 算 行 動 應 用 研 究 中 心

BROADCOM SDN SOLUTIONS OF-DPA (OPENFLOW DATA PLANE ABSTRACTION) SOFTWARE

Ten Things to Look for in an SDN Controller

SOFTWARE-DEFINED NETWORKING AND OPENFLOW

Cisco Certified Network Associate Exam. Operation of IP Data Networks. LAN Switching Technologies. IP addressing (IPv4 / IPv6)

Carrier/WAN SDN Brocade Flow Optimizer Making SDN Consumable

Software Defined Networking What is it, how does it work, and what is it good for?

NEC contribution to OpenDaylight: Virtual Tenant Network (VTN)

Enhancing Cisco Networks with Gigamon // White Paper

基 於 SDN 與 可 程 式 化 硬 體 架 構 之 雲 端 網 路 系 統 交 換 器

CCT vs. CCENT Skill Set Comparison

SOFTWARE-DEFINED NETWORKING AND OPENFLOW

Scalable Network Monitoring with SDN-Based Ethernet Fabrics

Software Defined Networking and the design of OpenFlow switches

Network Virtualization for the Enterprise Data Center. Guido Appenzeller Open Networking Summit October 2011

Software-Defined Networking for the Data Center. Dr. Peer Hasselmeyer NEC Laboratories Europe

Data Analysis Load Balancer

Interconnecting Cisco Networking Devices Part 2

Software Defined Networking and OpenFlow: a Concise Review

Application Note Gigabit Ethernet Port Modes

SDN Controller Requirement

: Interconnecting Cisco Networking Devices Part 1 v2.0 (ICND1)

SDN and OpenFlow. Naresh Thukkani (ONF T&I Contributor) Technical Leader, Criterion Networks

Ethernet-based Software Defined Network (SDN)

SDN Overview for UCAR IT meeting 19-March Presenter Steven Wallace Support by the GENI Program Office!

Affording the Upgrade to Higher Speed & Density

- EtherChannel - Port Aggregation

SSVVP SIP School VVoIP Professional Certification

IP Addressing and Subnetting. 2002, Cisco Systems, Inc. All rights reserved.

Cisco Data Centre: Introducing Cisco Data Center Networking

Securing Local Area Network with OpenFlow

Tutorial: OpenFlow in GENI

Simplify IT. With Cisco Application Centric Infrastructure. Barry Huang Nov 13, 2014

Understanding OpenFlow

"Charting the Course...

SDN in the Public Cloud: Windows Azure. Albert Greenberg Partner Development Manager Windows Azure Networking

Application Note. Stateful Firewall, IPS or IDS Load- Balancing

Application Performance Management - Deployment Best Practices Using Ixia- Anue Net Tool Optimizer

SSL Inspection Step-by-Step Guide. June 6, 2016

1 Data information is sent onto the network cable using which of the following? A Communication protocol B Data packet

hp ProLiant network adapter teaming

Data Center Infrastructure of the future. Alexei Agueev, Systems Engineer

Networking 4 Voice and Video over IP (VVoIP)

Enabling Visibility for Wireshark across Physical, Virtual and SDN. Patrick Leong, CTO Gigamon

SDN. WHITE PAPER Intel Ethernet Switch FM6000 Series - Software Defined Networking. Recep Ozdag Intel Corporation

Network Technologies for Next-generation Data Centers

OpenFlow and Onix. OpenFlow: Enabling Innovation in Campus Networks. The Problem. We also want. How to run experiments in campus networks?

: Interconnecting Cisco Networking Devices Part 2 v1.1

Extending Networking to Fit the Cloud

Configuring EtherChannels

IMPLEMENTING CISCO SWITCHED NETWORKS V2.0 (SWITCH)

Software Defined Networking (SDN) - Open Flow

Software Defined Network (SDN)

Technical Bulletin. Enabling Arista Advanced Monitoring. Overview

Application Delivery Networking

How To Switch A Layer 1 Matrix Switch On A Network On A Cloud (Network) On A Microsoft Network (Network On A Server) On An Openflow (Network-1) On The Network (Netscout) On Your Network (

Software Defined Networking A quantum leap for Devops?

LANs and VLANs A Simplified Tutorial

SDN CONTROLLER. Emil Gągała. PLNOG, , Kraków

Packet Sniffing on Layer 2 Switched Local Area Networks

Introduction to Software Defined Networking (SDN) and how it will change the inside of your DataCentre

How To Learn Cisco Cisco Ios And Cisco Vlan

20. Switched Local Area Networks

Internet Firewall CSIS Internet Firewall. Spring 2012 CSIS net13 1. Firewalls. Stateless Packet Filtering

Flow Monitor Configuration. Content CHAPTER 1 MIRROR CONFIGURATION CHAPTER 2 RSPAN CONFIGURATION CHAPTER 3 SFLOW CONFIGURATION...

OpenFlow: History and Overview. Demo of routers

How do I get to

How To Understand and Configure Your Network for IntraVUE

Splunk for Networking and SDN

Network Virtualization and Software-defined Networking. Chris Wright and Thomas Graf Red Hat June 14, 2013

Network Agent Quick Start

Getting to know OpenFlow. Nick Rutherford Mariano Vallés

Why Software Defined Networking (SDN)? Boyan Sotirov

Underneath OpenStack Quantum: Software Defined Networking with Open vswitch

Flow Monitor Configuration. Content CHAPTER 1 MIRROR CONFIGURATION CHAPTER 2 SFLOW CONFIGURATION CHAPTER 3 RSPAN CONFIGURATION...

> Technical Configuration Guide for Microsoft Network Load Balancing. Ethernet Switch and Ethernet Routing Switch Engineering

Transcription:

Large-Scale Passive Network Monitoring using Ordinary Switches Justin Scott Senior Network OPS engineer Juscott@microsoft.com Rich Groves Rgroves@a10networks.com

Preface We are network Engineers This isn t a Microso4 Product We are here to share methods and Knowledge. Hopefully we can all con@nue to help foster evolu@on in the industry

About Jus0n Sco3 Started career at MSFT in 2007. Network Opera@ons engineer, specialized in high profile, high stress outages Turned to packet analysis to get through ambiguous problem statements Frustrated by our inability to exonerate our network quickly. Lack of ability to data mine telemetry data at the network layer Sharkfest 2014

What s this about? A different way of aggregate data from a TAP/SPAN Our struggle with other approaches An Architecture based on OpenFlow and use of commodity merchant silicon A whole new world of use- cases Learnings we ve taken away

The Scale of the cloud Thousands of 10g links per Data Center 8,16 and 32x10g uplinks from TORs Cost makes it a non-starter with commercial solutions

Prior itera0ons Capture- Net Ø Consisted of off the shelf aggrega@on gear, which was far too expensive at scale Ø high cost made tool purchases a difficult pitch, no point without tools Ø Resulted in lots of gear gathering dust Ø Opera@ons not mature enough to back such a solu@on PMA/PUMA Passive Measurement Architecture Ø Lower cost than Capture- net Ø Designed for a specific environment and not intended to scale Ø Extremely feature rich Nemesys AKA Rich s crazy ideas Ø Big hub - Switched network with MAC learning turned off Le4 with shuffling sniffers around the DC as troubleshoots popped up.

Ques0ons? NOT THE END Just took a step back

What features make up a packet broker? terminates taps Can match on a 5- tuple duplica@on 80% Packets unaltered low latency Stats Layer 7 packet inspec@on Time stamps Frame Slicing 20% Microburst detec@on Sharkfest 2014

Reversing the Packet Broker Filter Ports MUX Service backplane Filter Ports (pre-filter, de-duplication of data) Service! Service! Timestamps,DPI,etc Delivery Ports (data duplication and delivery) Delivery

Can You Spot the Off the Shelf Packet Broker? Which is 20x more expensive? What do these have in common? They are all the same switch! Is it called a Packet Broker cause it makes you broker? -Raewyn Groves (Rich s Daughter)

11 Architecture

The glue SDN Controller Openflow 1.0 Ø Ø Ø Ø Controller runs as an agent on the switch Standards managed by the Open Networking Founda0on developed at Stanford 2007-2010 Can match on SRC and/or DST fields of either TCP/UDP, IP, MAC., ICMP code & types, Ethertype, Vlan id Controller Discovers topology via LLDP Can manage whole solu0on via remote API, CLI or web GUI

Mul0- Tenant Distributed Ethernet Monitoring Appliance Enabling Packet Capture and Analysis at Enterprise Scale monitor ports filter mux filter service service delivery Appliance tooling 20X cheaper than off the shelf solutions

Filter Layer Monitor Ports Filter Filter e Terminates all monitor ports Drops all traffic by default De- duplica@on of data if needed Aggressive sflow exports

Mux Layer Monitor Ports MUX Aggregates all filter switches in a data center Directs traffic to either service nodes or delivery interfaces Enables service chaining per policy

Services Nodes monitor ports r Service Service Aggregated by mux layer. Majority of cost is here Flows DON T need to be sent through the service by default. Service chaining Some Service: Deeper (layer 7) filtering Time stamping Microburst detec@on Traffic Ra@o s (SYN/SYNC ACK) Frame slicing - (64, 128 byte) Payload removal for compliance Rate limi@ng

Delivery Layer monitor ports r Delivery tooling 1:N and N:1 delivery to duplica@on of data Delivery to local or tunneling to remote tools

Controller Router Router Filter_Switch2 Filter_Switch1 Filter_Switch3 policy demo description Ticket 12345689 1 match tcp dst-port 1.1.1.1 2 match tcp src-port 1.1.1.1 filter-interface Filter_Switch1_Port1 filter-interface Filter_Switch1_Port2 filter-interface Filter_Switch2_Port1 filter-interface Filter_Switch2_Port2 filter-interface Filter_Switch3_Port1 filter-interface Filter_Switch3_Port2 delivery-interface Capture_server_NIC1

Extras Intelligence of the solu0on Ø Ø Ø Ø Ø Ø Ø Ø Ø Overlapping flow support Vlan rewrite ARP glean Marker packet Stats Mul0- User support Tap Port grouping Self termina0ng policy bring your own innova0on controller

Use Cases and Examples 20 Microsoft Confidential Internal Use Only

Reac0ve Ops Use- cases Split the network into inves0ga0on domains. Quickly exonerate or implicate the network Time gained not physical moving sniffers from room to room Verify TCP intelligent network appliance are opera0ng as expected

IPV6 Problem Statement: Users on a large ISP in Sea3le are intermi3ently unable to connect to exchange via IPV6. Repro facts: 3- way TCP connec0on setup s up. 9- way SSL handshake fails Ack for Client hello doesn t make it back to Loadbalancer Solu0on: Implicates or exonerates advanced L7 devices that are commonly finger pointed Root cause: Race condi0on - If the client hello was received on the LoadBalancer, before the backend connec0on was made it would trigger the bug

Proac0ve monitoring Use- case Relying sole on SNMP polling and syslogs gives you false confidence Exposure to the underlying TCP telemetry data is true network performance data Detect retransmissions (TCP- SACK)

Controller Router Router Filter_Switch2 Filter_Switch1 policy demo description Ticket 12345689 short desc 1 match tcp dst-port 443 2 match tcp src-port 443 filter-interface Filter_Switch1_Port1 filter-interface Filter_Switch1_Port2 filter-interface Filter_Switch2_Port1 filter-interface Filter_Switch2_Port2 delivery-interface Capture_server_NIC1 use-service remove_payload chain 1 use-service Layer_7_service_TCP_sack_match chain 2

Port- channels and delivery Load-balance to multiple tools - Symmetric hashing Duplicate data to multiple delivery interfaces Binding portchannels to Openflow Services expanding multiple interfaces

Increase Visibility on Large L2 networks Connec0ng a filter- interface to a L2 network as a trunked link Unicast flooding: NLB is a loadbalancing technology that doesn t use tradi0onal hardware based LB s. Stolen gateway: Human fat fingers an IP address as the default- gateway. Broadcasts: All fun in games un0l the rate of broadcasts increase over some dura0on and starve out legi0mate traffic AKA broadcast storm. STP TCN: A single packet that indicates the whole L2 network CAM table is going to be flushed and relearnt. A single occurrence isn t the end of the world, but if it s frequent occurrence bad things are happening.

Adding sflow Analysis monitor ports filter1 filter sflow samples sourced from all interfaces mux delivery Controlle r Behavioral analysis More meaningful captures are taken sflow collector logic 27

Remote delivery monitor ports filter1 filter mux Controlle r delivery Produc@on network 28 Encap and send to remote tool in other DC Decap on arrival

Basic Openflow Pinger Functionality Spine Spine Spine Spine packet is transmijed through the openflow control channel Packet flows through the produc@on network Leaf 1 Leaf 2 Packet is des@ned toward example dest 10.1.1.1 Demon 1588 Switch Controller Openflow encap is removed inner packet is transmijed through specified output port Packets Packets are des@ned counted for and @mestamps controller are read encapsulated for @ming analysis through Openflow control channel Packet is cra4ed based on a template

30 Cost & Caveats

Solu0on breakdown Cost to build out a TAP infrastructure to support 200 10gig links. Cost Tap strip Filter Switch MUX Tap strip Delivery Switch Capture Filter Switch Server Packet MUX Broker Delivery Switch Capture Server SDN Controller Packet Broker Number of links

Learnings of raw openflow solu0on Short term hurtles Ø TCAM limits Ø IPv6 support Ø Lacking mul@vendor support in the same ecosystem Ø Can t match on TCP/IP if packets encapsulated (MPLS,IP- IP) Ø Most services are sta@c and have a 10Gig cap. Openflow Ecosystem Ø switch vendors implement Openflow a lijle differently Ø Commercial controller support is splintering. Whitebox switches/bare metal Ø Total access/control of the underlying hardware

33 Questions?

Microsom is a great place to work! We need experts like you. We have larger than life problems to solve and are well supported Networking is cri@cal to Microso4's online success and well funded. Washington is beau@ful! It doesn t rain that much. We just say that to keep people from cali from moving in