The Case Against Jumbo Frames. Richard A Steenbergen <ras@gtt.net> GTT Communications, Inc.

Similar documents
Advanced features on Ethernet networks. Fulvio Risso Politecnico di Torino

MPLS RSVP-TE Auto-Bandwidth: Practical Lessons Learned

MPLS RSVP-TE Auto-Bandwidth: Practical Lessons Learned. Richard A Steenbergen <ras@gt-t.net>

Chapter 3. TCP/IP Networks. 3.1 Internet Protocol version 4 (IPv4)

ADVANCED NETWORK CONFIGURATION GUIDE

MPLS for ISPs PPPoE over VPLS. MPLS, VPLS, PPPoE

IP addressing and forwarding Network layer

Internetworking II: VPNs, MPLS, and Traffic Engineering

Network Considerations for IP Video

Networking Test 4 Study Guide

Quality of Service Analysis of site to site for IPSec VPNs for realtime multimedia traffic.

VXLAN: Scaling Data Center Capacity. White Paper

TCP Offload Engines. As network interconnect speeds advance to Gigabit. Introduction to

RFC 6349 Testing with TrueSpeed from JDSU Experience Your Network as Your Customers Do

Juniper Networks EX Series/ Cisco Catalyst Interoperability Test Results. May 1, 2009

Securing EtherNet/IP Using DPI Firewall Technology

CMA5000 SPECIFICATIONS Gigabit Ethernet Module

Multipath TCP in Practice (Work in Progress) Mark Handley Damon Wischik Costin Raiciu Alan Ford

Link Layer. 5.6 Hubs and switches 5.7 PPP 5.8 Link Virtualization: ATM and MPLS

IP SAN Best Practices

Network administrators must be aware that delay exists, and then design their network to bring end-to-end delay within acceptable limits.

ACHILLES CERTIFICATION. SIS Module SLS 1508

How will the Migration from IPv4 to IPv6 Impact Voice and Visual Communication?

A SENSIBLE GUIDE TO LATENCY MANAGEMENT

Isilon IQ Network Configuration Guide

Encapsulating Voice in IP Packets

Exhibit n.2: The layers of a hierarchical network

VLAN und MPLS, Firewall und NAT,

Juniper / Cisco Interoperability Tests. August 2014

D1.2 Network Load Balancing

Where IT perceptions are reality. Test Report. OCe14000 Performance. Featuring Emulex OCe14102 Network Adapters Emulex XE100 Offload Engine

Ethernet Fabric Requirements for FCoE in the Data Center

Guide to TCP/IP, Third Edition. Chapter 3: Data Link and Network Layer TCP/IP Protocols

his document discusses implementation of dynamic mobile network routing (DMNR) in the EN-4000.

IEEE Congestion Management Presentation for IEEE Congestion Management Study Group

UPPER LAYER SWITCHING

Building High-Performance iscsi SAN Configurations. An Alacritech and McDATA Technical Note

Transport and Network Layer

Exam 1 Review Questions

MPLS Layer 3 and Layer 2 VPNs over an IP only Core. Rahul Aggarwal Juniper Networks. rahul@juniper.net

NetFlow Subinterface Support

Denial of Service Attacks

CS 457 Lecture 19 Global Internet - BGP. Fall 2011

Lecture 1: Course Introduction" CSE 222A: Computer Communication Networks Alex C. Snoeren

CCNA R&S: Introduction to Networks. Chapter 5: Ethernet

20-CS X Network Security Spring, An Introduction To. Network Security. Week 1. January 7

ESSENTIALS. Understanding Ethernet Switches and Routers. April 2011 VOLUME 3 ISSUE 1 A TECHNICAL SUPPLEMENT TO CONTROL NETWORK

hp ProLiant network adapter teaming

Brocade Solution for EMC VSPEX Server Virtualization

IPV6 FRAGMENTATION. The Case For Deprecation. Ron Bonica NANOG58

Technote. SmartNode Quality of Service for VoIP on the Internet Access Link

EINDHOVEN UNIVERSITY OF TECHNOLOGY Department of Mathematics and Computer Science

Using IPM to Measure Network Performance

Overview of Network Measurement Tools

Introduction to LAN/WAN. Network Layer (part II)

Internetworking. Problem: There is more than one network (heterogeneity & scale)

Approaches for DDoS an ISP Perspective.

Windows TCP Chimney: Network Protocol Offload for Optimal Application Scalability and Manageability

Network Security: Workshop. Dr. Anat Bremler-Barr. Assignment #2 Analyze dump files Solution Taken from

IP videoconferencing solution with ProCurve switches and Tandberg terminals

CS514: Intermediate Course in Computer Systems

Debugging With Netalyzr

Lecture 15: Congestion Control. CSE 123: Computer Networks Stefan Savage

White Paper Abstract Disclaimer

Final Exam. Route Computation: One reason why link state routing is preferable to distance vector style routing.

A Performance Analysis of Gateway-to-Gateway VPN on the Linux Platform

Question: 3 When using Application Intelligence, Server Time may be defined as.

N5 NETWORKING BEST PRACTICES

NETWORK LAYER/INTERNET PROTOCOLS

How To Configure Voice Vlan On An Ip Phone

This topic lists the key mechanisms use to implement QoS in an IP network.

Asynchronous Transfer Mode: ATM. ATM architecture. ATM: network or link layer? ATM Adaptation Layer (AAL)

CS5490/6490: Network Security- Lecture Notes - November 9 th 2015

Analysis of Basic Quality of Service Mechanism for Voice over IP In Hamdard University Network Campus

Mike Canney Principal Network Analyst getpackets.com

Names & Addresses. Names & Addresses. Hop-by-Hop Packet Forwarding. Longest-Prefix-Match Forwarding. Longest-Prefix-Match Forwarding

Cisco EtherSwitch Network Modules

Computer Network. Interconnected collection of autonomous computers that are able to exchange information

Enhancing Cisco Networks with Gigamon // White Paper

ethernet services for multi-site connectivity security, performance, ip transparency

Zarząd (7 osób) F inanse (13 osób) M arketing (7 osób) S przedaż (16 osób) K adry (15 osób)

Burst Testing. New mobility standards and cloud-computing network. This application note will describe how TCP creates bursty

Flow processing and the rise of the middle.

TCP/IP Jumbo Frames Network Performance Evaluation on A Testbed Infrastructure

Ten top problems network techs encounter

NAT TCP SIP ALG Support

Fibre Channel over Ethernet in the Data Center: An Introduction

Virtual PortChannels: Building Networks without Spanning Tree Protocol

An Introduction to VoIP Protocols

Communications and Computer Networks

Protecting and controlling Virtual LANs by Linux router-firewall

FWSM introduction Intro 5/1

Internetworking and IP Address

Accelerating High-Speed Networking with Intel I/O Acceleration Technology

High Speed I/O Server Computing with InfiniBand

Transcription:

The Case Against Jumbo Frames Richard A Steenbergen <ras@gtt.net> GTT Communications, Inc. 1

What s This All About? What the heck is a Jumbo Frame? Technically, the IEEE 802.3 Ethernet standard defines the maximum frame payload (MTU) value at 1500 bytes. Supporting anything larger than 1500 bytes is outside of this standard, and we call it a Jumbo Frame. How much bigger? Nobody really knows. It s non-standard, remember. The rough guideline for most people is around 9000. This is a historical number from the original Alteon proposal. Most vendors today actually offer a different number. 2

It s Over 9000!!! 3

The Goal of Jumbo Frames Improved High Speed Transfer Efficiency Most basic host operations are aligned around 4096 bytes. Memory pages, iscsi data blocks, etc, etc. Sending 1500 byte packets doesn t align well with these operations Results in poor chunking and waiting for buffers to fill. Reduced packet/sec routing lookup load. Increasing packet sizes can decrease the PPS rate. Increased flexibility when tunneling. 1500 byte MTUs are the defacto standard of the Internet. So doing 1500 plus tunnel overhead is hard to do well. 4

The Intended Use of Jumbo Frames The current paradigm is to fill the MTU to the max. Sending many MB of data in 1460 + Headers chunks. Most of packets on the Internet are 1500 bytes long. The rest are mostly TCP ACKs to those 1500 byte packets. J The proposed Jumbo Frame usage is different. You aren t supposed to send 9K packets every time. Send 4096*2 = 8192 bytes of payload, plus headers. 8192 payload + IP (20) + TCP (20) + Ethernet (14), etc. The extra buffer up to 9000(something) is intended to increase flexibility for using different packet headers. 5

Starting To See Inter-Network Deployment Until recently, jumbo frames have primarily been an internal network only thing at best. But now some IX operators are starting to roll out Jumbo Frame VLANs at major exchanges. This could eventually lead to the ability to deliver a > 1500 byte packet end-to-end. And many people are cheerleading this effort. With a lot of idealism about improving the efficiency of high-speed end-to-end flows, which is a good thing. 6

But this could all be a Very Bad Idea

Picking an MTU Number Picking an MTU value is really hard work. Remember, there is no standard value defined anywhere. Nor are their any negotiation protocols to automate it. So we re down to manual negotiation between operators. And it s all made even harder by router vendors. There isn t even a standard for using the same number! Cisco IOS 1500 equals Juniper & IOS XR 1514. But wait, it gets even worse Cisco IOS 1500 on an 802.1q tagged interface equals Juniper 1518, or 1522 on a Q-in-Q link, etc. And if you get it wrong, you silently blackhole traffic. 8

Path MTU Discovery Sucks Path MTU Discovery (PMTUD) is how the Internet deals with MTU mismatch today. When a router encounters an MTU mismatch and a frame that is too large, it drops the packet and sends an ICMP. The host receives the ICMP, and reduces the packet size for the retransmission and the flow. Hopefully it now fits. But PMTUD is broken beyond words. Routers can only generate these ICMP packets so fast. It s incredibly vulnerable to Denial of Service attacks too. ICMP packets are often limited/blocked by ISPs or users. The only reason stuff works today IS the 1500 byte MTU. 9

Other Problems Larger packet sizes increases jitter. Making 100 Gigabit flows more efficient is a noble cause. But a single 9000 byte packet down a 10 Mbps link has 7.2ms of serialization delay alone. The benefits are non-existent until you have a 100% Jumbo enabled end-to-end path. No content network will ever enable a service which has minimal benefit and which risks breaking the flows for any percentage of their customer base. 10

And It s All Totally Unnecessary Jumbo Frames are a nearly 15 year old idea. 15 years ago, they helped a host deliver a gigabit flow. Today, they re really completely unnecessary for that. Modern NICs help eliminate most of this. With techniques like Large Segment Offload (LSO). Instead of making it a big hassle to move data around in 1500-byte chunks, you just hand the NIC a 64k buffer. So we re solving a problem we really don t even have. 11

Tunneling PROVES How Broken PMTUD Is The argument that establishing > 1500 byte MTU IX s helps enable tunneling is an interesting one. But it actually proves the point that PMTUD is broken. If MTU mismatches in the Internet actually worked, you wouldn t have a problem tunneling 1476 over GRE. Creating more MTU mismatches to solve the problem of MTU mismatch is fundamentally flawed logic. 12

What You d Need To Raise the MTU Bar So what do you need before you can even try to dabble in the space of raising the Internet MTU? Reliable MTU Negotiation between end-points. Making operators negotiate a value between every connected device, even if the numbers meant the same thing on every device, doesn t scale. You need a negotiation protocol to find Layer 2 MTU. And with Ethernet, this needs to be per-mac. And for PMTUD to work you need to know it at L3. So the only sensible place to do it would be ARP. 13

But What About Path MTU Discovery Path MTU Discovery is still the fundamental flaw. Requiring that a router drop a packet, generate an ICMP, have that ICMP successfully make it back to the host before we even KNOW about an MTU mismatch is horrible in every sense of the word. This is not supportable at speed in modern router architectures. This is incredibly vulnerable to Denial of Service attacks. It adds latency and stalls performance on every flow. And there is no replacement on the horizon. In Short: Internet-wide Jumbo Frames will probably cause infinitely more harm than good under the current technology. 14

Send questions, comments, complaints to: Richard A Steenbergen <ras@gtt.net> GTT Communications, Inc.