Virtualization: TCP/IP Performance Management in a Virtualized Environment Orlando Share Session 9308



Similar documents
Virtual Networking with z/vm Guest LAN and Virtual Switch

Virtual Networking with z/vm Guest LANs and the z/vm Virtual Switch

Oracle Networking and High Availability Options (with Linux on System z) & Red Hat/SUSE Oracle Update

Performance Evaluation of VMXNET3 Virtual Network Device VMware vsphere 4 build

Top 10 Tips for z/os Network Performance Monitoring with OMEGAMON Session 11899

Top 10 Tips for z/os Network Performance Monitoring with OMEGAMON. Ernie Gilman IBM. August 10, 2011: 1:30 PM-2:30 PM.

IBM Software Group. Lotus Domino 6.5 Server Enablement

IBM Communications Server for Linux - Network Optimization for On Demand business

Abstract. 1. Introduction and motivation

Top 10 Tips for z/os Network Performance Monitoring with OMEGAMON Ernie Gilman

Introduction to Mainframe (z/os) Network Management

Gigabit Ethernet Design

VXLAN Performance Evaluation on VMware vsphere 5.1

vsphere Networking ESXi 5.0 vcenter Server 5.0 EN

vsphere Networking vsphere 6.0 ESXi 6.0 vcenter Server 6.0 EN

D1.2 Network Load Balancing

TCP Offload Engines. As network interconnect speeds advance to Gigabit. Introduction to

VMWARE WHITE PAPER 1

Networking Driver Performance and Measurement - e1000 A Case Study

Windows Server 2008 R2 Hyper V. Public FAQ

Intel I340 Ethernet Dual Port and Quad Port Server Adapters for System x Product Guide

VXLAN: Scaling Data Center Capacity. White Paper

Microsoft SQL Server 2012 on Cisco UCS with iscsi-based Storage Access in VMware ESX Virtualization Environment: Performance Study

PE10G2T Dual Port Fiber 10 Gigabit Ethernet TOE PCI Express Server Adapter Broadcom based

AS/400e. TCP/IP routing and workload balancing

vsphere Networking vsphere 5.5 ESXi 5.5 vcenter Server 5.5 EN

RELEASE NOTES. StoneGate Firewall/VPN v for IBM zseries

hp ProLiant network adapter teaming

Performance Evaluation of Linux Bridge

Barracuda Link Balancer

Intel Ethernet Switch Load Balancing System Design Using Advanced Features in Intel Ethernet Switch Family

Network Virtualization and Data Center Networks Data Center Virtualization - Basics. Qin Yin Fall Semester 2013

Chapter 8 Router and Network Management

Sockets vs. RDMA Interface over 10-Gigabit Networks: An In-depth Analysis of the Memory Traffic Bottleneck

Broadcom Ethernet Network Controller Enhanced Virtualization Functionality

SUN DUAL PORT 10GBase-T ETHERNET NETWORKING CARDS

Where IT perceptions are reality. Test Report. OCe14000 Performance. Featuring Emulex OCe14102 Network Adapters Emulex XE100 Offload Engine

Evaluation Report: Emulex OCe GbE and OCe GbE Adapter Comparison with Intel X710 10GbE and XL710 40GbE Adapters

How To Monitor And Test An Ethernet Network On A Computer Or Network Card

Leveraging NIC Technology to Improve Network Performance in VMware vsphere

Big Data Storage in the Cloud

VegaStream Information Note Considerations for a VoIP installation

AppDirector Load balancing IBM Websphere and AppXcel

Datacenter Operating Systems

VMware vshield App Design Guide TECHNICAL WHITE PAPER

Fibre Channel over Ethernet in the Data Center: An Introduction

CCNA R&S: Introduction to Networks. Chapter 5: Ethernet

Using IPM to Measure Network Performance

MEASURING WORKLOAD PERFORMANCE IS THE INFRASTRUCTURE A PROBLEM?

ACHILLES CERTIFICATION. SIS Module SLS 1508

I/O Virtualization Using Mellanox InfiniBand And Channel I/O Virtualization (CIOV) Technology

ESXi Configuration Guide

Fax Server Cluster Configuration

Feature Comparison. Windows Server 2008 R2 Hyper-V and Windows Server 2012 Hyper-V

CommuniGate Pro SIP Performance Test on IBM System z9. Technical Summary Report Version V03

VLAN for DekTec Network Adapters

Performance and Recommended Use of AB545A 4-Port Gigabit Ethernet Cards

Using Virtual IP Addressing. with Bus-Tech TCP/IP Controllers

Operating Systems Design 16. Networking: Sockets

Nutanix Tech Note. VMware vsphere Networking on Nutanix

Allocating Network Bandwidth to Match Business Priorities

ESX Server 3 Configuration Guide Update 2 and later for ESX Server 3.5 and VirtualCenter 2.5

z/vm and Linux Disaster Recovery A Customer Experience Lee Stewart Sirius Computer Solutions (DSP)

Hardware Performance Optimization and Tuning. Presenter: Tom Arakelian Assistant: Guy Ingalls

Networking TCP/IP routing and workload balancing

Procedure: You can find the problem sheet on Drive D: of the lab PCs. Part 1: Router & Switch

Optimizing Data Center Networks for Cloud Computing

The Lagopus SDN Software Switch. 3.1 SDN and OpenFlow. 3. Cloud Computing Technology

Virtualization. Dr. Yingwu Zhu

Introduction to MPIO, MCS, Trunking, and LACP

Part 1 - What s New in Hyper-V 2012 R2. Clive.Watson@Microsoft.com Datacenter Specialist

M.Sc. IT Semester III VIRTUALIZATION QUESTION BANK Unit 1 1. What is virtualization? Explain the five stage virtualization process. 2.

IP SAN Best Practices

iseries TCP/IP routing and workload balancing

Radware s AppDirector and AppXcel An Application Delivery solution for applications developed over BEA s Weblogic

Technical Brief. DualNet with Teaming Advanced Networking. October 2006 TB _v02

VLAN 802.1Q. 1. VLAN Overview. 1. VLAN Overview. 2. VLAN Trunk. 3. Why use VLANs? 4. LAN to LAN communication. 5. Management port

Network performance in virtual infrastructures

Set Up a VM-Series Firewall on the Citrix SDX Server

PE310G4BPi40-T Quad port Copper 10 Gigabit Ethernet PCI Express Bypass Server Intel based

Creating Overlay Networks Using Intel Ethernet Converged Network Adapters

How To Manage Performance On A Network (Networking) On A Server (Netware) On Your Computer Or Network (Computers) On An Offline) On The Netbook (Network) On Pc Or Mac (Netcom) On

Network Layer IPv4. Dr. Sanjay P. Ahuja, Ph.D. Fidelity National Financial Distinguished Professor of CIS. School of Computing, UNF

Security Overview of the Integrity Virtual Machines Architecture

Radware s AppDirector and Microsoft Windows Terminal Services 2008 Integration Guide

Integrated Traffic Monitoring

<Insert Picture Here> Oracle Database Support for Server Virtualization Updated December 7, 2009

Internet Protocol: IP packet headers. vendredi 18 octobre 13

ADVANCED NETWORK CONFIGURATION GUIDE

Performance of Software Switching

Internet Protocols Fall Lectures 7-8 Andreas Terzis

ZEN LOAD BALANCER EE v3.02 DATASHEET The Load Balancing made easy

RED HAT ENTERPRISE VIRTUALIZATION FOR SERVERS: COMPETITIVE FEATURES

Meeting the Five Key Needs of Next-Generation Cloud Computing Networks with 10 GbE

Ethernet. Ethernet. Network Devices

ZEN LOAD BALANCER EE v3.04 DATASHEET The Load Balancing made easy

Fabrics that Fit Matching the Network to Today s Data Center Traffic Conditions

Chapter 1 Reading Organizer

Transcription:

Virtualization: TCP/IP Performance Management in a Virtualized Environment Orlando Share Session 9308 Laura Knapp WW Business Consultant Laurak@aesclever.com Applied Expert Systems, Inc. 2011 1

Background Mainframe Specifics Common Problems Applied Expert Systems, Inc. 2011 2

Right-Sizing IT Infrastructure Consolidate entire farms of..servers..storage..network...etal and dynamically optimize to only consume the resources you need! and dynamically optimize to move applications for high availability and performance! Applied Expert Systems, Inc. 2011 3

Always On, Optimized, Energy Efficient Datacenter Dynamic Resource Scheduling Balance workloads Right-size hardware Optimize real time High Availability Restart immediately when H/W or OS fail Protect all apps On-demand Capacity Scale without disruption Reconfigure on the fly Save time X Applied Expert Systems, Inc. 2011 4

Managing Virtualized Data Center Fundamentals of management apply FCAPS Fault Configuration Availability Performance Security Report Monitor Analyze Leading to Service Level Achievement Optimum Resource Utilization Highly available systems High performing systems Remediate Diagnose Applied Expert Systems, Inc. 2011 5

Background Mainframe Specifics Common Problems Best Practices Applied Expert Systems, Inc. 2011 6

z/vm Reliable Scalable Mature VM OS Secure Flexible Efficient Applied Expert Systems, Inc. 2011 7

Total Installed Linux MIPS Business Service Management for Performance Use of Mainframe as VM Grows The momentum continues: Shipped IFL engine volumes increased 62% from 3Q07 to 3Q09 Shipped IFL MIPS increased 100% from 3Q07 to 3Q09 Linux is 16% of the System z customer install base (MIPS) 70% of the top 100 System z clients are running Linux on the mainframe >3,000 applications available for Linux on System z Source: Reed Mullen-IBM Installed Linux MIPS 2003 2004 2005 2006 2007 2008 3Q09 Applied Expert Systems, Inc. 2011 8

z/vm Structure Applied Expert Systems, Inc. 2011 9

Advanced Virtualization on System z MVS (Multiple Virtual Storage) VM (Virtual Machine) LPAR (Logical Partition) Load Balancing VIPA (Virtual IP Addressing) HiperSockets Enterprise Extender (Virtual SNA) Linux for z/series VLAN s (Virtual LAN) VSwitch (Virtual Switch) Applied Expert Systems, Inc. 2011 10

OSA Express3 Highlights New hardware data router bypasses firmware for packet construction, inspection, routing, etc New microprocessor (660 MHz versus 500/448 MHz) New PCI bus (PCI Express) New LC Duplex SM connectors for 10 Gbe feature Dual density adapters Up to four ports per feature, two ports per CHPID Up to 45% improvement in latency over OSA- Express2 4x improvement over OSA-Express2 for 10g Ethernet feature (line speed) Applied Expert Systems, Inc. 2011 11

Background Mainframe Specifics Common Problems Applied Expert Systems, Inc. 2011 12

Approaches Top Down or bottom Up doesn t matter Consistency does Applications Middleware Guest OS VM Network Applied Expert Systems, Inc. 2011 13

Management Lifecycle - Monitoring 3 rd Party Services Services Gather Configuration and Traffic Information Observe Statistics Collect Capacity Data Analyze Traffic Performance Baselining Services Services Solve Problems 3 rd Party Services Implement Changes Plan Changes Evaluate What-if Analysis Applied Expert Systems, Inc. 2011 14

Linux: OSA LAN Timer or Blocking Timer OSA inbound blocking function Determines how long OSA will hold packets Indirectly affects Frequency of host interrupt Payload per interrupt Workload Heavy Light Linux has 3 potential values for OSA2 For frames under 1536:Time between 2 incoming packets For Jumbo frames: Duration of inter-packet gap Total duration that OSA holds a single inbound buffer Default mode is NO LAN idle which is a good compromise for both transactional and streaming workloads Linux behaves differently with OSAExpress3 Using the default for OSA2 results in short latency but high CPU utilization OSA Applied Expert Systems, Inc. 2011 15

Scenario 1 High CPU Utilization after move to OSA3 Situation A system with an even mix of transactional and streaming workloads had a hardware upgrade and was now running with an OSA3 adapter. The Linux CPU became excessively high for no clearly visible reason. Trouble Shooting Historical data was viewed to ensure that the spike in CPU activity did occur when the OSA3 adapter was activated. In viewing the bytes in/out and other workload data no glaring inconsistencies were seen. Solution When the change was made the original OSA2 values for BLKT were used (inter=0, inter_jumbo=0, total=0). Due to the difference in OSA2 and OSA3 behavior these numbers were changed (inter=5, inter_jumbo=15, total=250). CPU utilization returned to normal OSA2 default value on OSA3 results in shortest latency and highest CPU utilization Best to use MTU size of 1492 for OSA3 Supported in SLES10SP3+kernel update SLES 11 RHEL 5.5 Red Hat: /etc/sysconfig/networkscripts/ifcfg-eth0 add OPTIONS="blkt/inter=5 blkt/inter_jumbo=15 blkt/total=250" Applied Expert Systems, Inc. 2011 16

TCP Segmentation Offload (Large Send) Segmentation consumes large amount of CPU Allows most IPv4 TCP segmentation processing to be handled by OSA Increases data transfer efficiency for IPv4 packets Host/ TCP OSA Network Applied Expert Systems, Inc. 2011 17

Scenario 2 2 Tiered Database System Situation Client had a 2 tiered Database system and OSA 3 adapters. The front end database servers created many TCP/IP connections with high transactional volumes. The responses resulted in large TCP segments and the CPU utilization was unbearable. Trouble Shooting Look at the detailed TCP connections and transfer information. Use a packet trace tool. Is there a correlation between large segments sizes resulted in excessive CPU utilization. If so, go in an looking at the OSA adapter TCP Offload was not turned on. Use the large_send parameter no: no large send TSO: OSA adapter does segmentation EDDP: the qeth driver performs segmentation TCPIP will still do segmentation for: LPAR-LPAR packets Solution TCP Segmentation was turned on for the OSA adapter. On average anywhere from 25% to 45% CPU improvement was observed. IPSec encapsulated packets When Multipath is in effect (unless all interfaces support segmentation offload) Applied Expert Systems, Inc. 2011 18

Scenario 3 Mixed Servers on Virtualized Platform Situation Client had 12 servers on a single z/vm base. A single OSA adapter supported this z/vm base and several other LPARs. The workloads were mixed in nature and inconsistent in arrival. Some critical workloads when moved to the z/vm base were not providing the users with the desired response time. Trouble Shooting Using monitoring tools a detailed look was made of the overall IP stack and the flow of information. No inconsistencies were found. What was noted is that all traffic was treated equally. A look at the priority queue setting showed that everything was set to a queue setting of 3 (not important) Solution Due to the use of TOS bits for many of the workloads that required priority treatment the queue assignment was changed to prio_queuing_tos Workloads requiring special handling were now being provided this by the IP stack. Service Type Low Latency 0 High Throughput Queue Impacts outgoing packets Extremely important in high traffic conditions 4 queues are available 1 High Reliability 2 Not Important 3 Applied Expert Systems, Inc. 2011 19

z/vm Virtual Switch Link Aggregation First introduced with z/vm 5.3 Aggregated links includes a physical switch supporting IEEE 802.3ad Linux Linux Linux Linux Linux Link Aggregation Control Protocol VSWITCH z/vm OSA OSA OSA OSA System z LPAR Link Aggregation Control Protocol SWITCH Applied Expert Systems, Inc. 2011 20

Scenario 4 Connection Lost after VSWITCH OSA Takeov Situation Client had a VSWITCH configured with OSA-P and backup OSA-B. When the cable connected to OSA-p was removed the communication was lost even though switching to OSA-B occurred. Trouble Shooting The following data was used in this analysis and was gathered using standard monitoring, tracing, and IP tools. First look at the status of the devices OSA-P and OSA-B. The monitor used showed us that there was a mismatch between speed of OSA-B and the switch that it was connecting to. Solution Match the speed of the switch and the OSA-B Other items investigated included ARP cache of z/vm TCP/IP z/vm routing of Linux guest information Trace information for the VSWITCH Applied Expert Systems, Inc. 2011 21

Scenario 5 Linux Hipersocket Performance Slow Situation A client had a very successful beta with Linux on system z. As they added additional workloads onto the Linux systems overall network performance declined. Hipersockets was used and the expectation was that performance should have been better. Trouble Shooting Using a Linux TCP/IP Monitor check the overall flow of information through both the IP and TCP layers. Verify that listeners are available for the applications. View alerts and determine if any would suggest the problem being seen. Check the buffer count. In this system the buffer count had never been raised and was still set at 16. Solution Tests by IBM have shown that using the default of 16 limits throughput as the number of parallel sessions increases with hipersockets. The buffers were increased to 50 with acceptable results As you increase the buffer additional memory will be used SUSE SLES11: in /etc/udev/rules.d/51-qeth- 0.0.f200.rules add ACTION=="add", SUBSYSTEM=="ccwgroup", KERNEL=="0.0.f200", ATTR{buffer_count}="128" Applied Expert Systems, Inc. 2011 22

MTU Size Optimizing MTU size can provide optimum performance improvements Set the maximum size supported by all hops between the source and destination Traceroute can provide details on the MTU size but some router administrators block traceroute If you application sends frames <= 1400 bytes use an MTU size of 1492 Jumbo frames use and MTU size of 8992 TCP uses MTU size for window size calculation For VSWITCH an MTU of 8992 is recommended Applied Expert Systems, Inc. 2011 23

Scenario 6 Excessive Fragmentation Situation A client had a Linux on system environment and they were about ready to grow the production use of Linux. While they did not have any major problems they new of they asked for an overall health check. Trouble Shooting Using a Linux TCP/IP Monitor check the overall flow of information through both the IP and TCP layers. Look at the MTU settings on your links and the fragmentation on the IP stack. While there was not significant fragmentation, the MTU size was set at 1500. Solution In order to prevent future fragmentation issues we reset the MTU size to 1492 and defined that as the standard for their linux systems Applied Expert Systems, Inc. 2011 24

A few items to exploit in System z Kernel Kernel 2.6.35 Offload outbound checksumming Move calculation of checksum for non-tso packets from the driver to the OSA card NAPI support for QDIO and QETH (NAPI is Linux networking API) Kernel 2.6.34 Hipersockets Network Traffic Analyzer Trace Hipersockets network traffic Layer 2 and Layer 3 supported Kernel 2.6.33 OSA ADIO Data Connection Isolation Isolate data traffic between Linux guests on a VSWITCH Linux Guest communication needs to go through an exernal router Improves multi-tier security Applied Expert Systems, Inc. 2011 25

AES Sessions at Share Aug 8, 2011: 1:30-2:30 9288: Keeping Your Network at Peak Performance as You Virtualize the Data Center Aug 10, 2011: 8:00-9:00 9266: IPv6 Basics Aug 10, 2011: 4:30-5:30 9270: Managing an IPv6 Network Aug 11, 2011: 3:00-4:00 9273: CSI Maui: Forensics in The Case of the Attacked Browsers Aug 11, 2011: 11:00-12:00 9277: Implementing IPv6 on Windows and Linux Desktop Aug 11, 2011: 1:30-2:30 9290: Network Problem Diagnosis with OSA Examples Aug 12, 2011: 8:00-9:00 9308: TCP/IP Performance Management in a Virtualized Environment Applied Expert Systems, Inc. 2011 26

QUESTIONS? laurak@aesclever.com www.aesclever.com 650-617-2400 Applied Expert Systems, Inc. 2011 27