VMware Horizon 6 with View Performance and Best Practices TECHNICAL WHITE PAPER



Similar documents
Characterize Performance in Horizon 6

VMware Horizon View 5.2 and Hardware Accelerated 3D Graphics

DIABLO TECHNOLOGIES MEMORY CHANNEL STORAGE AND VMWARE VIRTUAL SAN : VDI ACCELERATION

Oracle Database Scalability in VMware ESX VMware ESX 3.5

Dell Desktop Virtualization Solutions Stack with Teradici APEX 2800 server offload card

VMware Virtual SAN Backup Using VMware vsphere Data Protection Advanced SEPTEMBER 2014

Why Choose VMware vsphere for Desktop Virtualization? WHITE PAPER

Server and Storage Sizing Guide for Windows 7 TECHNICAL NOTES

WHITE PAPER 1

Global Financial Management Firm Implements Desktop Virtualization to Meet Needs for Centralized Management and Performance

VMware View 4 with PCoIP I N F O R M AT I O N G U I D E

DESKTOP VIRTUALIZATION SIZING AND COST MODELING PROCESSOR MEMORY STORAGE NETWORK

Dell Compellent Storage Center SAN & VMware View 1,000 Desktop Reference Architecture. Dell Compellent Product Specialist Team

VDI Without Compromise with SimpliVity OmniStack and Citrix XenDesktop

VMware Workspace Portal Reference Architecture

Virtual Desktop Infrastructure (VDI) made Easy

VMware Horizon. VMware Horizon 6. Q. What is VMware Horizon? Central image management is supported for

Microsoft Office SharePoint Server 2007 Performance on VMware vsphere 4.1

IOmark- VDI. HP HP ConvergedSystem 242- HC StoreVirtual Test Report: VDI- HC b Test Report Date: 27, April

IOmark- VDI. Nimbus Data Gemini Test Report: VDI a Test Report Date: 6, September

VMware Virtual SAN 6.2 with Virtual Desktop Infrastructure Workload

Introduction to VMware EVO: RAIL. White Paper

Diablo and VMware TM powering SQL Server TM in Virtual SAN TM. A Diablo Technologies Whitepaper. May 2015

MS Exchange Server Acceleration

Kronos Workforce Central on VMware Virtual Infrastructure

Virtualization of the MS Exchange Server Environment

The VMware Reference Architecture for Stateless Virtual Desktops with VMware View 4.5

Microsoft Exchange Server 2007

Intel Cloud Builders Guide to Cloud Design and Deployment on Intel Platforms

Analysis of VDI Storage Performance During Bootstorm

Performance Characteristics of VMFS and RDM VMware ESX Server 3.0.1

Configuration Maximums

Management of VMware ESXi. on HP ProLiant Servers

Technical Paper. Moving SAS Applications from a Physical to a Virtual VMware Environment

Understanding Data Locality in VMware Virtual SAN

Performance Test Report: Novell iprint Appliance 1.1

HP SN1000E 16 Gb Fibre Channel HBA Evaluation

Configuration Maximums VMware Infrastructure 3

How To Use A Vmware View For A Patient Care System

Virtual Desktop Infrastructure (VDI) Overview

NVIDIA GRID OVERVIEW SERVER POWERED BY NVIDIA GRID. WHY GPUs FOR VIRTUAL DESKTOPS AND APPLICATIONS? WHAT IS A VIRTUAL DESKTOP?

Infor Web UI Sizing and Deployment for a Thin Client Solution

VBLOCK SOLUTION FOR KNOWLEDGE WORKER ENVIRONMENTS WITH VMWARE VIEW 4.5

VMware Horizon 7. End-User Computing Today. Horizon 7: Delivering Desktops and Applications as a Service

HP ProLiant BL660c Gen9 and Microsoft SQL Server 2014 technical brief

Desktop Virtualization with VMware Horizon View 5.2 on Dell EqualLogic PS6210XS Hybrid Storage Array

Configuration Maximums

White paper Fujitsu Virtual Desktop Infrastructure (VDI) using DX200F AFA with VMware in a Full Clone Configuration

Vocera Voice 4.3 and 4.4 Server Sizing Matrix

MaxDeploy Ready. Hyper- Converged Virtualization Solution. With SanDisk Fusion iomemory products

Philips IntelliSpace Critical Care and Anesthesia on VMware vsphere 5.1

Dragon NaturallySpeaking and citrix. A White Paper from Nuance Communications March 2009

HCIbench: Virtual SAN Automated Performance Testing Tool User Guide

Dell Desktop Virtualization Solutions Simplified. All-in-one VDI appliance creates a new level of simplicity for desktop virtualization

Enterprise Desktop Solutions: VMware View 4.5

What s New in VMware vsphere 4.1 Storage. VMware vsphere 4.1

DELL. Virtual Desktop Infrastructure Study END-TO-END COMPUTING. Dell Enterprise Solutions Engineering

EMC Business Continuity for Microsoft SQL Server 2008

VMware Virtual SAN Design and Sizing Guide for Horizon View Virtual Desktop Infrastructures TECHNICAL MARKETING DOCUMENTATION REV A /JULY 2014

Citrix XenApp Server Deployment on VMware ESX at a Large Multi-National Insurance Company

VDI Optimization Real World Learnings. Russ Fellows, Evaluator Group

Handling Multimedia Under Desktop Virtualization for Knowledge Workers

Desktop Virtualization (VDI) Overview VMware Horizon View

Increasing Storage Performance, Reducing Cost and Simplifying Management for VDI Deployments

Configuration Maximums VMware vsphere 4.0

Performance analysis and comparison of virtualization protocols, RDP and PCoIP

Remote PC Guide Series - Volume 1

Top 10 Reasons to Virtualize VMware Zimbra Collaboration Server with VMware vsphere. white PAPER

Design Considerations for Increasing VDI Performance and Scalability with Cisco Unified Computing System

VMware Horizon 6 Reference Architecture TECHNICAL WHITE PAPER

GTC EXPRESS WEBINAR: GETTING THE MOST OUT OF VMWARE HORIZON VIEW VDGA WITH NVIDIA GRID. Steve Harpster - Solution Architect NALA

Implementation Considerations for VMware App Volumes in a Citrix XenApp Environment WHITE PAPER

VMware Virtual SAN Design and Sizing Guide TECHNICAL MARKETING DOCUMENTATION V 1.0/MARCH 2014

Voice over IP (VoIP) Performance Evaluation on VMware vsphere 5

Tableau Server Scalability Explained

REFERENCE ARCHITECTURE. PernixData FVP Software and Splunk Enterprise

Cisco, Citrix, Microsoft, and NetApp Deliver Simplified High-Performance Infrastructure for Virtual Desktops

Pivot3 Reference Architecture for VMware View Version 1.03

EMC Unified Storage for Microsoft SQL Server 2008

Managing Capacity Using VMware vcenter CapacityIQ TECHNICAL WHITE PAPER

Brainlab Node TM Technical Specifications

Detailed Lab Report DR101115D. Citrix XenDesktop 4 vs. VMware View 4 using Citrix Branch Repeater and Riverbed Steelhead

Windows Server ,500-user pooled VDI deployment guide

VMware End User Computing Horizon Suite

Leveraging NIC Technology to Improve Network Performance in VMware vsphere

Business Process Desktop: Acronis backup & Recovery 11.5 Deployment Guide

HP Client Virtualization SMB Reference Architecture for Windows Server 2012

Understanding the Performance of an X User Environment

High-Availability Fault Tolerant Computing for Remote and Branch Offices HA/FT solutions for Cisco UCS E-Series servers and VMware vsphere

Host Power Management in VMware vsphere 5

IBM FlashSystem and Atlantis ILIO

HOW MANY USERS CAN I GET ON A SERVER? This is a typical conversation we have with customers considering NVIDIA GRID vgpu:

EMC Backup and Recovery for Microsoft SQL Server

EMC Backup and Recovery for Microsoft Exchange 2007 SP2

VMware vsphere Data Protection 6.0

SanDisk SSD Boot Storm Testing for Virtual Desktop Infrastructure (VDI)

White paper Fujitsu vshape Virtual Desktop Infrastructure (VDI) using Fibre Channel and VMware

Evaluation Report: Supporting Microsoft Exchange on the Lenovo S3200 Hybrid Array

Sizing of Virtual Desktop Infrastructures

Transcription:

ware Horizon 6 with View TECHNICAL WHITE PAPER

ware Horizon 6 with View Table of Contents. Introduction... 3 Horizon 6 Feature and Performance Enhancements.... 4 Remote Desktop Session Host Applications... 4 Improved PCoIP Performance.... 4 Hardware-Accelerated 3D Graphics.... 5 ware Virtual SAN... 6 Performance Results... 7 RDSH Sizing.... 8 Calculating the Number of vcpus and RDSH Virtual Machines... 8 Calculating the Number of Users per Core.... 9 Calculating the Number of Application Sessions per Core... 10 Comparing Display Protocol Performance.... 10 CPU Usage.... 11 Bandwidth Usage.... 11 Performance Results of PCoIP Default Changes.... 12 Hardware-Accelerated 3D Graphics Performance.... 13 Light 3D Workload.... 15 CAD Workload... 15 VDI Characterization on Virtual SAN... 18 Horizon 6 Best Practices.... 20 RDSH Virtual Machine Sizing.... 20 RDSH Session Sizing.... 20 RDSH Server Virtual Machine Optimization.... 20 Guest Best Practices for Bandwidth and Storage... 21 PCoIP Settings... 22 3D Graphics Best Practices... 22 Virtual SAN Best Practices.... 22 Conclusion.... 23 References.... 24 Authors and Contributors.... 25 TECHNICAL WHITE PAPER / 2

ware Horizon 6 with View Introduction The ware Horizon with View centralized desktop infrastructure offers advantages for both end users and IT staff. End users are no longer locked to a particular machine and can access their system and files from anywhere, anytime. Horizon with View transforms IT by simplifying and automating desktop and applications management. IT administrators can quickly create virtual desktops on demand based on locations and profiles. By centrally maintaining desktops, applications, and data, Horizon with View reduces costs, improves security, and increases availability and flexibility for end users. Centralized Virtual De Desktops wa re vs ph ere (ES Xi) Linked Clones Microsoft Active Directory View Connection Server Zero Client OS Thin Client Horizon Client for Android ware vcenter Horizon Client for ios View Composer Horizon Client for Windows Endpoint Devices Master Image Horizon Client for Mac Horizon Client for Linux Horizon Client for Windows Store HTML Figure 1: Horizon with View Architecture T E C H N I C A L W H I T E PA P E R / 3

ware Horizon 6 with View Horizon 6 Feature and Performance Enhancements Horizon 6, which includes Horizon with View in each edition, introduces some new features, such as Remote Desktop Session Host (RDSH) applications and application publishing with seamless Windows support, and enhancements that heighten and improve performance, including adjusting the PCoIP defaults, expanding 3D graphics support, and adding support for ware Virtual SAN. This white paper describes the performance gains achieved with the following Horizon 6 enhancements, as indicated by the testing results. It details the architecture systems used for testing the features and recommends best practices for configuring your system. Remote Desktop Session Host Applications Horizon 6 with View introduces RDSH-hosted apps and extended capabilities for RDSH-based desktops. Users can connect to the RDSH server to get a full desktop session or use the published applications seamlessly on the client side. Improved PCoIP Performance The PC over IP (PCoIP) display protocol s adaptive technology provides an optimized virtual desktop delivery on both LAN and WAN. Horizon 6 with View improves the end-user experience by introducing new bandwidth management algorithms that increase the frame rate and reduce its standard deviation. Horizon 6 also has new PCoIP default settings that improve performance. You can change the settings if needed, but in most use cases, the new defaults are suitable and use less bandwidth. The changed default settings are Build to Lossless (BTL) is set to off A fully lossless image is usually not required, and disabling BTL reduces the bandwidth by up to 20 percent. In some cases, however, you might need to enable BTL, for example, when a doctor needs to examine a precise image. Maximum Initial Image Quality changed from 90 percent to 80 percent This setting controls the quality and compression level of pixels when the screen changes for pictures, animations, or videos. The previous default setting of 90 percent provided a perceptually lossless level of compression, meaning that the human eye could not perceive any distortion in the image. Because the screen is constantly changing for animation and video, people see no perceptible change at 80 percent of lossless. The new default reduces bandwidth usage by up to 30 percent. Minimum Image Quality changed from 50 percent to 40 percent This setting specifies the lowest quality and compression level of changed pixels for pictures, animations, and videos. This limit is reached only when the network is congested and the PCoIP session needs to conserve bandwidth. By decreasing the default to 40 percent of lossless, animation and video frame rates are improved by up to 10 percent under constrained conditions. TECHNICAL WHITE PAPER / 4

ware Horizon 6 with View Hardware-Accelerated 3D Graphics In response to user demand for an ever-richer set of applications to be supported in the virtual environment, ware improved 3D graphics support in View 5.x, with additional improvements added in Horizon 6. The following 3D capabilities expand both the target user base and potential use cases that IT organizations can deliver with virtual desktops. Soft 3D Introduced with View 5.0, support for software-accelerated 3D graphics is provided by a Soft 3D graphics driver without physical GPUs installed in the ware ESXi (hypervisor) virtual machine host. vsga (Virtual Software Graphics Acceleration) Introduced with View 5.2, multiple virtual machines can leverage physical GPUs installed locally in the ESXi virtual machine hosts to provide hardware-accelerated 3D graphics to multiple virtual desktops. vdga (Virtual Dedicated Graphics Acceleration) Introduced with View 5.3, a single virtual machine is mapped to one physical GPU installed in the ESXi virtual machine host to provide high-end, hardwareaccelerated workstation graphics where a dedicated GPU is needed. The 3D graphics acceleration is built into the ware vsphere platform. Horizon with View fully leverages vsphere and delivers a robust set of 3D offerings to end users. You can administer support for 3D desktops from the View Administrator console. You can enable it on a per pool or per virtual machine basis using the vsphere client. More information and a detailed performance study about 3D graphics acceleration are available in the ware community forum paper, ware Horizon 6 and Hardware Accelerated 3D Graphics. In contrast to a physical workstation that has sole use of its GPU, GPUs in a virtualized environment are a shared resource. Therefore, it is important to ensure that each virtual machine does not waste the GPU resource. For instance, if View is configured to remote at a lower frame rate (the default is 30 FPS), it usually does not make sense for a 3D application to render hundreds of frames per second (FPS). For these situations, you can configure a View registry setting to limit the maximum application frame rate either in the template virtual machine or on a per-virtual-machine basis. The value is typically set to the maximum frame rate used by PCoIP. Setting the following registry entry for a 3D workload can significantly improve the performance and consolidation ratios achievable: HKLM\SOFTWARE\ware, Inc.\ware SVGA DevTap\MaxAppFrameRate TECHNICAL WHITE PAPER / 5

ware Horizon 6 with View ware Virtual SAN ware Virtual SAN is a software-defined storage tier that simplifies and streamlines storage provisioning and management. Horizon 6 supports Virtual SAN 5.5, providing a low-cost storage solution for virtual desktops and RDSH sessions and applications. Virtual SAN allows IT to manage resources and allocate storage on an as-needed basis through storage-policy-based management the administrator creates storage policies and applies the appropriate policy at deployment or when requirements change. Virtual SAN offers the following advantages: Supports any type of desktop, stateless or persistent Reduces the complexity of scaling for virtual desktop infrastructure (VDI) deployments Scales to the maximum vsphere cluster size (32 nodes) Delivers performance that is almost equal to an all-flash storage system at a fraction of the cost Horizon with View ware vcenter Server vsphere Virtual SAN SSD Hard Disks SSD Hard Disks SSD Hard Disks SSD Hard Disks Clustered Virtual SAN Datastore Figure 2: Virtual SAN and Horizon with View Architecture TECHNICAL WHITE PAPER / 6

ware Horizon 6 with View Performance Results To test the performance of the new and enhanced features, we used ware View Planner 3.5, a workload generator. View Planner simulates typical end-user operations, such as typing in Microsoft Word, playing a PowerPoint slideshow, reading Outlook emails, viewing PDF files, browsing Web pages, and watching a video. View Planner also mimics user behavior by allowing for think time during activities. For more information, see the View Planner Installation and User Guide. Microsoft Office Other Applications Figure 3: Applications Used in View Planner Workload To simulate a heavy user, all applications were selected, a fast moving video with many screen changes was played, and a think time of 2 seconds was used. For a medium user, all the applications were selected, but a slower video with fewer changes and a think time of 5 seconds were used. View Planner was run for multiple iterations, with each iteration completing all user operations for the specific group. Each iteration has three phases: ramp-up, steady state, and ramp-down. During each iteration, View Planner reports the latencies for each operation performed within each virtual machine. View Planner divides tests into groups. Group A represents interactive operations, and Group B includes CPUand I/O-sensitive operations. Quality of service (QoS) is determined for Group A user operations as 1 second, and Group B user operations as 6 seconds. TECHNICAL WHITE PAPER / 7

ware Horizon 6 with View RDSH Sizing With sizing, the goal is to consolidate as many sessions as possible on a particular infrastructure without sacrificing quality. To assess sizing, three aspects of performance were examined: How many users or sessions per physical core can a desktop or application session support? How many vcpus are used for an RDSH virtual machine? How many RDSH virtual machines are needed? Windows 7 32-bit, 1 vcpu, 1 GB RAM RDSH s Windows 2012 R2 Server, 2 x 16 vcpu, 16 x 96 GB Client s or Users ware vsphere 5.5 PCoIP ware vsphere 5.5 Dell PowerEdge R820, 32-core (4 x 8-core socket) Intel Xeon E5-4650, 2.7 GHz, 512 GB RAM, 2 TB RAID 0, local SSD disks Dell PowerEdge R710, 12 cores, 2-socket Intel Xeon E5645, 2.4 GHz, 256 GB RAM, local SSD disks Figure 4: System Under Test for RDSH Server Virtual Machine Sizing Calculating the Number of vcpus and RDSH Virtual Machines One of the main considerations in RDSH sizing is to decide whether to use a few large RDSH server virtual machines (s) 12 to 24 vcpus or many small RDSH server s (4 to 8 vcpus). To arrive at the optimal number of RDSH vcpus and instances, we experimented with several different configurations. The experimental results show that the number of vcpus for the RDSH server should fit within one CPU NUMA node; that is, the number of vcpus is less than or equal to the number of cores in the CPU socket. The total number of vcpus across all RDSH s should be twice the total number of physical cores in the system across all CPU sockets. Because most modern processors have hyperthreading enabled, and each physical core has two hyperthreads, you can conceive of this 2:1 over-commitment ratio as a 1:1 ratio by taking hyperthreaded cores into account. With these two practices in place, essentially two RDSH s are being placed in one NUMA node. For more information about RDSH sizing, see the ware Horizon 6 RDSH white paper. The system under test included a vsphere server hosting several RDSH virtual machines, each running Windows 2012 R2 Server and each configured with 2 16 vcpus and 16 96 GB of memory. The RDSH virtual machines communicated with the client virtual machines using PCoIP. Each client virtual machine ran either a remote desktop or a remote application. Each client virtual machine ran 32-bit Windows 7 with 1 GB RAM. TECHNICAL WHITE PAPER / 8

ware Horizon 6 with View Calculating the Number of Users per Core Figure 5 shows the optimal number of users (sessions) per physical core that can be served on the RDSH s in terms of response time in seconds. Thorough testing by ware has shown that users can tolerate delay in a remote desktop or remote application action for 6 seconds. The chart shows nine users per core is the point that intersects with a 6-second response time. Figure 5: View Planner Group B Response Time with Increasing Users per Core TECHNICAL WHITE PAPER / 9

ware Horizon 6 with View Calculating the Number of Application Sessions per Core Similar to the remote desktop sessions, all applications were run as remote seamless applications. The number of sessions was increased from 40 to 68 on an 8-vCPU RDSH virtual machine. Figure 6 shows that the density is between 8 and 8.5 users per core, which is slightly lower than the number of desktop sessions, which was about 9 users per core. Figure 6: View Planner Group B Response Time with Increasing Application Sessions per Core Comparing Display Protocol Performance An important aspect of using remote applications is to find the ideal user experience at different network conditions. Horizon 6 with View uses the PCoIP remote viewing network protocol. ware compared the performance of PCoIP to Microsoft RDP 8 and Citrix ICA. The study looked at two primary factors that contribute to the speed of an application when viewed remotely: CPU usage and bandwidth usage. The test consisted of 60 remote application sessions running on an 8-vCPU Windows 2012 R2 RDSH virtual machine. All View Planner 3.5 applications were run, except video, a minimum of three iterations with a 5-second think time. The resolution was 1152 x 864, and the color depth was 32-bit. The ICA compression level was changed to low to match the perceptually lossless quality of PCoIP. TECHNICAL WHITE PAPER / 10

ware Horizon 6 with View CPU Usage Figure 7 shows the percentage of the guest CPU used when performing the various workload tasks. PCoIP used on average 71.6 percent, RDP used 68 percent, and ICA 71.2 percent. All the protocols used about the same amount of compute power on the guest operating system. Figure 7: CPU Usage in the RDSH Virtual Machine for Three Competing Remote Display Protocols Bandwidth Usage Figure 8 shows the bandwidth usage of the 60 sessions for each display protocol. The average kilobits per second (Kbps) are calculated for the workload runs. PCoIP averaged 44.7 Kbps per session, RDP used 50.7 Kbps, and ICA 48.4 Kbps. PCoIP bandwidth performance is about 10 percent better than RDP and ICA. Figure 8: Bandwidth Usage for View Planner Medium Workload per Remote Display Protocols TECHNICAL WHITE PAPER / 11

ware Horizon 6 with View Performance Results of PCoIP Default Changes To test the effects of changing the PCoIP defaults, we compared the performance of Horizon 6 with View to View 5.3. Figure 9 shows the setup used for this evaluation. An HP ProLiant BL 460c G6 blade server with a 2.53 GHz Intel Xeon processor and 96 GB physical memory hosted the desktop s. The desktop data and operating system disk resided on an all-flash array. The ran 32-bit Windows 7 with one vcpu and 1 GB of virtual memory. To run clients at scale, s were used on a separate host. An HP ProLiant BL 460c G6 blade server with a 2.13 GHz Intel Xeon E7-2830 processor and 256 GB physical memory was used. Windows 7 32-bit clients connected to the desktop s with PCoIP. The clients were configured with one vcpu and 1 GB of virtual memory. The View Planner workload representing a heavy user was run on all 40 desktop s. All applications were tested, and a think time of 2 seconds was used. Windows 7 32-bit, 1 vcpu, 1 GB RAM Windows 7 32-bit, 1 vcpu, 1 GB RAM VDI s Client s or Users ware vsphere 5.5 ware vsphere 5.5 PCoIP HP ProLiant BL 460c G6, 8-core (2 x 4-core socket) Intel Xeon 2.53 GHz, 96 GB RAM, Violin storage array HP ProLiant BL 620c G7, 16 cores, 2-socket Intel Xeon E7-2830 2.13 GHz, 256 GB RAM connected to SAN Figure 9: Setup to Test PCoIP Default Setting Changes TECHNICAL WHITE PAPER / 12

ware Horizon 6 with View Figure 10 shows the response time for Groups A and B, comparing Horizon 6 with View and View 5.3. Horizon 6 with View shows about a 10 percent lower response time in both interactive and I/O-sensitive operations compared to View 5.3. Bandwidth usage was about 30 percent lower in Horizon 6 with View due to changes in PCoIP default settings. 6 Response time in seconds 5 4 3 2 1 View 5.3 Horizon 6 with View 0 Group A 95% Group B 95% Figure 10: View Planner Response Times Comparing Horizon 6 with View and View 5.3 Hardware-Accelerated 3D Graphics Performance To measure the scalability of a VDI solution that uses vsga to support 3D graphics, two different workloads that stress the vsga solution in different ways were tested. Scalability is defined in terms of the consolidation ratio and the corresponding response time or frame rate during the runs. The following workloads represent typical customer use scenarios: Light 3D workload This workload consists of common desktop applications and usage, including Microsoft Office 2010, Adobe Acrobat, 720p video, browsing static content with Internet Explorer, and running Google Earth in the Chrome browser. All applications are launched at the beginning of the run and remain open for the duration of the run. Throughout the duration of the test, the workload performs a variety of operations using these applications. The ordering of the operations differs from desktop to desktop to mimic real-world workloads. The desktop s run Windows 7 at a resolution of 1600 x 1200 pixels and have Aero enabled. TECHNICAL WHITE PAPER / 13

ware Horizon 6 with View CAD workload 2 A Solid Edge CAD viewer is run in isolation for the duration of the test. During the test, a 3:1 reducer model was used, as illustrated in Figure 11. Figure 11: 3-to-1 Reducer Model Used in Performance Tests with the Solid Edge Viewer A think time of 5 seconds was used. All tests used the default settings. The setup is shown in Figure 12. The workloads were run on a single Dell R720 server with different consolidation ratios. The number of s that can be supported per GPU can be dictated by either the GPU s compute resources being exhausted or the GPU s available memory being exhausted. Windows 7 SP1 32-bit, 2 vcpu, 1.5 GB RAM, 1600 x 1200 resolution, Horizon 6.0 with View Windows 7 SP1 32-bit, 2 vcpu, 2 GB RAM, 128 MB VRAM, 1600 x 1200 resolution, Horizon 6.0 with View, View Planner ware vsphere 5.5 ware vsphere 5.5 PCoIP Intel Xeon E5540, 2.53 GHz, 96 GB RAM SSD disk storage on EMC CLARiiON CX4 Dell PowerEdge R720, 2 Intel Xeon E5-2695 v2, 2.4 GHz, 512 GB RAM, 2 NVIDIA K2 (16 GB GDDR5 DRAM total) SSD disk storage on EMC CLARiiON CX4 Figure 12: Setup for Measuring the Performance of the vsga Stack and Horizon 6 with View TECHNICAL WHITE PAPER / 14

ware Horizon 6 with View Light 3D Workload For the light 3D workload, the number of s was gradually increased until the View Planner response threshold was exceeded. The results are shown in Figure 13. The corresponding CPU utilization, as measured using esxtop, is shown by the line graph. Based on this data, it is clear that the vsga stack can support 120 users while each user is executing the light 3D workload. Running on higher-performance processors typically delivers even higher consolidation ratios. Normalized response time (lower is better) 1.20 1.00 0.80 0.60 0.40 0.20 0.00 Normalized value of 95(th) percentile of response time for light 3D workload FAIL PASS 16 32 48 64 80 96 120 128 Number of s 100 90 80 70 60 50 40 30 20 10 0 % CPU utilization on ESX server View Planner Threshold ESX CPU (%) - Utilization Response Time Figure 13: Response Time as the Number of s Is Increased The results shown in Figure 13 were obtained using desktop s configured with 128 MB video RAM. Because the test bed had two NVIDIA GRID K2 GPUs with 8 GB DRAM each, the GPUs can support only about 128 desktop s. For this light 3D workload, the maximum consolidation ratio achieved on the dual-socket server under test was 128. The test was stopped when the responsiveness exceeded the upper limit allowed by the View Planner responsiveness threshold. CAD Workload In this workload, a Solid Edge viewer runs a single model: a 3-to-1 reducer, as shown in Figure 11. As with the light 3D workload, the simulated user s interaction with the model mimics a real user s pattern. Figure 14 illustrates the scalability of the vsga as the load on the server with two K2 cards is increased. As the number of s is increased from 1 to 72, the aggregate remotely delivered FPS increased by 35 times for K2, and 30 times for K1. Peak GPU usage was about 95 percent. Figure 16 presents the same results as Figure 14, but shows the remotely delivered FPS data on a per- basis. This view of the data highlights that the performance of the individual s moderately decreases as the number of s is scaled. TECHNICAL WHITE PAPER / 15

ware Horizon 6 with View Normalized value of total FPS delivered remotely (higher is better) 40 35 30 25 20 15 10 5 0 Normalized plot of total FPS delivered remotely for CAD workload 2 1 2 4 8 16 32 48 64 72 80 Number of s 100 90 80 70 60 50 40 30 20 10 0 % CPU utilization on ESX server Fails FPS Threshold Criterion ESX CPU (%) - Utilization Total FPS Figure 14: Scalability of the vsga Solution as the Server Load Is Increased with NVIDIA K2 Cards Normalized value of total FPS delivered remotely (higher is better) 40 35 30 25 20 15 10 5 0 Normalized plot of total FPS for CAD workload 2 1 2 4 8 16 32 48 56 64 72 Number of s 100 90 80 70 60 50 40 30 20 10 0 % CPU utilization on ESX server ESX CPU (%) - Utilization Total FPS Figure 15: Scalability of the vsga Solution as the Server Load Is Increased with NVIDIA K1 Cards TECHNICAL WHITE PAPER / 16

ware Horizon 6 with View Normalized value of average FPS delivered remotely (higher is better) 1.20 1.00 0.80 0.60 0.40 0.20 0.00 Normalized plot of average FPS delivered remotely for CAD workload 2 Average FPS FPS Threshold PASS FAIL 1 2 4 8 16 32 48 64 72 80 Number of s Figure 16: Remotely Delivered FPS Data on a per- Basis with K2 Graphics Cards Normalized value of average FPS delivered remotely (higher is better) 1.20 1.00 0.80 0.60 0.40 0.20 0.00 Normalized plot of average FPS for CAD workload 2 Average FPS FPS Threshold 1 2 4 8 16 32 48 56 64 72 Number of s PASS FAIL Figure 17: Remotely Delivered FPS Data on a per- Basis with K1 Graphics Cards TECHNICAL WHITE PAPER / 17

ware Horizon 6 with View VDI Characterization on Virtual SAN In this setup to test VDI characterization on a three-node Virtual SAN cluster, a host running the desktop s has 16 Intel Xeon E5-2690 cores running about 2.9 GHz. The host has 256 GB physical RAM, which is more than sufficient to run one hundred 1-GB Windows 7 s. For Virtual SAN, each host has two disk groups. Each disk group has one PCI-e 200-GB solid-state drive (SSD) and six 300-GB 15 K RPM SAS disks. We used View Planner to evaluate VDI and Virtual SAN performance. Client s or Users Windows XP SP3 32-bit, 1 vcpu, 768 MB RAM Desktop s Windows 7 32-bit, 1 vcpu, 1 GB RAM ware vsphere 5.5 ware vsphere 5.5 PCoIP HP ProLiant BL 460, 8-core Intel Xeon E5540, 2.53 GHz, 96 GB RAM with hybrid storage array with FC protocol Dell PowerEdge R720xd, 16-core Intel Xeon E5-2690, 2.9 GHz, 256 GB RAM, 2 NVIDIA K2 (16 GB GDDR5 DRAM total) Virtual SAN: Two disk groups with each disk group having one PCI-e SSD, 200 GB; six 300-GB 15k RPM SAS disks Figure 18: Setup for Virtual SAN (Host Configuration for a 3-Node Cluster) TECHNICAL WHITE PAPER / 18

ware Horizon 6 with View To highlight the PCoIP default changes, we compared the View Planner score on Virtual SAN 5.5 (vsphere 5.5 U1) on View 5.3.2, and Horizon 6 with View and Virtual SAN public beta (vsphere 5.5). On a three-node cluster, VDImark the maximum number of desktop s that can run passing QoS criteria is obtained for both Virtual SAN 5.5 and Virtual SAN public beta. The results are shown in Figure 19. 400 View Planner VDImark for 3-node cluster 350 VDImark 300 250 200 150 286 305 341 100 50 0 Virtual SAN Public Beta with View 5.3.2 Virtual SAN 5.5 with View 5.3.2 Virtual SAN 5.5 with Horizon 6 with View Figure 19: VDI Performance on Virtual SAN The results show that with Virtual SAN 5.5, we can scale up to 305 s on a three-node cluster, which is about 5 percent more consolidation than Virtual SAN public beta. Compared to prior releases, Virtual SAN 5.5 has about 10 percent more consolidation with Horizon 6 with View. This improvement is primarily due to PCoIP default changes, which provide better host consolidation and lower bandwidth usage. TECHNICAL WHITE PAPER / 19

ware Horizon 6 with View Horizon 6 Best Practices Based on the performance results, we recommend the following best practices. RDSH Virtual Machine Sizing Using multiple RDSH server s instead of one large provides the best performance. Make the vcpu for the RDSH less than or equal to the number of cores in the socket so that the fits within the same NUMA node. For the number of instances, a 2:1 CPU over-commitment works better, for example, 64 vcpus on 32 cores, or 64 hyperthreaded cores, in our experiments. If the number of cores in the socket is less, and the requirement is to use a large vcpu RDSH that does not fit in the same NUMA node, try the preferht=true option in the X file of the RDSH server. This option forces the vcpus to the fewest number of physical sockets by giving preference to hyperthreaded cores for scheduling purposes, increasing the probability of the staying in the same socket. You can also set the CPU affinity so that each vcpu is mapped to one set of hyperthreaded cores. Because a 1:1 overcommit with hyperthreaded cores is recommended, setting the affinity in the X file can further improve performance. In the following X configuration example, an 8-vCPU is mapped to cores 0 7. Similarly, the other seven instances of the 8-vCPU s could be mapped to cores 8 63: sched.cpu.affinity = 0,1,2,3,4,5,6,7 RDSH Session Sizing Based on nine desktop sessions per core for typical office workers (medium user) on a 2.7 GHz processor, about 300 MHz per user is needed. For seamless applications, the amount is about 350 MHz. Therefore, setting the CPU MHz between 300 and 500 MHz per session provides a good user experience. A typical session for a medium user requires the following: CPU 300 500 MHz per session Memory 400 500 MB per session for nine applications, based on a typical working set of applications (your needs might be different) Disk space 200 300 MB per user in the operating system disk for profiles, temporary files, and so on Network 50 Kbps per session on average, but also plan for peak bandwidth RDSH Server Virtual Machine Optimization To optimize the RDSH server image, you can make several registry and group policy changes using the Horizon 6 RDSH Optimization Tool available on the ware Community Web site. To view the registry changes, you can run process monitor (procmon) and set the filter on WriteRegistry to see which optimizations are being applied. TECHNICAL WHITE PAPER / 20

ware Horizon 6 with View Guest Best Practices for Bandwidth and Storage It is important to optimize the master before the linked clones or full clones are created. Many optimizations can be applied to the guest. These optimizations save precious resources, such as bandwidth and storage. Table 1 shows the settings applied to achieve the best bandwidth and minimize the need for storage resources. More guest optimizations are discussed in the Optimization Guide for Windows 7 and Windows 8 Virtual Desktops in Horizon with View. PARAMETER vcpu Memory Network adapter Storage adapter ware Tools Visual settings Disable services* Group policy settings Other settings CONFIGURATION One for Windows XP and Windows 7 and Windows 8; two for multimediaintensive apps 768 1024 MB for Windows XP 1 GB for 32-bit Windows 7 and Windows 8 2 GB for 64-bit Windows 7 and Windows 8 1.5 2 GB for Windows XP, Windows 7, and 32-bit Windows 8 3 GB for 64-bit Windows 7 and Windows 8 for memory-intensive apps vmxnet3, flexible PVSCSI or LSI Logic SAS Latest installed (also make sure that the balloon driver is functioning properly) Adjust to best performance (can provide an additional 10 20 percent bandwidth savings in WAN environments) Disable animations for Windows maximize and minimize operations Use default cursor for busy and working cursor Windows Update, Super-fetch, Windows Index Disable hibernation Disable system restore Set screensaver to None Turn off clear type Disable fading effects Disable auto-play and external drive caching for quick release (recommended so that these services do not try to pull USB drive information over the WAN) Disable last access timestamps* * Especially important to curtail redo log growth with linked clones. Table 1: Guest Best Practices to Save Bandwidth and Storage Resources TECHNICAL WHITE PAPER / 21

ware Horizon 6 with View PCoIP Settings After optimizing the master, you can adjust PCoIP settings to realize the best user experience. Table 2 lists group policy object (GPO) settings to improve the user experience for WAN environments. But even the default settings save lots of bandwidth. More PCoIP settings are discussed in the ware View 5 with PCoIP: Network Optimization Guide. SETTING DEFAULT RECOMMENDATION DESCRIPTION Session Audio BW limit 500 Kbps 50 100 Kbps Reduces bandwidth usage of audio with usable quality. Maximum frame rate 30 Change to 10 15 based on network settings Provides decent video playback and fast graphics operations in typical WAN conditions. Maximum session bandwidth Set per network conditions and link bandwidth Helps PCoIP better estimate bandwidth, and maximizes the user s experience of sharing the same link. Client-side cache size 250 MB Set per client-side memory available Lets you configure the client-side image cache size. Table 2: PCoIP GPO Settings and Best Practices 3D Graphics Best Practices Horizon 6 with View and PCoIP dynamically adapt to the available CPU and bandwidth resources to present the optimal user experience. Even when multiple s are sharing a single physical GPU, vsphere ensures that the resource is fairly shared between them. As a result, minimal configuration is required to deliver peak performance. Virtual SAN Best Practices Get better performance from your Virtual SAN with these best practices. Use more, smaller SAS disks for capacity tier to get more IOPS. Separate the SSD device access path from the capacity tier path. Use a dedicated SAS HBA for disk-based flash or use a PCI-E or memory slot SSD. Minimize the number of components as much as possible per. Enable View Storage Acceleration. TECHNICAL WHITE PAPER / 22

ware Horizon 6 with View Conclusion This paper presented performance data and best practices for ware Horizon 6 and highlighted some of the new and enhanced features. It describes some of the sizing practices for RDSH virtual machines to get the best performance for RDSH desktop sessions and applications. According to the findings, the vcpu for the RDSH should be configured so that the fits within one NUMA node, and the number of RDSH server instances should allow all hyperthreaded cores to be fully utilized. For RDSH desktop and application sessions, 300 MHz per session provides a good user experience for a medium workload. In Horizon 6 with View, the PCoIP defaults have changed, resulting in a 10 percent improvement in the user experience and 30 percent lower bandwidth usage. Similar results are seen when the desktops are deployed on Virtual SAN 5.5. For the vsga feature, the results illustrated the ability of ware hardware-backed 3D support to scale efficiently, demonstrating the benefits of GPU virtualization and the strength of the ware 3D strategy. TECHNICAL WHITE PAPER / 23

ware Horizon 6 with View References ware View Planner Installation and User Guide ware View Planner download ware View Planner: Measuring True Virtual Desktop Experience at Scale Comprehensive User Experience Monitoring Optimization Guide for Windows 7 and Windows 8 Virtual Desktops in Horizon with View Remote Desktop Protocol 8.0 update for Windows 7 and Windows Server 2008 R2 ware Horizon View 5.2 and Hardware Accelerated 3D Graphics: Performance Study ware View 5 with PCoIP: Network Optimization Guide VDI Benchmarking Using View Planner on ware Virtual SAN (VSAN), part 1 VDI Benchmarking Using View Planner on ware Virtual SAN (VSAN), part 2 VDI Benchmarking Using View Planner on ware Virtual SAN (VSAN), part 3 VDI Performance Benchmarking on ware Virtual SAN 5.5 Simulating different VDI users with View Planner 3.0 ware Horizon with View ware Virtual SAN ware Horizon 6 Reference Architecture ware Horizon 6 RDSH Whitepaper ware Horizon 6 and Hardware Accelerated 3D Graphics - TECHNICAL WHITE PAPER / 24

ware Horizon 6 with View Authors and Contributors Banit Agrawal, Senior Performance Engineer, ware, is an expert in VDI, View, remote display protocols, ware View Planner, performance benchmarking, and performance troubleshooting and has filed several patents addressing these areas. Hari Sivaraman, Senior Member of the Technical Staff, ware, works on 3D rendering performance and CUDA support for ESX. Xing Fu, Member of the Technical Staff, ware Performance Engineering group, works on the performance of virtualization solutions. Pradeep Chikku, Senior Storage Engineer, ware, has experience in file systems, storage, ESX kernel storage stack, storage devices, device drivers, and Fibre Channel storage. Nachiket Karmarkar, Senior Performance Engineer, ware, has expertise in the area of VDI performance, performance automation, and View Planner. Rishi Bidarkar, Director, ware, leads the VDI Performance and View Planner teams. He has filed several patents in the area of VDI performance and display benchmarking. The authors would like to extend their sincere thanks to Nancy Beckus, technical writer and editor in the Technical Marketing team at ware, and Julie Brodeur for their comments, feedback, and improvements in the quality of this white paper. The authors also thank the View Planner team and Warren Ponder for their comments and feedback. To comment on this paper, contact the ware End-User-Computing Technical Marketing team at euc_tech_content_feedback@vmware.com. ware, Inc. 3401 Hillview Avenue Palo Alto CA 94304 USA Tel 877-486-9273 Fax 650-427-5001 www.vmware.com Copyright 2015 ware, Inc. All rights reserved. This product is protected by U.S. and international copyright and intellectual property laws. ware products are covered by one or more patents listed at http://www.vmware.com/go/patents. ware is a registered trademark or trademark of ware, Inc. in the United States and/or other jurisdictions. All other marks and names mentioned herein may be trademarks of their respective companies. Item No: W-TWP-HORZVIEWPERFBESTPRAC-USLET-20150324-WEB