OCIO HIGH RESOLUTION GRAPHICS DESKTOP VIRTUALIZATION POC. Chip Charnley Technical Expert Client Technologies Infrastructure Architecture



Similar documents
ArcGIS Pro: Virtualizing in Citrix XenApp and XenDesktop. Emily Apsey Performance Engineer

Table of Contents. P a g e 2

NVIDIA GRID DASSAULT CATIA V5/V6 SCALABILITY GUIDE. NVIDIA Performance Engineering Labs PerfEngDoc-SG-DSC01v1 March 2016

PNY Professional Solutions NVIDIA GRID - GPU Acceleration for the Cloud

NVIDIA GRID OVERVIEW SERVER POWERED BY NVIDIA GRID. WHY GPUs FOR VIRTUAL DESKTOPS AND APPLICATIONS? WHAT IS A VIRTUAL DESKTOP?

S5445 BUILDING THE BEST USER EXPERIENCE WITH CITRIX XENAPP & NVIDIA GRID THOMAS POPPELGAARD

Performance Testing in Virtualized Environments. Emily Apsey Product Engineer

Design Considerations for Increasing VDI Performance and Scalability with Cisco Unified Computing System

VDI Without Compromise with SimpliVity OmniStack and Citrix XenDesktop

GPU Accelerated XenDesktop 3D Graphics beyond Designers and Engineers

PLANNING FOR DENSITY AND PERFORMANCE IN VDI WITH NVIDIA GRID JASON SOUTHERN SENIOR SOLUTIONS ARCHITECT FOR NVIDIA GRID

S4726 VIRTUALIZATION AN INTRO TO VIRTUALIZATION. Luke Wignall & Jared Cowart Senior Solution Architects GRID

Cisco, Citrix, Microsoft, and NetApp Deliver Simplified High-Performance Infrastructure for Virtual Desktops

User Reports. Time on System. Session Count. Detailed Reports. Summary Reports. Individual Gantt Charts

Virtualization of ArcGIS Pro. An Esri White Paper December 2015

605: Design and implement a desktop virtualization solution based on a mock scenario. Hands-on Lab Exercise Guide

Amazon EC2 XenApp Scalability Analysis

GTC EXPRESS WEBINAR: GETTING THE MOST OUT OF VMWARE HORIZON VIEW VDGA WITH NVIDIA GRID. Steve Harpster - Solution Architect NALA

REMOTE HIGH FIDELITY VISUALIZATION. May 2015 Jeremy Main, Sr. Solution Architect GRID

Hardware Accelerated Graphics for VDI

VDI: What Does it Mean, Deploying challenges & Will It Save You Money?

DELL. Virtual Desktop Infrastructure Study END-TO-END COMPUTING. Dell Enterprise Solutions Engineering

Dell Virtual Remote Desktop Reference Architecture. Technical White Paper Version 1.0

HOW MANY USERS CAN I GET ON A SERVER? This is a typical conversation we have with customers considering NVIDIA GRID vgpu:

AT&T Connect Participant Application & VDI Platform Support

Best Practices for Deploying Virtual Desktop Infrastructure on the Unified Computing System

Remoting Your 3D. Ian Williams Manager, PSG Applied Engineering NVIDIA Corporation.

IBM PureFlex and Atlantis ILIO: Cost-effective, high-performance and scalable persistent VDI

INSTALLATION GUIDE ENTERPRISE DYNAMICS 9.0

Sizing of Virtual Desktop Infrastructures

Designing Fast Virtual Desktops for Healthcare

GRIDCENTRIC VMS TECHNOLOGY VDI PERFORMANCE STUDY

DIABLO TECHNOLOGIES MEMORY CHANNEL STORAGE AND VMWARE VIRTUAL SAN : VDI ACCELERATION

Citrix XenDesktop & XenApp

GTC 2015 S5128 Case Study: Georgia Tech Uses Citrix XenApp with NVIDIA GRID to Deliver Engineering Applications

Citrix XenDesktop Modular Reference Architecture Version 2.0. Prepared by: Worldwide Consulting Solutions

The impact of virtualization security on your VDI environment

Dell Desktop Virtualization Solutions Stack with Teradici APEX 2800 server offload card

GPU virtualization with Citrix XenDesktop, using NVIDIA GRID graphics board on VMware vsphere 6

Get the Best out of NVIDIA GPUs for 3D Design and Engineering in the Cloud

NVIDIA GRID VGPU Steve Harpster SA NALA October 2013

VDI Without Compromise with SimpliVity OmniStack and VMware Horizon View

Boost your VDI Confidence with Monitoring and Load Testing

HDX 3D Version 1.0 Release Notes

Proactive VDI Performance Monitoring

CLOUD GAMING WITH NVIDIA GRID TECHNOLOGIES Franck DIARD, Ph.D., SW Chief Software Architect GDC 2014

Monitoring Databases on VMware

CUSTOMER EXPERIENCES WITH GPU VIRTUALIZATION AND 3D REMOTING

Reviewer s Guide for Remote 3D Graphics Apps

Delivering 3D Graphics from the Private or Public Cloud

Optimizing service assurance for XenServer virtual infrastructures with Xangati

Intel Cloud Builders Guide to Cloud Design and Deployment on Intel Platforms

High Performance Computing in CST STUDIO SUITE

Cisco Desktop Virtualization with UCS: A Blueprint for Success

Implementation of a Collaborative Engineering Data Management System. Tony Metz Fermilab 04 May 2015

Lab Validations: Optimizing Storage for XenDesktop with XenServer IntelliCache Reducing IO to Reduce Storage Costs

Why ClearCube Technology for VDI?

S Hands on Tutorial: Deploying GRID in Citrix and VMware Virtual Desktop Environments

How To Test For Performance And Scalability On A Server With A Multi-Core Computer (For A Large Server)

Thin Client & Virtual Desktop White Paper

Virtual Desktop Infrastructure Optimization with SysTrack Monitoring Tools and Login VSI Testing Tools

Desktop Virtualization. The back-end

Computer Graphics Hardware An Overview

Citrix XenServer 7 Feature Matrix

Autodesk Revit 2016 Product Line System Requirements and Recommendations

VMware End User Computing

MED 0115 Optimizing Citrix Presentation Server with VMware ESX Server

Praktijkexamen met Project VRC. Virtual Reality Check

Cisco Unified Computing System, Citrix XenDesktop, and Atlantis ILIO

Diablo and VMware TM powering SQL Server TM in Virtual SAN TM. A Diablo Technologies Whitepaper. May 2015

Virtual Desktop Infrastructure (VDI) made Easy

Vintage IT Services MSRP Pricing PaaS Catalog Pricing Service Categories MSRP Pricing

Detailed Lab Report DR101115D. Citrix XenDesktop 4 vs. VMware View 4 using Citrix Branch Repeater and Riverbed Steelhead

System requirements for Autodesk Building Design Suite 2017

Performance Comparison of Fujitsu PRIMERGY and PRIMEPOWER Servers

Goliath Performance Monitor Prerequisites v11.6

GRID VGPU FOR CITRIX XENSERVER

Performance in the Infragistics WebDataGrid for Microsoft ASP.NET AJAX. Contents. Performance and User Experience... 2

GRID VGPU FOR VMWARE VSPHERE

DIR Contract Number DIR-SDD-2263 Appendix C Pricing Index (per Amendment 6)

Dragon Medical Enterprise Network Edition Technical Note: Requirements for DMENE Networks with virtual servers

Virtualization Technologies. Embrace the new world of healthcare

Atlantis HyperScale VDI Reference Architecture with Citrix XenDesktop

BLACKBOARD LEARN TM AND VIRTUALIZATION Anand Gopinath, Software Performance Engineer, Blackboard Inc. Nakisa Shafiee, Senior Software Performance

Figure 1: RotemNet Main Screen

System Requirements Table of contents

Vocera Voice 4.3 and 4.4 Server Sizing Matrix

GETTING THE PERFORMANCE YOU NEED WITH VDI AND BYOD

PDFDumps. PDFDumps can solve all your IT exam problems and broaden your knowledge

Technical Guide to Application Delivery Option(s) for XenApp and/or XenDesktop

System Requirements for Microsoft Dynamics GP 2015

simplify monitoring Environment Prerequisites for Installation Simplify Monitoring 11.4 (v11.4) Document Date: January

Transcription:

HIGH RESOLUTION GRAPHICS DESKTOP VIRTUALIZATION POC Chip Charnley Technical Expert Client Technologies Infrastructure Architecture

Ford Who am I? PoC Description PoC results details Summary Q & A 2 of 33

Ford Motor Company, a global automotive industry leader based in Dearborn, Mich., Manufactures or distributes automobiles across six continents. With about 186,000 employees and 65 plants worldwide, the company s automotive brands include Ford and Lincoln. The company provides financial services through Ford Motor Credit Company. For more information regarding Ford and its products worldwide, please visit www.corporate.ford.com. 3

Who am I? Chip Charnley Worked at Ford since 1989 Currently part of the Infrastructure Architecture group as the Client Technologies Technical Expert. First client technology was to build a Windows 3.1 IP based client using QEMM and FTP software in 1995. Spent a significant portion of the last 5 years focused on implementing Virtual Desktops at Ford. 4

Ford and HRG VDI Ford has been investigating High Resolution Graphics (HRG) Desktop Virtualization off and on for close to 8 years although the reasons have morphed over time. Investigation started as a way to improve software delivery/patch management especially when 1000 s of machines globally had to be updated in one weekend. Most recently the driving reasons have been protection of intellectual property when dealing with joint venture and supplier access to Ford data sources and global performance to centralized data centers (latency driven performance and data load timing). 5

Ford and HRG VDI Two prior Proofs of Concept (POC) ended in failure due to unacceptable infrastructure cost per user and unacceptable end user experience due to high latency to global user locations. The Siemens Team Center Visualization demo in the Synergy 2013 keynote convinced me that the technology was finally available to solve most of Ford s end-user experience/ performance issues at a price point that the business would find acceptable. So I sought out those who had been involved in prior efforts or had expressed recent interest to participate in a new PoC in 2014 6

2014 PoC Ford partnered with Citrix and Cisco PoC architecture was driven by already in place non-hrg VDI architecture @ Ford. PoC consisted of two phases Benchmark XenApp (XA) and XenDesktop (XD) on various infrastructure configurations Execute end-user testing on a target configuration to determine if the end-user experience was acceptable The rest of this presentation will primarily focus on the benchmark phase of the PoC 7

PoC team The core technical team was made up of myself and Engineers from Citrix and Cisco. Cisco also provided project management support architecture design support Ford also provided network design/services to connect the PoC environment to the Ford production network project management support SME s to test end-user experience w/ selected apps 8

PoC Hardware BoM Design based on existing VDI implementation @ Ford Replaced blades with 4x C-240M3s Servers (2x E5-2680v2 2.8GHz CPUs) to support nvidia GRID cards One used for virtualized management servers 3 used for various XA and XD configurations Added nvidia GRID GPU cards 4 NVIDIA GRID K2 2 NVIDIA GRID K1 Cisco rack network w/ Nexus switches and Cisco firewalls to connect to the Ford Intranet 9

PoC Infrastructure Software BoM Design based on existing VDI implementation @ Ford but most software versions were updated to adequately support the nvidia GRID cards Utilized XenServer 6.2 SP1 (Hotfixes XS62ESP1003, XS62ESP1005,XS62ESP1008) vs. ESX XenDesktop 7.5 XenApp 7.5 Windows Server 2012R2 Cisco UCS 2.2.2c 10

Why Benchmark? No good industry data on optimal XA/XD hardware configuration w/ nvidia GRID required to meet Ford requirements. Determine for XA Which is better, physical or virtual servers what combination of RAM, CPU, & GPU (K1/K2) is likely to provide the best user density and performance mix Determine for XD what combination of RAM, CPU, GPU (K1/K2) and vgpu profile is likely to provide the best user density and performance mix 11

Benchmark Methodology Utilized RedWay s RedTurbine demo program to generate load. RedTurbine was chosen because: Reasonable facsimile of a CAD workload Ability to set to run continuously Heavy load = continuous run Medium load = Random start, random run-time, random restart intended to more accurately simulate real usage Utilized 4 laptops to open end-user sessions 1 dedicated to a single user experience 3 used to generate as many sessions as required to meet a given benchmark target. 12

Benchmark Methodology (cont.) Key performance data points collected A qualitative (subjective human) assessment of the visual performance (Excellent, Good, Degraded, Poor). Time required for a full cycle of the RedTurbine demo. (manually timed with a stopwatch) Physical CPU utilization % Physical GPU utilization % vcpu utilization % Above recorded for 1, 8/10, 16, 30 & 64 users Human SMEs from 3 target applications provided feedback on end-user experience. 13

XenApp Benchmark results Results were surprising: A physical server implementation with uncontrolled allocation of multiple GPUs to the processes significantly outperformed multiple VM instances with single dedicated GPU per instance. Benchmarking shows 30 users per server as acceptable and 64 users as unacceptable Optimum user density will always depend on the application configuration and usage patterns The ability of this configuration to send commands to the GPU appears to be the bottleneck as GPU utilization never exceeded 77% regardless of user density. 14

Execution Time (lower is better) XenApp Data Charts XA RedTurbine Benchmark 5:45 5:16 4:48 Except as noted below and the one user scenario, the GPUs were generally utilized evenly across all GPUs. In all scenarios except the one user scenario, the GPU utilization never went above 77%! 4:19 3:50 3:21 The Physical 2xK2 Heavy @ 30 users went south due to something new - For the first and only time we saw CPU queuing and lopsided utilization of the GPUs. There were also indications that NUMA transitions were part of the problem. 2:52 1 10 30 64 # of users XenApp Physical 1xK1 Heavy XenApp Physical 1xK2 Heavy XenApp Physical 2xK2 Heavy XenApp Physical 2xK2 Medium 15 Markers are color coded to Qualitative assessment of video display Green = Excellent Yellow = Good Red = Degraded Purple = Poor

XenDesktop Benchmark results Initial tests quickly showed that K1 cards were not going to provide the level of user experience desired at the CAD/CAE level Given the server CPU and RAM available, the K220Q vgpu profile with dual nvidia GRID K2 cards provided the best utilization of all resources (CPU, RAM and GPU) Acceptable performance across 30 users at moderate usage Max user density of 32 users for K220Q could not be tested due to RAM limitations Testing of other vgpu profiles at max user density suggests that increasing RAM to get to 32 users for K220Q might result in lower end-user experience (due to slower memory bus speed) and significant additional cost with a lot of RAM left unused. SME end-user experience performance perception showed differences between the RedTurbine results and the testing of the 3 target applications 16

Execution Time XenDesktop Data Charts XD RedTurbine Benchmark 3:50 3:43 3:36 3:28 3:21 3:14 3:07 3:00 1 8 16 30 # of users K140Q Heavy Load 1xK1 K220Q Heavy Load 1xK2 K240Q Heavy Load 2xK2 K220Q Random Load 2xK2 17

Server GPU Utilization XenDesktop Data Charts Server CPU Utilization XD RedTurbine Benchmark XD RedTurbine Benchmark 100% 100% 90% 90% 80% 80% 70% 70% 60% 60% 50% 50% 40% 40% 30% 30% 20% 20% 10% 10% 0% 1 8 16 30 0% 1 8 16 30 # of users # of users K140Q Heavy Load 1xK1 K220Q Heavy Load 1xK2 K240Q Heavy Load 2xK2 K220Q Random Load 2xK2 K140Q Heavy Load 1xK1 K240Q Heavy Load 2xK2 K220Q Heavy Load 1xK2 K220Q Random Load 2xK2 Markers are color coded to Qualitative assessment of video display Green = Excellent Yellow = Good Red = Degraded Purple = Poor Shows that the hardware is optimized with a K220Q vgpu profile and 30 users. 18

XD Latency Testing CAD App1 CAE App1 CAD App2 225ms 250ms 275ms 300ms 325 ms 350ms 375ms Green is acceptable performance Yellow is potentially acceptable performance Red is unacceptable performance This information is not from the RedTurbine benchmark but from actual SME (human) visual assessment of end-user experience. This is interesting because the industry information that could be found in early 2014 suggested to the PoC team that the top end acceptable latency would be in the 200ms-250ms range. 19

Summary Both XD and XA appear capable of providing acceptable performance while increasing end user productivity by as much as an order of magnitude (reduced data load times). Both appear to have user density (per user cost) that will be justifiable for the targeted Ford business cases. While the benchmark points to a K220Q profile as being the most cost effective, other testing clearly indicates higher capability/cost user profiles will be required to support some use cases. The impact of each application and the end user usage pattern on user density cannot be over stated. 20

Q&A Chip ccharnle@ford.com Charnley 21