Performance Tuning EC2 Instances Brendan Gregg, Performance Engineering, Netflix
|
|
|
- Theodore Maxwell
- 10 years ago
- Views:
Transcription
1 PFC306 Performance Tuning EC2 Instances Brendan Gregg, Performance Engineering, Netflix November 12, 2014 Las Vegas, NV 2014 Amazon.com, Inc. and its affiliates. All rights reserved. May not be copied, modified, or distributed in whole or in part without the express consent of Amazon.com, Inc.
2 Brendan Gregg Senior Performance Architect, Netflix Linux and FreeBSD performance On the Performance Engineering Team, led by Coburn Watson (and we re hiring!) Recent work: Linux perf-tools, using ftrace & perf_events Previous work includes: USE Method, flame graphs, heat maps, DTrace tools
3 Massive Amazon EC2 Linux cloud Tens of thousands of server instances Auto scale by ~3k each day CentOS and Ubuntu FreeBSD for content delivery Approx. 33% of US Internet traffic at night Performance is critical Customer satisfaction: now over 50M subscribers
4 Netflix Performance Evaluate technologyteam Engineering Recommendations & best practices Instance kernel tuning, assist app tuning Develop performance tools Instance types, Amazon EC2 options Develop tools for observability and analysis Project support New database, programming language, software change
5 Agenda 1. Instance Selection 2. Amazon EC2 Features 3. Kernel Tuning 4. Observability
6 Performance Tuning on In the Netflix cloud, Amazon EC2everything is a tunable Including instance type Performance wins have immediate benefits Great place to do performance engineering!
7 WARNINGS This is what s in our medicine cabinet Consider these best before: 2015 Take only if prescribed by a performance engineer
8 1. Instance Selection
9 The Netflix Cloud Many different application workloads: compute, storage, caching EC2 ELB Cassandra Applications (Services) S3 Elasticsearch EVCache SES SQS
10 Current Generation Instance i2: Storage-optimized Families r3: Memory optimized Lowest cost/gbyte c3: Compute-optimized SSD large capacity storage Latest CPUs, lowest price/compute perf m3: General purpose Balanced i2.8xlarge
11 Instance Sizes Ranges from medium to 8xlarge, depending on type Netflix has over 30 instance types in use Traditional: Tune the workload to match the server Cloud: Find an ideal workload and instance type combination Instead of: given A, optimize B; this is optimize A+B
12 Netflix Instance Type Flow Chart Selection By-Resource Brute Force
13 Instance Selection Flow Chart Start Exceptions Y N i2 Can cache? N Disk I/O bound? Y Select memory to cache working set N Find best balance Memory Y Need large disk capacity? Trade-off r3 l 8x m3 l c3 CPU
14 Netflix AWS Environment Elastic Load Balancing allows instance types to be tested with real load 1. Single instance canary, then, 2. Auto Scaling Group Much better than micro-benchmarking alone, which is extremely error prone ASG Cluster prod1 ELB Canary ASG-v010 ASG-v011 Instance Instance Instance Instance Instance Instance
15 By-Resource Approach 1. Determine bounding resource Eg: CPU, disk I/O, or network I/O Found using: Estimation (expertise) Resource observability with an existing real workload 2. Resource observability with a benchmark or load test (experimentation) Choose instance type for the bounding resource
16 Nomogram Visualization Tool
17 Nomogram Visualization Tool Select instance families From any desired resource, see types & cost Select resources
18 Nomogram Visualization Tool eg, 8 vcpu:
19 By-Resource Approach, cont. This focuses on optimizing a given workload More efficiency can be found by adjusting the workload to suit different instance types
20 Brute Force Choice 1. Run load test on ALL instance types 2. Optionally different workload configurations as well Measure throughput And check for acceptable latency 3. Calculate price/performance for all types 4. Choose most efficient type
21 Latency Requirements Check for an acceptable latency distribution when optimizing for price/performance Average Latency (Response Time) Headroom Acceptable measure price/performance Throughput Unacceptable
22 Netflix Instance Type ReVariance selection Usage 3. Cost
23 1. Instance Variance An instance type may be resource constrained only occasionally, or after warmup, or a code change Continually monitor performance, analyze variance/outliers
24 2. Instance Usage Older instance type usage can be quickly identified using internal tools, reevaluated, and upgraded to newer types
25 3. Instance Cost Also checked regularly. Tuning the price in price/perf.
26 3. Instance Cost Also checked regularly. Tuning the price in price/perf. Cost per hour Services
27 2. EC2 Features
28 EC2 Features A) Xen Modes: HVM, PV B) SR-IOV
29 Xen Modes The best performance is the latest PV/HVM hybrid mode that Xen supports On Amazon EC2: HVM == PVHVM if your Linux kernel version supports it PV == PV Current fastest on Amazon EC2, in general: HVM (PVHVM) Future fastest, in general: should be PVH WARNINGS: Your mileage may vary, and beware of outdated info.
30 Xen Modes
31 SR-IOV Single Root I/O Virtualization (SR-IOV) PCIe device provides virtualized instances AWS Enhanced Networking Available for some instance types: C3, R3, I2 VPC only Example improvements: Improved network throughput Reduced average network RTT, and reduced jitter
32 3. Kernel Tuning
33 Kernel Tuning Typically 5-25% wins, for average performance Adds up to significant savings for the Netflix cloud Bigger wins when reducing latency outliers Deploying tuning: Generic performance tuning is baked into our base AMI Experimental tuning is a package add-on (nflx-kernel-tunables) Workload specific tuning is configured in application AMIs Remember to tune the workload with the tunables
34 OS Distributions Netflix uses both CentOS and Ubuntu Operating system distributions can override tuning defaults Different kernel versions can provide different defaults, and additional tunables Tunable parameters include sysctl and /sys The following tunables provide a look into our current medicine cabinet. Your mileage may vary. Only take if prescribed by a performance engineer.
35 Tuning Targets 1. CPU Scheduler 2. Virtual Memory 3. Huge Pages 4. File System 5. Storage I/O 6. Networking 7. Hypervisor (Xen)
36 1. CPU Scheduler Tunables: Scheduler class, priorities, migration latency, tasksets Usage: Some Java apps have benefited from SCHED_BATCH, to reduce schedtool B switching. PID context E.g.: # Some apps benefit from reducing migrations using taskset(1), numactl(8), cgroups, and tuning sched_migration_cost_ns
37 2. Virtual Memory Tunables: swappiness, overcommit, OOM behavior Usage: Swappiness is set to zero to disable swapping and favor ditching the file system page cache first to free memory. (This tunable doesn t make much difference, as swap devices are usually absent.)= 0 vm.swappiness # from 60
38 3. Huge Pages Page size of 2 or 4 Mbytes, instead of 4k, should reduce various CPU overheads and improve MMU page translation cache reach. Tunables: Explicit huge page usage, transparent huge pages (THPs) Usage: THPs (enabled in later kernels), depending on the workload & CPUs, sometimes improve perf (~5% lower CPU), but sometimes hurt perf (~25% during %usr, and more during %sys refrag). To# disable: echohigher never CPU > /sys/kernel/mm/transparent_hugepage/enabled from madvise #
39 4. File System Tunables: page cache flushing behavior, file system type and its own tunables (e.g., ZFS on Linux) Usage: Page cache flushing is tuned to provide a more even behavior: background flush earlier, aggressive flush later vm.dirty_ratio Access timestamps = 80 # other from 40 disabled, and options vm.dirty_background_ratio = 5 # from 10 FS vm.dirty_expire_centisecs = # from 3000 mount -o defaults,noatime,discard,nobarrier depending on the
40 5. Storage I/O Tunables: Read ahead size, number of in-flight requests, I/O scheduler, volume stripe width Usage: Some workloads, e.g., Cassandra, can be sensitive to read ahead size SSDs can perform better with the noop scheduler (if not default already) /sys/block/*/queue/rq_affinity 2 Tuning md chunk size and stripe width /sys/block/*/queue/scheduler noop /sys/block/*/queue/nr_requests 256 /sys/block/*/queue/read_ahead_kb 256 mdadm chunk=64... to match workload
41 6. Networking Tunables: TCP buffer sizes, TCP backlog, device backlog, TCP reuse, Usage: net.core.somaxconn = 1000 net.core.netdev_max_backlog = 5000 net.core.rmem_max = net.core.wmem_max = net.ipv4.tcp_wmem = net.ipv4.tcp_rmem = net.ipv4.tcp_max_syn_backlog = 8096 net.ipv4.tcp_slow_start_after_idle = 0 net.ipv4.tcp_tw_reuse = 1 net.ipv4.ip_local_port_range = net.ipv4.tcp_abort_on_overflow = 1 # maybe
42 7. Hypervisor (Xen) Tunables: PV/HVM (baked into AMI) Kernel clocksource. From slow to fast: hpet, xen, tsc Usage: echo We ve encountered a Xen clocksource regression moving to Ubuntu Trusty; fixed by tuning clocksource to TSC (although beware of clock drift). Best case example (so far): CPU usage tsc > /sys/devices/system/clocksource/clocksource0/current_clocksource reduced by 30%, and average app latency reduced by 43%.
43 4. Observability
44 Observability Finding, quantifying, and confirming tunables Discovering system wins (5-25% s) and application wins (2-10x s)
45 Inefficient Tuning Whack-A-Mole Anti-Method: Tune things at random until the problem goes away Street Light Anti-Method: 1. Pick observability tools that are Familiar Found on the Internet At random 2. Run tools
46 Efficient Tuning Based on observability (data-driven): 1. Observe workload and resource usage 2. Analyze results: 3. Identify software and hardware components in use Study in-use components for tunables Quantify expected speed-up Tune, including experimental, the highest value targets Example methodology: the USE Method
47 USE Method For every hardware and software resource, check: 1. Utilization 2. Saturation 3. Errors Saturation Errors X Resource Utilization (%) Resource constraints show as saturation or high utilization. Resize or change instance type. Investigate tunables for the resource
48 More Inefficient Tuning Blame-Someone-Else Anti-Method 1. Find a system or environment component you are not responsible for, or cannot observe 2. Hypothesize that the issue is with that component 3. Redirect the issue to the responsible team 4. When proven wrong, go to 1 Eg, blaming the instance
49 Exonerating the Instance Observability often exonerates the instance Complex perf issue with no obvious cause: an instance issue? After analysis, sometimes: yes; usually, it s the app (80/20 rule) The 80/20 Rule: 80%: Improvement gained by application refactoring and tuning 20%: OS tuning, instance or infrastructure improvement, etc. There is also the 20/80 Latency Outlier Rule: 20%: Latency outliers are caused by application code
50 Or Finding the Monkey The Netflix Latency Monkey Injects latency in odd places Part of the Simian Army: a test suite for our fault-tolerant architecture
51 Analysis Perspectives Load Workload Analysis: Workload Analysis A.k.a. Top Down Analysis Application Begin with application workload, then break down request time. System Libraries Resource Analysis: System Calls A.k.a. Bottom Up Analysis Kernel Begin with resource performance and then their workloads. Devices E.g., the USE Method, followed by Resource Analysis
52 Instance Observability Tools A. Linux performance tools B. Netflix custom tools
53 A. Linux Performance Tools 1. Statistical tools 2. Profiling tools 3. Tracing tools 4. Hardware counters
54 1. Statistical Tools vmstat, pidstat, sar, etc., used mostly normally $ sar -n TCP,ETCP,DEV 1 Linux (test-e4f1a80b) rxpck/s /18/ :10:43 PM 09:10:44 PM 09:10:44 PM IFACE lo eth0 txpck/s :10:43 PM 09:10:44 PM active/s passive/s :10:43 PM 09:10:44 PM [ ] atmptf/s 0.00 _x86_64_ (8 CPU) rxkb/s txkb/s rxcmp/s txcmp/s iseg/s oseg/s estres/s retrans/s isegerr/s orsts/s 1.00 rxmcst/s
55 2. Profiling Tools Profiling: characterizing usage. E.g.: Sampling on-cpu stack traces to explain CPU usage Frequency counting object types to explain memory usage Profiling leads to tuning Hot code path -> any related tunables/config? Frequent object allocation -> any way to avoid this?
56 Profiling Types Application profiling Depends on application and language E.g., Java Flight Recorder, Yourkit, Lightweight Java Profiler Many tools are inaccurate or broken; test, verify, cross-check System profiling Linux perf_events (the perf command) ftrace can do kernel function counts SystemTap has various profiling capabilities
57 Application Profiling: LJP The Lightweight Java Profiler (LJP) Basic, open source, free, asynchronous CPU profiler Uses an agent that dumps hprof-like output The profiling output can be visualized as flame graphs
58
59 Stack frame Ancestry Mouse-over frames to quantify
60 System Profiling: perf_events perf_events sampling CPU stack traces can show: All application logic # # # # Whether high level logic can be seen depends on the app and VM JVM internals & libraries The Linux kernel git clone cd FlameGraph perf record -F 99 -ag -- sleep 60 perf script./stackcollapse-perf.pl./flamegraph.pl > perf.svg perf CPU flame graphs:
61
62 Kernel TCP/IP Broken Java stacks (missing frame pointer) GC Locks epoll Time Idle thread
63 3. Tracing Tools Many system tracers exist for Linux 1. ftrace 2. perf_events 3. ebpf 4. SystemTap 5. ktap 6. LTTng 7. dtrace4linux 8. Oracle Linux DTrace
64 ftrace Part of the Linux kernel first added in (2008), and enhanced in later releases Use directly via /sys/kernel/debug/tracing Already available in all Netflix Linux instances Front-end tools aid usage: perf-tools collection Unsupported hacks: see WARNINGs Also see the trace-cmd front-end, as well as perf
65 ftrace tool: iosnoop #./iosnoop ts Tracing block I/O. Ctrl-C to end. STARTs ENDs COMM supervise supervise supervise supervise [ ] PID TYPE W W W W DEV 202,1 202,1 202,1 202,1 BLOCK BYTES LATms #./iosnoop h USAGE: iosnoop [-hqst] [-d device] [-i iotype] [-p PID] [-n name] [duration] -d device # device string (eg, "202,1) -i iotype # match type (eg, '*R*' for all reads) -n name # process name to match on I/O issue -p PID # PID to match on I/O issue -Q # include queueing time in LATms -s # include start time of I/O (s) -t # include completion time of I/O (s) [ ]
66 perf_events Part of the Linux source Added from linux-tools-common, etc. Powerful multi-tool and profiler interval sampling, CPU performance counter events user and kernel dynamic tracing kernel line tracing and local variables (debuginfo) kernel filtering, and in-kernel counts (perf stat) Needs kernel debuginfo for advanced uses
67 perf_events Example # perf record e skb:consume_skb ag -- sleep 10 # perf report [...] 74.42% swapper [kernel.kallsyms] [k] consume_skb --- consume_skb arp_process arp_rcv Summarizing stack traces for a netif_receive_skb_core tracepoint netif_receive_skb netif_receive_skb virtnet_poll perf_events can do many things, net_rx_action it is hard to pick just one example do_softirq irq_exit do_irq ret_from_intr [ ]
68 4. Hardware Counters Model Specific Registers (MSRs) Basic details: timestamp clock, temperature, power Some are available in Amazon EC2 Performance Monitoring Counters (PMCs) Advanced details: cycles, stall cycles, cache misses Not available in Amazon EC2 (by default) Root cause CPU usage at the cycle level E.g., higher CPU usage due to more memory stall cycles
69 MSRs Can be used to verify real CPU clock rate Can vary with turboboost. Important to know for perf comparisons. Tool from ec2-guest#./showboost CPU MHz : 2500 Turbo MHz : 2900 (10 active) Turbo Ratio : 116% (10 active) CPU 0 summary every 5 seconds... TIME 06:11:35 06:11:40 06:11:45 [...] C0_MCYC C0_ACYC Real CPU MHz UTIL 51% 50% 49% RATIO 116% 115% 115% MHz
70 B. Netflix Online Tools Netflix develops many online web-based tools for cloud-wide performance observability Per-instance capable tools include: Atlas Vector
71 Atlas Cloud-wide and instance monitoring. Resource and app metrics, trends.
72 Atlas Cloud-wide and instance monitoring. Resource and app metrics, trends. Region Breakdowns App Interactive Graph Metrics Options Summary Statistics
73 Atlas All metrics in one system System metrics: Application metrics: CPU usage, disk I/O, memory latency percentiles, errors Filters or breakdowns by region, application, ASG, metric, instance Quickly narrow an investigation to an instance or resource type
74 Vector Real-time per-second instance metrics. On-demand profiling.
75 Vector Real-time per-second instance metrics. On-demand profiling. Utilization Per device Breakdowns Saturation Errors
76 Vector Given an instance, analyze low-level performance On-demand: CPU flame graphs, heat maps, ftrace metrics, and SystemTap metrics Quick: GUI-driven root cause analysis Scalable: other teams can use it easily Currently in development (beta-use)
77 Summary 1. Instance Selection 2. Amazon EC2 Features 3. Kernel Tuning 4. Observability
78 References & Links Amazon EC2: Netflix on EC2: Xen Modes: More Performance Analysis:
79 Thanks Netflix Performance Engineering Team Coburn Watson Scott Emmons: nomogram visualization, insttypes Martin Spier: Vector Amer Ather: tracing, Vector Vadim Filanovsky: AWS cost reporting Netflix Insight Engineering Team Roy Rapoport, etc: Atlas Amazon Web Services Bryan Nairn, Callum Hughes
80 Netflix talks at re:invent Talk Time Title PFC-305 Wednesday, 1:15pm Embracing Failure: Fault Injection and Service Reliability BDT-403 Wednesday, 2:15pm Next Generation Big Data Platform at Netflix PFC-306 Wednesday, 3:30pm Performance Tuning EC2 DEV-309 Wednesday, 3:30pm From Asgard to Zuul, How Netflix s proven Open Source Tools can accelerate and scale your services ARC-317 Wednesday, 4:30pm Maintaining a Resilient Front-Door at Massive Scale PFC-304 Wednesday, 4:30pm Effective Inter-process Communications in the Cloud: The Pros and Cons of Micro Services Architectures ENT-209 Wednesday, 4:30pm Cloud Migration, Dev-Ops and Distributed Systems APP-310 Friday, 9:00am Scheduling using Apache Mesos in the Cloud
81 Please give us your feedback on this presentation PFC306 Join the conversation on Twitter with #reinvent 2014 Amazon.com, Inc. and its affiliates. All rights reserved. May not be copied, modified, or distributed in whole or in part without the express consent of Amazon.com, Inc.
From Clouds to Roots. Brendan Gregg. Senior Performance Architect Performance Engineering Team. September, 2014. bgregg@ne5lix.
From Clouds to Roots Brendan Gregg Senior Performance Architect Performance Engineering Team [email protected], @brendangregg September, 2014 Root Cause Analysis at Ne5lix Devices Zuul Load Instances (Linux)
Stop the Guessing. Performance Methodologies for Production Systems. Brendan Gregg. Lead Performance Engineer, Joyent. Wednesday, June 19, 13
Stop the Guessing Performance Methodologies for Production Systems Brendan Gregg Lead Performance Engineer, Joyent Audience This is for developers, support, DBAs, sysadmins When perf isn t your day job,
Delivering Quality in Software Performance and Scalability Testing
Delivering Quality in Software Performance and Scalability Testing Abstract Khun Ban, Robert Scott, Kingsum Chow, and Huijun Yan Software and Services Group, Intel Corporation {khun.ban, robert.l.scott,
Best Practices for Monitoring Databases on VMware. Dean Richards Senior DBA, Confio Software
Best Practices for Monitoring Databases on VMware Dean Richards Senior DBA, Confio Software 1 Who Am I? 20+ Years in Oracle & SQL Server DBA and Developer Worked for Oracle Consulting Specialize in Performance
Migration Scenario: Migrating Batch Processes to the AWS Cloud
Migration Scenario: Migrating Batch Processes to the AWS Cloud Produce Ingest Process Store Manage Distribute Asset Creation Data Ingestor Metadata Ingestor (Manual) Transcoder Encoder Asset Store Catalog
Amazon EC2 Product Details Page 1 of 5
Amazon EC2 Product Details Page 1 of 5 Amazon EC2 Functionality Amazon EC2 presents a true virtual computing environment, allowing you to use web service interfaces to launch instances with a variety of
Cloud Operating Systems for Servers
Cloud Operating Systems for Servers Mike Day Distinguished Engineer, Virtualization and Linux August 20, 2014 [email protected] 1 What Makes a Good Cloud Operating System?! Consumes Few Resources! Fast
CON9577 Performance Optimizations for Cloud Infrastructure as a Service
CON9577 Performance Optimizations for Cloud Infrastructure as a Service John Falkenthal, Software Development Sr. Director - Oracle VM SPARC and VirtualBox Jeff Savit, Senior Principal Technical Product
DISTRIBUTED SYSTEMS [COMP9243] Lecture 9a: Cloud Computing WHAT IS CLOUD COMPUTING? 2
DISTRIBUTED SYSTEMS [COMP9243] Lecture 9a: Cloud Computing Slide 1 Slide 3 A style of computing in which dynamically scalable and often virtualized resources are provided as a service over the Internet.
Perf Tool: Performance Analysis Tool for Linux
/ Notes on Linux perf tool Intended audience: Those who would like to learn more about Linux perf performance analysis and profiling tool. Used: CPE 631 Advanced Computer Systems and Architectures CPE
These sub-systems are all highly dependent on each other. Any one of them with high utilization can easily cause problems in the other.
Abstract: The purpose of this document is to describe how to monitor Linux operating systems for performance. This paper examines how to interpret common Linux performance tool output. After collecting
Preparing Your IT for the Holidays. A quick start guide to take your e-commerce to the Cloud
Preparing Your IT for the Holidays A quick start guide to take your e-commerce to the Cloud September 2011 Preparing your IT for the Holidays: Contents Introduction E-Commerce Landscape...2 Introduction
KVM PERFORMANCE IMPROVEMENTS AND OPTIMIZATIONS. Mark Wagner Principal SW Engineer, Red Hat August 14, 2011
KVM PERFORMANCE IMPROVEMENTS AND OPTIMIZATIONS Mark Wagner Principal SW Engineer, Red Hat August 14, 2011 1 Overview Discuss a range of topics about KVM performance How to improve out of the box experience
Performance Management for Cloudbased STC 2012
Performance Management for Cloudbased Applications STC 2012 1 Agenda Context Problem Statement Cloud Architecture Need for Performance in Cloud Performance Challenges in Cloud Generic IaaS / PaaS / SaaS
Full and Para Virtualization
Full and Para Virtualization Dr. Sanjay P. Ahuja, Ph.D. 2010-14 FIS Distinguished Professor of Computer Science School of Computing, UNF x86 Hardware Virtualization The x86 architecture offers four levels
Mark Bennett. Search and the Virtual Machine
Mark Bennett Search and the Virtual Machine Agenda Intro / Business Drivers What to do with Search + Virtual What Makes Search Fast (or Slow!) Virtual Platforms Test Results Trends / Wrap Up / Q & A Business
JVM Performance Study Comparing Oracle HotSpot and Azul Zing Using Apache Cassandra
JVM Performance Study Comparing Oracle HotSpot and Azul Zing Using Apache Cassandra January 2014 Legal Notices Apache Cassandra, Spark and Solr and their respective logos are trademarks or registered trademarks
Technical Paper. Moving SAS Applications from a Physical to a Virtual VMware Environment
Technical Paper Moving SAS Applications from a Physical to a Virtual VMware Environment Release Information Content Version: April 2015. Trademarks and Patents SAS Institute Inc., SAS Campus Drive, Cary,
PERFORMANCE TUNING ORACLE RAC ON LINUX
PERFORMANCE TUNING ORACLE RAC ON LINUX By: Edward Whalen Performance Tuning Corporation INTRODUCTION Performance tuning is an integral part of the maintenance and administration of the Oracle database
Chapter 14 Virtual Machines
Operating Systems: Internals and Design Principles Chapter 14 Virtual Machines Eighth Edition By William Stallings Virtual Machines (VM) Virtualization technology enables a single PC or server to simultaneously
Practical Performance Understanding the Performance of Your Application
Neil Masson IBM Java Service Technical Lead 25 th September 2012 Practical Performance Understanding the Performance of Your Application 1 WebSphere User Group: Practical Performance Understand the Performance
Mission-Critical Java. An Oracle White Paper Updated October 2008
Mission-Critical Java An Oracle White Paper Updated October 2008 Mission-Critical Java The Oracle JRockit family of products is a comprehensive portfolio of Java runtime solutions that leverages the base
Nested Virtualization
Nested Virtualization Dongxiao Xu, Xiantao Zhang, Yang Zhang May 9, 2013 Agenda Nested Virtualization Overview Dive into Nested Virtualization Details Nested CPU Virtualization Nested MMU Virtualization
IMCM: A Flexible Fine-Grained Adaptive Framework for Parallel Mobile Hybrid Cloud Applications
Open System Laboratory of University of Illinois at Urbana Champaign presents: Outline: IMCM: A Flexible Fine-Grained Adaptive Framework for Parallel Mobile Hybrid Cloud Applications A Fine-Grained Adaptive
Distributed Scheduling with Apache Mesos in the Cloud. PhillyETE - April, 2015 Diptanu Gon Choudhury @diptanu
Distributed Scheduling with Apache Mesos in the Cloud PhillyETE - April, 2015 Diptanu Gon Choudhury @diptanu Who am I? Distributed Systems/Infrastructure Engineer in the Platform Engineering Group Design
Deep Dive: Maximizing EC2 & EBS Performance
Deep Dive: Maximizing EC2 & EBS Performance Tom Maddox, Solutions Architect 2015, Amazon Web Services, Inc. or its affiliates. All rights reserved What we ll cover Amazon EBS overview Volumes Snapshots
Monitoring Databases on VMware
Monitoring Databases on VMware Ensure Optimum Performance with the Correct Metrics By Dean Richards, Manager, Sales Engineering Confio Software 4772 Walnut Street, Suite 100 Boulder, CO 80301 www.confio.com
THE DEFINITIVE GUIDE FOR AWS CLOUD EC2 FAMILIES
THE DEFINITIVE GUIDE FOR AWS CLOUD EC2 FAMILIES Introduction Amazon Web Services (AWS), which was officially launched in 2006, offers you varying cloud services that are not only cost effective, but also
DataStax Enterprise, powered by Apache Cassandra (TM)
PerfAccel (TM) Performance Benchmark on Amazon: DataStax Enterprise, powered by Apache Cassandra (TM) Disclaimer: All of the documentation provided in this document, is copyright Datagres Technologies
FlashSoft Software from SanDisk : Accelerating Virtual Infrastructures
Technology Insight Paper FlashSoft Software from SanDisk : Accelerating Virtual Infrastructures By Leah Schoeb January 16, 2013 FlashSoft Software from SanDisk: Accelerating Virtual Infrastructures 1 FlashSoft
Deploying Splunk on Amazon Web Services
Copyright 2014 Splunk Inc. Deploying Splunk on Amazon Web Services Simeon Yep Senior Manager, Business Development Technical Services Roy Arsan Senior SoHware Engineer Disclaimer During the course of this
Web Application Deployment in the Cloud Using Amazon Web Services From Infancy to Maturity
P3 InfoTech Solutions Pvt. Ltd http://www.p3infotech.in July 2013 Created by P3 InfoTech Solutions Pvt. Ltd., http://p3infotech.in 1 Web Application Deployment in the Cloud Using Amazon Web Services From
Using WebSphere Application Server on Amazon EC2. Speaker(s): Ed McCabe, Arthur Meloy
Using WebSphere Application Server on Amazon EC2 Speaker(s): Ed McCabe, Arthur Meloy Cloud Computing for Developers Hosted by IBM and Amazon Web Services October 1, 2009 1 Agenda WebSphere Application
Large-scale performance monitoring framework for cloud monitoring. Live Trace Reading and Processing
Large-scale performance monitoring framework for cloud monitoring Live Trace Reading and Processing Julien Desfossez Michel Dagenais May 2014 École Polytechnique de Montreal Live Trace Reading Read the
Virtualization Technologies ORACLE TECHNICAL WHITE PAPER OCTOBER 2015
Virtualization Technologies ORACLE TECHNICAL WHITE PAPER OCTOBER 2015 Table of Contents Introduction 3 Designing a Consolidated Infrastructure 6 Seven Areas of Consideration for Consolidation 6 Security
Benchmarking Hadoop & HBase on Violin
Technical White Paper Report Technical Report Benchmarking Hadoop & HBase on Violin Harnessing Big Data Analytics at the Speed of Memory Version 1.0 Abstract The purpose of benchmarking is to show advantages
Cloud Computing. Adam Barker
Cloud Computing Adam Barker 1 Overview Introduction to Cloud computing Enabling technologies Different types of cloud: IaaS, PaaS and SaaS Cloud terminology Interacting with a cloud: management consoles
11.1 inspectit. 11.1. inspectit
11.1. inspectit Figure 11.1. Overview on the inspectit components [Siegl and Bouillet 2011] 11.1 inspectit The inspectit monitoring tool (website: http://www.inspectit.eu/) has been developed by NovaTec.
GraySort on Apache Spark by Databricks
GraySort on Apache Spark by Databricks Reynold Xin, Parviz Deyhim, Ali Ghodsi, Xiangrui Meng, Matei Zaharia Databricks Inc. Apache Spark Sorting in Spark Overview Sorting Within a Partition Range Partitioner
On- Prem MongoDB- as- a- Service Powered by the CumuLogic DBaaS Platform
On- Prem MongoDB- as- a- Service Powered by the CumuLogic DBaaS Platform Page 1 of 16 Table of Contents Table of Contents... 2 Introduction... 3 NoSQL Databases... 3 CumuLogic NoSQL Database Service...
Cisco Application-Centric Infrastructure (ACI) and Linux Containers
White Paper Cisco Application-Centric Infrastructure (ACI) and Linux Containers What You Will Learn Linux containers are quickly gaining traction as a new way of building, deploying, and managing applications
Relocating Windows Server 2003 Workloads
Relocating Windows Server 2003 Workloads An Opportunity to Optimize From Complex Change to an Opportunity to Optimize There is much you need to know before you upgrade to a new server platform, and time
Evaluation Methodology of Converged Cloud Environments
Krzysztof Zieliński Marcin Jarząb Sławomir Zieliński Karol Grzegorczyk Maciej Malawski Mariusz Zyśk Evaluation Methodology of Converged Cloud Environments Cloud Computing Cloud Computing enables convenient,
Understanding Data Locality in VMware Virtual SAN
Understanding Data Locality in VMware Virtual SAN July 2014 Edition T E C H N I C A L M A R K E T I N G D O C U M E N T A T I O N Table of Contents Introduction... 2 Virtual SAN Design Goals... 3 Data
The Virtualization Practice
The Virtualization Practice White Paper: Managing Applications in Docker Containers Bernd Harzog Analyst Virtualization and Cloud Performance Management October 2014 Abstract Docker has captured the attention
Achieving Nanosecond Latency Between Applications with IPC Shared Memory Messaging
Achieving Nanosecond Latency Between Applications with IPC Shared Memory Messaging In some markets and scenarios where competitive advantage is all about speed, speed is measured in micro- and even nano-seconds.
Small is Better: Avoiding Latency Traps in Virtualized DataCenters
Small is Better: Avoiding Latency Traps in Virtualized DataCenters SOCC 2013 Yunjing Xu, Michael Bailey, Brian Noble, Farnam Jahanian University of Michigan 1 Outline Introduction Related Work Source of
2) Xen Hypervisor 3) UEC
5. Implementation Implementation of the trust model requires first preparing a test bed. It is a cloud computing environment that is required as the first step towards the implementation. Various tools
Java Performance. Adrian Dozsa TM-JUG 18.09.2014
Java Performance Adrian Dozsa TM-JUG 18.09.2014 Agenda Requirements Performance Testing Micro-benchmarks Concurrency GC Tools Why is performance important? We hate slow web pages/apps We hate timeouts
DIABLO TECHNOLOGIES MEMORY CHANNEL STORAGE AND VMWARE VIRTUAL SAN : VDI ACCELERATION
DIABLO TECHNOLOGIES MEMORY CHANNEL STORAGE AND VMWARE VIRTUAL SAN : VDI ACCELERATION A DIABLO WHITE PAPER AUGUST 2014 Ricky Trigalo Director of Business Development Virtualization, Diablo Technologies
Managing Capacity Using VMware vcenter CapacityIQ TECHNICAL WHITE PAPER
Managing Capacity Using VMware vcenter CapacityIQ TECHNICAL WHITE PAPER Table of Contents Capacity Management Overview.... 3 CapacityIQ Information Collection.... 3 CapacityIQ Performance Metrics.... 4
An Oracle White Paper September 2013. Advanced Java Diagnostics and Monitoring Without Performance Overhead
An Oracle White Paper September 2013 Advanced Java Diagnostics and Monitoring Without Performance Overhead Introduction... 1 Non-Intrusive Profiling and Diagnostics... 2 JMX Console... 2 Java Flight Recorder...
Evaluation of 40 Gigabit Ethernet technology for data servers
Evaluation of 40 Gigabit Ethernet technology for data servers Azher Mughal, Artur Barczyk Caltech / USLHCNet CHEP-2012, New York http://supercomputing.caltech.edu Agenda Motivation behind 40GE in Data
Oracle Solaris: Aktueller Stand und Ausblick
Oracle Solaris: Aktueller Stand und Ausblick Detlef Drewanz Principal Sales Consultant, EMEA Server Presales The following is intended to outline our general product direction. It
Virtuoso and Database Scalability
Virtuoso and Database Scalability By Orri Erling Table of Contents Abstract Metrics Results Transaction Throughput Initializing 40 warehouses Serial Read Test Conditions Analysis Working Set Effect of
IOmark- VDI. Nimbus Data Gemini Test Report: VDI- 130906- a Test Report Date: 6, September 2013. www.iomark.org
IOmark- VDI Nimbus Data Gemini Test Report: VDI- 130906- a Test Copyright 2010-2013 Evaluator Group, Inc. All rights reserved. IOmark- VDI, IOmark- VDI, VDI- IOmark, and IOmark are trademarks of Evaluator
EqualLogic PS Series Load Balancers and Tiering, a Look Under the Covers. Keith Swindell Dell Storage Product Planning Manager
EqualLogic PS Series Load Balancers and Tiering, a Look Under the Covers Keith Swindell Dell Storage Product Planning Manager Topics Guiding principles Network load balancing MPIO Capacity load balancing
Performance Benchmark for Cloud Block Storage
Performance Benchmark for Cloud Block Storage J.R. Arredondo vjune2013 Contents Fundamentals of performance in block storage Description of the Performance Benchmark test Cost of performance comparison
Directions for VMware Ready Testing for Application Software
Directions for VMware Ready Testing for Application Software Introduction To be awarded the VMware ready logo for your product requires a modest amount of engineering work, assuming that the pre-requisites
An Oracle White Paper July 2011. Oracle Primavera Contract Management, Business Intelligence Publisher Edition-Sizing Guide
Oracle Primavera Contract Management, Business Intelligence Publisher Edition-Sizing Guide An Oracle White Paper July 2011 1 Disclaimer The following is intended to outline our general product direction.
VDI Optimization Real World Learnings. Russ Fellows, Evaluator Group
Russ Fellows, Evaluator Group SNIA Legal Notice The material contained in this tutorial is copyrighted by the SNIA unless otherwise noted. Member companies and individual members may use this material
Automating Big Data Benchmarking for Different Architectures with ALOJA
www.bsc.es Jan 2016 Automating Big Data Benchmarking for Different Architectures with ALOJA Nicolas Poggi, Postdoc Researcher Agenda 1. Intro on Hadoop performance 1. Current scenario and problematic 2.
Running MySQL on CentOS Linux. Peter Zaitsev April 14, 2014
Running MySQL on CentOS Linux Peter Zaitsev April 14, 2014 About the Presenta9on 2 Cover what you need to run MySQL on CentOS Linux successfully Distribu7on Hardware OS Configura7on MySQL Installa7on MySQL
Xen and XenServer Storage Performance
Xen and XenServer Storage Performance Low Latency Virtualisation Challenges Dr Felipe Franciosi XenServer Engineering Performance Team e-mail: [email protected] freenode: felipef #xen-api twitter:
Amazon Web Services Primer. William Strickland COP 6938 Fall 2012 University of Central Florida
Amazon Web Services Primer William Strickland COP 6938 Fall 2012 University of Central Florida AWS Overview Amazon Web Services (AWS) is a collection of varying remote computing provided by Amazon.com.
Architecture of the Kernel-based Virtual Machine (KVM)
Corporate Technology Architecture of the Kernel-based Virtual Machine (KVM) Jan Kiszka, Siemens AG, CT T DE IT 1 Corporate Competence Center Embedded Linux [email protected] Copyright Siemens AG 2010.
PostgreSQL Performance Characteristics on Joyent and Amazon EC2
OVERVIEW In today's big data world, high performance databases are not only required but are a major part of any critical business function. With the advent of mobile devices, users are consuming data
Network Infrastructure Services CS848 Project
Quality of Service Guarantees for Cloud Services CS848 Project presentation by Alexey Karyakin David R. Cheriton School of Computer Science University of Waterloo March 2010 Outline 1. Performance of cloud
CS 377: Operating Systems. Outline. A review of what you ve learned, and how it applies to a real operating system. Lecture 25 - Linux Case Study
CS 377: Operating Systems Lecture 25 - Linux Case Study Guest Lecturer: Tim Wood Outline Linux History Design Principles System Overview Process Scheduling Memory Management File Systems A review of what
BENCHMARKING CLOUD DATABASES CASE STUDY on HBASE, HADOOP and CASSANDRA USING YCSB
BENCHMARKING CLOUD DATABASES CASE STUDY on HBASE, HADOOP and CASSANDRA USING YCSB Planet Size Data!? Gartner s 10 key IT trends for 2012 unstructured data will grow some 80% over the course of the next
Performance Analysis: Benchmarking Public Clouds
Performance Analysis: Benchmarking Public Clouds Performance comparison of web server and database VMs on Internap AgileCLOUD and Amazon Web Services By Cloud Spectator March 215 PERFORMANCE REPORT WEB
IBM Tivoli Monitoring Version 6.3 Fix Pack 2. Infrastructure Management Dashboards for Servers Reference
IBM Tivoli Monitoring Version 6.3 Fix Pack 2 Infrastructure Management Dashboards for Servers Reference IBM Tivoli Monitoring Version 6.3 Fix Pack 2 Infrastructure Management Dashboards for Servers Reference
Direct NFS - Design considerations for next-gen NAS appliances optimized for database workloads Akshay Shah Gurmeet Goindi Oracle
Direct NFS - Design considerations for next-gen NAS appliances optimized for database workloads Akshay Shah Gurmeet Goindi Oracle Agenda Introduction Database Architecture Direct NFS Client NFS Server
Linux Performance Optimizations for Big Data Environments
Linux Performance Optimizations for Big Data Environments Dominique A. Heger Ph.D. DHTechnologies (Performance, Capacity, Scalability) www.dhtusa.com Data Nubes (Big Data, Hadoop, ML) www.datanubes.com
A Comparative Study on Vega-HTTP & Popular Open-source Web-servers
A Comparative Study on Vega-HTTP & Popular Open-source Web-servers Happiest People. Happiest Customers Contents Abstract... 3 Introduction... 3 Performance Comparison... 4 Architecture... 5 Diagram...
Advances in Virtualization In Support of In-Memory Big Data Applications
9/29/15 HPTS 2015 1 Advances in Virtualization In Support of In-Memory Big Data Applications SCALE SIMPLIFY OPTIMIZE EVOLVE Ike Nassi [email protected] 9/29/15 HPTS 2015 2 What is the Problem We
HP Smart Array Controllers and basic RAID performance factors
Technical white paper HP Smart Array Controllers and basic RAID performance factors Technology brief Table of contents Abstract 2 Benefits of drive arrays 2 Factors that affect performance 2 HP Smart Array
Hyper-V vs ESX at the datacenter
Hyper-V vs ESX at the datacenter Gabrie van Zanten www.gabesvirtualworld.com GabesVirtualWorld Which hypervisor to use in the data center? Virtualisation has matured Virtualisation in the data center grows
Performance Evaluation of VMXNET3 Virtual Network Device VMware vsphere 4 build 164009
Performance Study Performance Evaluation of VMXNET3 Virtual Network Device VMware vsphere 4 build 164009 Introduction With more and more mission critical networking intensive workloads being virtualized
Application Performance Testing Basics
Application Performance Testing Basics ABSTRACT Todays the web is playing a critical role in all the business domains such as entertainment, finance, healthcare etc. It is much important to ensure hassle-free
How AWS Pricing Works May 2015
How AWS Pricing Works May 2015 (Please consult http://aws.amazon.com/whitepapers/ for the latest version of this paper) Page 1 of 15 Table of Contents Table of Contents... 2 Abstract... 3 Introduction...
Scalable Architecture on Amazon AWS Cloud
Scalable Architecture on Amazon AWS Cloud Kalpak Shah Founder & CEO, Clogeny Technologies [email protected] 1 * http://www.rightscale.com/products/cloud-computing-uses/scalable-website.php 2 Architect
Splunk for VMware Virtualization. Marco Bizzantino [email protected] Vmug - 05/10/2011
Splunk for VMware Virtualization Marco Bizzantino [email protected] Vmug - 05/10/2011 Collect, index, organize, correlate to gain visibility to all IT data Using Splunk you can identify problems,
Cloud computing - Architecting in the cloud
Cloud computing - Architecting in the cloud [email protected] 1 Outline Cloud computing What is? Levels of cloud computing: IaaS, PaaS, SaaS Moving to the cloud? Architecting in the cloud Best practices
0408 - Avoid Paying The Virtualization Tax: Deploying Virtualized BI 4.0 The Right Way. Ashish C. Morzaria, SAP
0408 - Avoid Paying The Virtualization Tax: Deploying Virtualized BI 4.0 The Right Way Ashish C. Morzaria, SAP LEARNING POINTS Understanding the Virtualization Tax : What is it, how it affects you How
Real-Time Scheduling 1 / 39
Real-Time Scheduling 1 / 39 Multiple Real-Time Processes A runs every 30 msec; each time it needs 10 msec of CPU time B runs 25 times/sec for 15 msec C runs 20 times/sec for 5 msec For our equation, A
MEASURING WORKLOAD PERFORMANCE IS THE INFRASTRUCTURE A PROBLEM?
MEASURING WORKLOAD PERFORMANCE IS THE INFRASTRUCTURE A PROBLEM? Ashutosh Shinde Performance Architect [email protected] Validating if the workload generated by the load generating tools is applied
Agenda. Enterprise Application Performance Factors. Current form of Enterprise Applications. Factors to Application Performance.
Agenda Enterprise Performance Factors Overall Enterprise Performance Factors Best Practice for generic Enterprise Best Practice for 3-tiers Enterprise Hardware Load Balancer Basic Unix Tuning Performance
Accelerating Web-Based SQL Server Applications with SafePeak Plug and Play Dynamic Database Caching
Accelerating Web-Based SQL Server Applications with SafePeak Plug and Play Dynamic Database Caching A SafePeak Whitepaper February 2014 www.safepeak.com Copyright. SafePeak Technologies 2014 Contents Objective...
Amazon Cloud Storage Options
Amazon Cloud Storage Options Table of Contents 1. Overview of AWS Storage Options 02 2. Why you should use the AWS Storage 02 3. How to get Data into the AWS.03 4. Types of AWS Storage Options.03 5. Object
The Data Placement Challenge
The Data Placement Challenge Entire Dataset Applications Active Data Lowest $/IOP Highest throughput Lowest latency 10-20% Right Place Right Cost Right Time 100% 2 2 What s Driving the AST Discussion?
How AWS Pricing Works
How AWS Pricing Works (Please consult http://aws.amazon.com/whitepapers/ for the latest version of this paper) Page 1 of 15 Table of Contents Table of Contents... 2 Abstract... 3 Introduction... 3 Fundamental
2 2011 Oracle Corporation Proprietary and Confidential
The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material,
Using Iometer to Show Acceleration Benefits for VMware vsphere 5.5 with FlashSoft Software 3.7
Using Iometer to Show Acceleration Benefits for VMware vsphere 5.5 with FlashSoft Software 3.7 WHITE PAPER Western Digital Technologies, Inc. 951 SanDisk Drive, Milpitas, CA 95035 www.sandisk.com Table
I/O PERFORMANCE COMPARISON OF VMWARE VCLOUD HYBRID SERVICE AND AMAZON WEB SERVICES
I/O PERFORMANCE COMPARISON OF VMWARE VCLOUD HYBRID SERVICE AND AMAZON WEB SERVICES Businesses are rapidly transitioning to the public cloud to take advantage of on-demand resources and potential cost savings.
How To Choose Between A Relational Database Service From Aws.Com
The following text is partly taken from the Oracle book Middleware and Cloud Computing It is available from Amazon: http://www.amazon.com/dp/0980798000 Cloud Databases and Oracle When designing your cloud
Best Practices for Deploying Citrix XenDesktop on NexentaStor Open Storage
Best Practices for Deploying Citrix XenDesktop on NexentaStor Open Storage White Paper July, 2011 Deploying Citrix XenDesktop on NexentaStor Open Storage Table of Contents The Challenges of VDI Storage
Portable Scale-Out Benchmarks for MySQL. MySQL User Conference 2008 Robert Hodges CTO Continuent, Inc.
Portable Scale-Out Benchmarks for MySQL MySQL User Conference 2008 Robert Hodges CTO Continuent, Inc. Continuent 2008 Agenda / Introductions / Scale-Out Review / Bristlecone Performance Testing Tools /
Container-based operating system virtualization: a scalable, high-performance alternative to hypervisors
Container-based operating system virtualization: a scalable, high-performance alternative to hypervisors Soltesz, et al (Princeton/Linux-VServer), Eurosys07 Context: Operating System Structure/Organization
