Universal Scalability Law
|
|
|
- Mae Katrina McDowell
- 10 years ago
- Views:
Transcription
1 Northern California CMG Meeting February 7, 2006 Scalability on a Stick Neil Gunther Performance Dynamics Company Copyright 2006 Performance Dynamics Company 1 Universal Scalability Law Virtual Load Testing Hardware scalability Amdahl s law Universal law of computer scalability Software scalability Same model applies Amdahl and the Repairman Theorem of mine from 2002 Proves that it is a physical theory Not just curve fitting games Copyright 2006 Performance Dynamics Company 2 1
2 Scaling versus Scalability Physical Scaling Laws Load mass 1/3 Limits to giantism Phase transitions Nanotechnology Computer Scalability Diminishing returns Internal overheads Multiprocessors Clusters, GRIDs Internet congestion Copyright 2006 Performance Dynamics Company 3 Scaling Characteristics Nature Computer Copyright 2006 Performance Dynamics Company 4 2
3 Commonality Both hardware and application scalability are two sides of the same coin (more later) Hardware Hold the number of users-per-processor fixed Capacity function of number of processors (p) Software Hold the number of processors fixed Capacity is a function of user load (N) Copyright 2006 Performance Dynamics Company 5 Hardware Scalability (CPUs) Predicted Scalability Krays/Second Modeled Measured CPUs (p) Copyright 2006 Performance Dynamics Company 6 3
4 Software Scaling (Users) Copyright 2006 Performance Dynamics Company 7 Scalability is Not a Number! Extreme Software Scaling, ACM Queue vol.3(no.7), Sep by R. MacDougall Demonstrating Amdahl speedup max speedup is 75% of linear Scalability is a function (not a number!) Diminishing returns (due to increasing overhead) appears as fall from linearity Want to express this fall from linearity as quantitative function Copyright 2006 Performance Dynamics Company 8 4
5 Why Should We Care? Old reason: Should always care about performance New reason: Multicores have arrived! Intel paxville 2 CPU cores (2 buffers) = 4 virtual CPUs Sun UltraSPARC T 8 CPU cores (4 buffers) = 32 virtual CPUs Copyright 2006 Performance Dynamics Company 9 Multiprocessing is Hard Welcome back to SMPs Just on a smaller scale Faster cycles Pyramid Technology Spent a lot of time with ISVs Teach them how to use SMPs Need to understand scalability Copyright 2006 Performance Dynamics Company 10 5
6 Universal Hardware Scalability Copyright 2006 Performance Dynamics Company 11 Naive Parallelism parallelizable Single processor execution time, T(1) time reduction p-processor execution time, T(1) / p ICCM 95 Copyright 2006 Performance Dynamics Company 12 6
7 Speedup Metric SN(p) = = = p T(1) T(p) T T/p Naïve parallelism means linear scalability Simple function of p-processors No diminishing returns (unrealistic) Mathematical definition of equal bang for the bucks Copyright 2006 Performance Dynamics Company 13 Equal Bang for the Buck C a p a c i t y Ideal Nearly Processors (P) Copyright 2006 Performance Dynamics Company 14 7
8 Sources of Serialization Ideal SMP scalability is linear but... Transaction may wait on: + Longer O/S code paths + Exchange of shared writeable-data between CPU caches + Data exchange between CPUs and main memory + Spin lock synchronization (serialization) of accesses to shared writeable data + Waiting for an I/O DMA to complete Nonlinear scaling more typical for OLTP Scalability curve is a function + But a function of what? Copyright 2006 Performance Dynamics Company 15 Effect of Diminishing Returns linear rising C a p a c i t y Ideal Real nonlinear falling Processors (P) Copyright 2006 Performance Dynamics Company 16 8
9 Amdahl s Law Single processor execution time parallelizable serial P - processor time reduction Serial portion can only be done by SINGLE processor The greater the parallelism, the greater the serial fraction Copyright 2006 Performance Dynamics Company 17 Derivation of Amdahl s Law Uniprocessor elapsed time: T(1) = T Fixed serial portion: σt Accessable parallel portion: (1 σ)t Elapsed time with p processors: T(p) = Serial portion + Parallel portion T(p) = σt + (1 σ)t/p Copyright 2006 Performance Dynamics Company 18 9
10 Amdahl Speedup! SA(p) = T(1) T(p) = = T "T + (1#")T p p "p + (1#") This equation is known as Amdahl s law p SA( p) = 1+ "(p #1) Copyright 2006 Performance Dynamics Company 19! Amdahl s Speedup Curve p SA( p) = 1+ "(p #1)! Diminishing returns are due to the increasing relative serial fraction of the execution time: σt(1) Serial component of workload dominates at large p Copyright 2006 Performance Dynamics Company 20 10
11 Parallel Scaleup Amdahl s law (1967) assumes there is a fixed amount of work which is partitioned across p-processors. One way to defeat Amdahl serialization is to increase the amount of work in proportion to the number of processors (demonstrated in 1987). Generally very difficult to achieve in practice. Only for special workloads. Can still be defeated by comms overhead. Copyright 2006 Performance Dynamics Company 21 Multi-user Scaleup Per user response time, T Uniprocessor elapse time ( p - 1) σ T Multiprocessor elapse time Contention with users on other ( p - 1) processors The greater the concurrency, the greater the contention Copyright 2006 Performance Dynamics Company 22 11
12 Derivation of Multi-user Scaleup Fixed number of tx s per processor: C Uniprocessor elapsed time: T(1) = T (as before) Uniprocessor throughput: X(1) = C/T Contention from each added processor: σt Elapsed time with p processors: T(p) = T + (p 1) σ T SMP throughput with p processors: X(p) = p [C/T(p)] Copyright 2006 Performance Dynamics Company 23 Relative SMP Capacity C(p) = X(p) X(1) = pc T T(p) C = pt T + (p -1)"T p = 1+ "(p #1) Identical to Amdahl s Law!!!! Copyright 2006 Performance Dynamics Company 24 12
13 Super Serial Scaleup Uniprocessor elapse time ( p - 1) ( σ T ) Multiprocessor elapse time Coherency delay is smaller than serial contention but... it grows in proportion to (p x p) processors! Serial contention with p-1 processors The greater the parallelism, the faster the coherency delay grows! Copyright 2006 Performance Dynamics Company 25 Super-Serial Capacity Fixed number of tx s per user: C Uniprocessor elapsed time: T Delay for each added processor: σt + λpt SMP elapsed time: T(p) = T + (p 1)σT + σ (p 1) λpt Same algebra as before produces: C(",#,p) = p 1+ "[(p -1) + #p(p -1)] Copyright 2006 Performance Dynamics Company 26! 13
14 Super-Serial Characteristic 25 S c a l e u p Processors (P) The downward arrow indicates retrograde reduction in performance as the fraction of coherency work (per processor) increases. Note the appearance of a MAXIMUM in the capacity curve. Copyright 2006 Performance Dynamics Company 27 Capacity Ceiling MAXIMUM throughput occurs at: If λ = 0, Pmax : Best case (Amdahl scaling; no maximum) As λ, Pmax 0:! p = max % % & 1"# ' ' # $ ( Worst case (moves toward origin) Copyright 2006 Performance Dynamics Company 28 14
15 The Three C s Concurrency Degree of inherent parallelism in the work Contention Degree of serialization in the OLTP workload Coherency Degree of induced delay for data consistency maintenance (analogous to page-faulting in VM memory) Copyright 2006 Performance Dynamics Company 29 How Do We Find σ and λ? Might we to look at the instrunction level (like Amdahl) to determine parameters. This is the brute force method. There is a smarter way: Measure throughput X(p) for a few (p=4-6) configurations Renormalize these data into appropriate ratios Fit those data as a quadratic function: {a, b, c} Compute the parameters {σ, λ} for the universal scalability model Use them to generate the full scaling curve Copyright 2006 Performance Dynamics Company 30 15
16 Statistics Is Your Friend Find trends in raw performance data Trends can be used to predict growth Methods include: Regression Time-independent data Time Series Analysis Time-dependent data ( auto-correlated ) ANOVA: ANalysis Of VAriance Quantify degree of interaction between otherwise random performance metrics Copyright 2006 Performance Dynamics Company 31 Why I Use EXCEL? Ubiquitous Cheap Mgmt. desktops already running MS Office Internet news groups for EXCEL/VBA help Programmable VBA, object-oriented, debugging, journaling, good prototyping tool Statistical analysis functions (Regression, ANOVA, ARIMA) Don t always know which functions are needed Plotting is integrated with VBA Printing Single push-button (also has HTML option) Use those statistical tools with which you re comfortable Copyright 2006 Performance Dynamics Company 32 16
17 Image Processing Benchmark SGI Origin2K Ray Tracing B/mark Data KRays per Second CPUs (p) Copyright 2006 Performance Dynamics Company 33 Predicting Scalability: Step 1 Measured CPU (p) KR ays/sec X(p) Copyright 2006 Performance Dynamics Company 34 17
18 Step 2: Transformed Data R elcap Efficiency Inverse C=X(p)/X(1) C/p p/c Copyright 2006 Performance Dynamics Company 35 Step 3: Regression Analysis Choose Polynomial degree 2 (quadratic) Check Set intercept Set = 0 Check Show Equation Check Show R 2 Fit Transform p-1 (p/c) Copyright 2006 Performance Dynamics Company 36 18
19 EXCEL Trendline Type Copyright 2006 Performance Dynamics Company 37 EXCEL - Trendline Options Copyright 2006 Performance Dynamics Company 38 19
20 Deviation from Linearity Quadratic Regression 3.5 Deviation from Linearity y = 7E-05x x R 2 = Effective CPUs (p - 1) Copyright 2006 Performance Dynamics Company 39 Step 4: Trendline Parameters Enter Trendline coefficients in EXCEL Note that coefficient c = 0 Trendline Quadratic a Parameters Coefficients E-05 b c Copyright 2006 Performance Dynamics Company 40 20
21 Parameter Relationships " = b # a $ = a b # a p max = Floor( 1#" " $ ) p opt = 1 " Copyright 2006 Performance Dynamics Company 41! Step 5: Model Parameters Super Parameter Serial Values σ λ Pmax 377 Popt 22 Contention Coherency Copyright 2006 Performance Dynamics Company 42 21
22 Step 6: Predictions Predicted Capacity p C(p) Modeled Measured %Error Negative errors => model underestimates data Copyright 2006 Performance Dynamics Company 43 Predicted Hardware Scalability Predicted Scalability Krays/Second Modeled Measured CPUs (p) Copyright 2006 Performance Dynamics Company 44 22
23 Conclusions Error < 10% over 64-way (good) Contention (σ): 4.64% This is relatively high Due to NUMA bus, compiler, etc. Coherency (λ): 0.02% Relatively low Few Cache misses, no paging, etc. Pmax: 377 is physically unachievable Popt: 22 poor for this application on a 64- way SMP Copyright 2006 Performance Dynamics Company 45 Summary of Technique Measure appln. throughput X(p) vs. config (p) A sparse data sample (at least 4 data points) Calculate the capacity ratio C(p), the efficiency C/p, and its inverse p/c, from the data. Calculate the Quadratic transform Perform regression fit on the Quadratic transform Use EXCEL Trendline + Options dialog box Calculate the parameters {σ, λ} from the regression coefficients {a, b, c = 0} Use the values of σ and λ to predict the complete scalability function, C(σ, λ, p) Copyright 2006 Performance Dynamics Company 46 23
24 Why Do I Invert the Data? EXCEL is too dumb to fit data to a nonlinear equation like my universal scaling equation (USL) More sophisticated tools like Mathematica, S+ and R can do it Inverting the data corresponds to inverting my USL formula Just makes it easy for EXCEL to fit a 2 nd degree polynomial Copyright 2006 Performance Dynamics Company 47 Universal Software Scalability Copyright 2006 Performance Dynamics Company 48 24
25 General Application Scalability Rename variables and parameters p N σ α λ β Otherwise, the same as before N C(N,",#) = 1+ $[(N %1) + &N(N %1)]! Copyright 2006 Performance Dynamics Company 49 CASE STUDY (I): Login Certification for CRM Load test measurements using Load Runner from Mercury-Interactive at a well-known networking company in Silicon Valley Copyright 2006 Performance Dynamics Company 50 25
26 LoadRunner 7.5 Settings CRM application User login/authentification User searches for customer parameters (ORACLE) Transaction definition Login (1-shot) is incorporated in Init part of LR script Iterate on specific searches being evaluated Mean TPS calculated as Action_Tx / Duration_Seconds Each VU load must reach steady state mins. is common runtime per VU load Use Rendezvous Start/Stop VU 15 mins. Use Goal mode rather than Scenario mode (new in LR 7.5) Copyright 2006 Performance Dynamics Company 51 Throughput Measurements Measured VUs (N) TPS X(N) Searches per Second Load Test Measurements VUsers (N) Copyright 2006 Performance Dynamics Company 52 26
27 Regression Analysis 7.0 Quadratic Transform y = x x R 2 = Deviation from Linear Effective Users (N- 1) Copyright 2006 Performance Dynamics Company 53 Parameters Mappings Trendline Parameters Super Serial Quadratic Coefficients Parameter Values a 2.50E-03 α b β c Nmax 19 Nopt 5 Copyright 2006 Performance Dynamics Company 54 27
28 Predicted Scalability Predicted Capacity VUs C(N) Modeled Measured Copyright 2006 Performance Dynamics Company 55 Predicted and Modeled X(N) Predicted Scalability 1.20 Searches per Second Modeled Measured LR 7,5 VUsers Copyright 2006 Performance Dynamics Company 56 28
29 Response Time Measurements Predicted Delay Percent VU Modelled Measured Error N R(N) =! X(N) Z Copyright 2006 Performance Dynamics Company 57 Predicted Application Delay Predicted Delay (Z=0) Seconds Modelled Measured LR 7.5 VUsers Copyright 2006 Performance Dynamics Company 58 29
30 Conclusions Error < 2% on throughput (v. good) Error < 4% on latency (excellent!) Contention (α): 20.26% Extremely high ODBC calls need serious revision! Coherency (β): 1.23% Relatively high DBMS cache misses? Nmax: 19 is severely bottlenecked Nopt: 5 users is untenable in production Copyright 2006 Performance Dynamics Company 59 CASE STUDY (II): Multi-tier Scalability Based on a CMG paper Copyright 2006 Performance Dynamics Company 60 30
31 Reference Based on CMG 2001 paper by Buch & Pentkovski, Experience of Characterization of Typical Multi-tier e- Business Systems Using Operational Analysis This is an excellent paper I analyze this model using PDQ in my other book Analyzing Performance with Perl::PDQ, Springer 2005 Here, I use the Universal Scaling Law instead (much easier but not as insightful) Copyright 2006 Performance Dynamics Company 61 Multi-tier Model Classic 3-tier architecture Only show 1 queue per tier Measured using MS-WAS (not LR) Copyright 2006 Performance Dynamics Company 62 31
32 Measured Throughput VUs (N) X(N) WAS Measurements GETs per Sec. (X) Clients (N) Copyright 2006 Performance Dynamics Company 63 Deviation from Linearity Quadratic Regression Linear Deviation y = x x R 2 = Effective Users (N- 1) Copyright 2006 Performance Dynamics Company 64 32
33 Model Parameters Trendline Parameters Super Serial Quadratic Coefficients Parameter Values a 7.50E-03 α b β c Nmax 10 Nopt 15 Copyright 2006 Performance Dynamics Company 65 Predicted Throughput Predicted Capacity Percent VUs C(N) Modeled Measured Error Copyright 2006 Performance Dynamics Company 66 33
34 Retrograde Scalability Predicted Scalability 125 GETS per Sec. (GPS) Modeled Measured MS WAS VUsers Copyright 2006 Performance Dynamics Company 67 Predicted Response Times Predicted Delay Percent VU Modelled Measured Error Copyright 2006 Performance Dynamics Company 68 34
35 Response Scalability Predicted Delay (Z=0) Seconds Modelled Measured MS WAS VUsers Copyright 2006 Performance Dynamics Company 69 Conclusions Contention (α): 6.8% Very high Middleware need serious tuning Coherency (β): 11.3% Incredibly high! Some kind of thrashing is occurring Nmax: 10 is poor Nopt: 15 can t be achieved Recommend you compare their CMG 2001 paper with Chap. 10 in my PPDQ book Copyright 2006 Performance Dynamics Company 70 35
36 Application Scaling Example Wintel platform NT 4.0 O/S MS-SQLS Fixed CPU configs Vary Vuser load Copyright 2006 Performance Dynamics Company 71 SQLServer Scalability Plotted NT SQLS TPS w ay 6-w ay 8-w ay VUsers Copyright 2006 Performance Dynamics Company 72 36
37 4-Way Parameters Trendline Parameters Super Serial Quadratic Coefficients Parameter Values a 2.30E-03 α b β c Nmax 20 Nopt 13 After performing the same kind of regression analysis we did for hardware scalability Copyright 2006 Performance Dynamics Company 73 6-Way Parameters Trendline Parameters Super Serial Quadratic Coefficients Parameter Values a 2.00E-04 α b β c Nmax 70 Nopt 13 Copyright 2006 Performance Dynamics Company 74 37
38 8-Way Parameters Trendline Parameters Super Serial Quadratic Coefficients Parameter Values a 2.00E-05 α b β c Nmax 223 Nopt 18 Copyright 2006 Performance Dynamics Company 75 Modeled Scalability Curves Copyright 2006 Performance Dynamics Company 76 38
39 Amdahl and the Repairman Theorem: Amdahl s law for parallel speedup is equivalent to the synchronous throughput bound of the repairman queueing model --njg (2002) Copyright 2006 Performance Dynamics Company 77 Classic Repairman Model Machines in assembly line Some are up and working Some are down for repairs Repair station Copyright 2006 Performance Dynamics Company 78 39
40 Multiuser (Timeshare) Model Machines -> users at their terminals Repair station -> computing resource Copyright 2006 Performance Dynamics Company 79 Multiprocessor Bus Model Machines -> Processors Repair station -> communication bus Copyright 2006 Performance Dynamics Company 80 40
41 Synchronous Queueing Amdahl s law == Synchronous queueing bound Serial fraction: σ = D / (D + Z) Copyright 2006 Performance Dynamics Company 81 Load-Dependent Bus Service rate is a function of the load Load measured by queue length Copyright 2006 Performance Dynamics Company 82 41
42 Retrograde Throughput Commonly seen in application load testing Difficult to model in queueing theory Copyright 2006 Performance Dynamics Company 83 Virtual Load Testing 1. Measure appln. throughput X(N) vs. load (N) 2. A sparse data sample (> 4 load points) is OK 3. Calculate the capacity ratio C(N), the efficiency C/N, and its inverse N/C, from the data. 4. Calculate the Quadratic transform 5. Perform regression fit on the Quadratic transform 6. Use EXCEL Trendline + Options dialog box 7. Calculate the parameters {α, β} from the regression coefficients {a, b, c = 0} 8. Use the values of α and β to predict the complete scalability function, C(N) Copyright 2006 Performance Dynamics Company 84 42
43 Bibliography G. Amdahl, Validity to the Single Processor Approach to Achieving Large Scale Computing Capabilities, AFIPS Conference, 1967 N. Gunther, A New Interpretation of Amdahl s law and Geometric Scalability, October 2002 N. Gunther, Guerrilla Capacity Planning, Springer-Verlag, 2006 (in press) The Guerrilla Manual is always available on my website Copyright 2006 Performance Dynamics Company 85 43
Symmetric Multiprocessing
Multicore Computing A multi-core processor is a processing system composed of two or more independent cores. One can describe it as an integrated circuit to which two or more individual processors (called
QSEM SM : Quantitative Scalability Evaluation Method
Copyright 2005, PerfX and Performance Engineering Services. All rights reserved. QSEM SM : Quantitative Scalability Evaluation Method Lloyd G. Williams, Ph.D. PerfX 2345 Dogwood Circle Louisville, Colorado
Case Study I: A Database Service
Case Study I: A Database Service Prof. Daniel A. Menascé Department of Computer Science George Mason University www.cs.gmu.edu/faculty/menasce.html 1 Copyright Notice Most of the figures in this set of
Chapter 18: Database System Architectures. Centralized Systems
Chapter 18: Database System Architectures! Centralized Systems! Client--Server Systems! Parallel Systems! Distributed Systems! Network Types 18.1 Centralized Systems! Run on a single computer system and
Throughput Capacity Planning and Application Saturation
Throughput Capacity Planning and Application Saturation Alfred J. Barchi [email protected] http://www.ajbinc.net/ Introduction Applications have a tendency to be used more heavily by users over time, as the
Parallelism and Cloud Computing
Parallelism and Cloud Computing Kai Shen Parallel Computing Parallel computing: Process sub tasks simultaneously so that work can be completed faster. For instances: divide the work of matrix multiplication
Performance And Scalability In Oracle9i And SQL Server 2000
Performance And Scalability In Oracle9i And SQL Server 2000 Presented By : Phathisile Sibanda Supervisor : John Ebden 1 Presentation Overview Project Objectives Motivation -Why performance & Scalability
Capacity Management for Oracle Database Machine Exadata v2
Capacity Management for Oracle Database Machine Exadata v2 Dr. Boris Zibitsker, BEZ Systems NOCOUG 21 Boris Zibitsker Predictive Analytics for IT 1 About Author Dr. Boris Zibitsker, Chairman, CTO, BEZ
Understanding the Benefits of IBM SPSS Statistics Server
IBM SPSS Statistics Server Understanding the Benefits of IBM SPSS Statistics Server Contents: 1 Introduction 2 Performance 101: Understanding the drivers of better performance 3 Why performance is faster
Performance Characteristics of VMFS and RDM VMware ESX Server 3.0.1
Performance Study Performance Characteristics of and RDM VMware ESX Server 3.0.1 VMware ESX Server offers three choices for managing disk access in a virtual machine VMware Virtual Machine File System
Parallel Algorithm Engineering
Parallel Algorithm Engineering Kenneth S. Bøgh PhD Fellow Based on slides by Darius Sidlauskas Outline Background Current multicore architectures UMA vs NUMA The openmp framework Examples Software crisis
<Insert Picture Here> An Experimental Model to Analyze OpenMP Applications for System Utilization
An Experimental Model to Analyze OpenMP Applications for System Utilization Mark Woodyard Principal Software Engineer 1 The following is an overview of a research project. It is intended
Virtuoso and Database Scalability
Virtuoso and Database Scalability By Orri Erling Table of Contents Abstract Metrics Results Transaction Throughput Initializing 40 warehouses Serial Read Test Conditions Analysis Working Set Effect of
VirtualCenter Database Performance for Microsoft SQL Server 2005 VirtualCenter 2.5
Performance Study VirtualCenter Database Performance for Microsoft SQL Server 2005 VirtualCenter 2.5 VMware VirtualCenter uses a database to store metadata on the state of a VMware Infrastructure environment.
System Models for Distributed and Cloud Computing
System Models for Distributed and Cloud Computing Dr. Sanjay P. Ahuja, Ph.D. 2010-14 FIS Distinguished Professor of Computer Science School of Computing, UNF Classification of Distributed Computing Systems
Business Application Services Testing
Business Application Services Testing Curriculum Structure Course name Duration(days) Express 2 Testing Concept and methodologies 3 Introduction to Performance Testing 3 Web Testing 2 QTP 5 SQL 5 Load
Parallel Scalable Algorithms- Performance Parameters
www.bsc.es Parallel Scalable Algorithms- Performance Parameters Vassil Alexandrov, ICREA - Barcelona Supercomputing Center, Spain Overview Sources of Overhead in Parallel Programs Performance Metrics for
Software Performance and Scalability
Software Performance and Scalability A Quantitative Approach Henry H. Liu ^ IEEE )computer society WILEY A JOHN WILEY & SONS, INC., PUBLICATION Contents PREFACE ACKNOWLEDGMENTS xv xxi Introduction 1 Performance
Introduction 1 Performance on Hosted Server 1. Benchmarks 2. System Requirements 7 Load Balancing 7
Introduction 1 Performance on Hosted Server 1 Figure 1: Real World Performance 1 Benchmarks 2 System configuration used for benchmarks 2 Figure 2a: New tickets per minute on E5440 processors 3 Figure 2b:
Oracle Database Scalability in VMware ESX VMware ESX 3.5
Performance Study Oracle Database Scalability in VMware ESX VMware ESX 3.5 Database applications running on individual physical servers represent a large consolidation opportunity. However enterprises
Centralized Systems. A Centralized Computer System. Chapter 18: Database System Architectures
Chapter 18: Database System Architectures Centralized Systems! Centralized Systems! Client--Server Systems! Parallel Systems! Distributed Systems! Network Types! Run on a single computer system and do
Introduction to Cloud Computing
Introduction to Cloud Computing Parallel Processing I 15 319, spring 2010 7 th Lecture, Feb 2 nd Majd F. Sakr Lecture Motivation Concurrency and why? Different flavors of parallel computing Get the basic
Agenda. Enterprise Application Performance Factors. Current form of Enterprise Applications. Factors to Application Performance.
Agenda Enterprise Performance Factors Overall Enterprise Performance Factors Best Practice for generic Enterprise Best Practice for 3-tiers Enterprise Hardware Load Balancer Basic Unix Tuning Performance
Chapter 12: Multiprocessor Architectures. Lesson 01: Performance characteristics of Multiprocessor Architectures and Speedup
Chapter 12: Multiprocessor Architectures Lesson 01: Performance characteristics of Multiprocessor Architectures and Speedup Objective Be familiar with basic multiprocessor architectures and be able to
Liferay Portal Performance. Benchmark Study of Liferay Portal Enterprise Edition
Liferay Portal Performance Benchmark Study of Liferay Portal Enterprise Edition Table of Contents Executive Summary... 3 Test Scenarios... 4 Benchmark Configuration and Methodology... 5 Environment Configuration...
Performance Tuning and Optimizing SQL Databases 2016
Performance Tuning and Optimizing SQL Databases 2016 http://www.homnick.com [email protected] +1.561.988.0567 Boca Raton, Fl USA About this course This four-day instructor-led course provides students
Performance metrics for parallel systems
Performance metrics for parallel systems S.S. Kadam C-DAC, Pune [email protected] C-DAC/SECG/2006 1 Purpose To determine best parallel algorithm Evaluate hardware platforms Examine the benefits from parallelism
CS550. Distributed Operating Systems (Advanced Operating Systems) Instructor: Xian-He Sun
CS550 Distributed Operating Systems (Advanced Operating Systems) Instructor: Xian-He Sun Email: [email protected], Phone: (312) 567-5260 Office hours: 2:10pm-3:10pm Tuesday, 3:30pm-4:30pm Thursday at SB229C,
<Insert Picture Here> Best Practices for Extreme Performance with Data Warehousing on Oracle Database
1 Best Practices for Extreme Performance with Data Warehousing on Oracle Database Rekha Balwada Principal Product Manager Agenda Parallel Execution Workload Management on Data Warehouse
Deciding which process to run. (Deciding which thread to run) Deciding how long the chosen process can run
SFWR ENG 3BB4 Software Design 3 Concurrent System Design 2 SFWR ENG 3BB4 Software Design 3 Concurrent System Design 11.8 10 CPU Scheduling Chapter 11 CPU Scheduling Policies Deciding which process to run
Real-Time Scheduling 1 / 39
Real-Time Scheduling 1 / 39 Multiple Real-Time Processes A runs every 30 msec; each time it needs 10 msec of CPU time B runs 25 times/sec for 15 msec C runs 20 times/sec for 5 msec For our equation, A
2009 Oracle Corporation 1
The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material,
Making Multicore Work and Measuring its Benefits. Markus Levy, president EEMBC and Multicore Association
Making Multicore Work and Measuring its Benefits Markus Levy, president EEMBC and Multicore Association Agenda Why Multicore? Standards and issues in the multicore community What is Multicore Association?
An Oracle White Paper March 2013. Load Testing Best Practices for Oracle E- Business Suite using Oracle Application Testing Suite
An Oracle White Paper March 2013 Load Testing Best Practices for Oracle E- Business Suite using Oracle Application Testing Suite Executive Overview... 1 Introduction... 1 Oracle Load Testing Setup... 2
Lecture 3: Evaluating Computer Architectures. Software & Hardware: The Virtuous Cycle?
Lecture 3: Evaluating Computer Architectures Announcements - Reminder: Homework 1 due Thursday 2/2 Last Time technology back ground Computer elements Circuits and timing Virtuous cycle of the past and
Application of Predictive Analytics for Better Alignment of Business and IT
Application of Predictive Analytics for Better Alignment of Business and IT Boris Zibitsker, PhD [email protected] July 25, 2014 Big Data Summit - Riga, Latvia About the Presenter Boris Zibitsker
Learning More About Load Testing
Welcome to this introduction to application performance testing and the LoadRunner load testing solution. This document provides a short overview of LoadRunner s features, and includes the following sections:
Principles and characteristics of distributed systems and environments
Principles and characteristics of distributed systems and environments Definition of a distributed system Distributed system is a collection of independent computers that appears to its users as a single
Performance Prediction, Sizing and Capacity Planning for Distributed E-Commerce Applications
Performance Prediction, Sizing and Capacity Planning for Distributed E-Commerce Applications by Samuel D. Kounev ([email protected]) Information Technology Transfer Office Abstract Modern e-commerce
CPS104 Computer Organization and Programming Lecture 18: Input-Output. Robert Wagner
CPS104 Computer Organization and Programming Lecture 18: Input-Output Robert Wagner cps 104 I/O.1 RW Fall 2000 Outline of Today s Lecture The I/O system Magnetic Disk Tape Buses DMA cps 104 I/O.2 RW Fall
Operating Systems 4 th Class
Operating Systems 4 th Class Lecture 1 Operating Systems Operating systems are essential part of any computer system. Therefore, a course in operating systems is an essential part of any computer science
How To Understand The Concept Of A Distributed System
Distributed Operating Systems Introduction Ewa Niewiadomska-Szynkiewicz and Adam Kozakiewicz [email protected], [email protected] Institute of Control and Computation Engineering Warsaw University of
Full and Para Virtualization
Full and Para Virtualization Dr. Sanjay P. Ahuja, Ph.D. 2010-14 FIS Distinguished Professor of Computer Science School of Computing, UNF x86 Hardware Virtualization The x86 architecture offers four levels
10.04.2008. Thomas Fahrig Senior Developer Hypervisor Team. Hypervisor Architecture Terminology Goals Basics Details
Thomas Fahrig Senior Developer Hypervisor Team Hypervisor Architecture Terminology Goals Basics Details Scheduling Interval External Interrupt Handling Reserves, Weights and Caps Context Switch Waiting
Fixed Price Website Load Testing
Fixed Price Website Load Testing Can your website handle the load? Don t be the last one to know. For as low as $4,500, and in many cases within one week, we can remotely load test your website and report
18-742 Lecture 4. Parallel Programming II. Homework & Reading. Page 1. Projects handout On Friday Form teams, groups of two
age 1 18-742 Lecture 4 arallel rogramming II Spring 2005 rof. Babak Falsafi http://www.ece.cmu.edu/~ece742 write X Memory send X Memory read X Memory Slides developed in part by rofs. Adve, Falsafi, Hill,
What Is Specific in Load Testing?
What Is Specific in Load Testing? Testing of multi-user applications under realistic and stress loads is really the only way to ensure appropriate performance and reliability in production. Load testing
Scaling from 1 PC to a super computer using Mascot
Scaling from 1 PC to a super computer using Mascot 1 Mascot - options for high throughput When is more processing power required? Options: Multiple Mascot installations Using a large Unix server Clusters
A closer look at HP LoadRunner software
Technical white paper A closer look at HP LoadRunner software Table of contents Sizing up the system 2 The limits of manual testing 2 A new take on testing: the HP LoadRunner solution 3 The HP LoadRunner
Real-Time Operating Systems for MPSoCs
Real-Time Operating Systems for MPSoCs Hiroyuki Tomiyama Graduate School of Information Science Nagoya University http://member.acm.org/~hiroyuki MPSoC 2009 1 Contributors Hiroaki Takada Director and Professor
Benchmarking Hadoop & HBase on Violin
Technical White Paper Report Technical Report Benchmarking Hadoop & HBase on Violin Harnessing Big Data Analytics at the Speed of Memory Version 1.0 Abstract The purpose of benchmarking is to show advantages
Performance Baseline of Oracle Exadata X2-2 HR HC. Part II: Server Performance. Benchware Performance Suite Release 8.4 (Build 130630) September 2013
Performance Baseline of Oracle Exadata X2-2 HR HC Part II: Server Performance Benchware Performance Suite Release 8.4 (Build 130630) September 2013 Contents 1 Introduction to Server Performance Tests 2
Quiz for Chapter 1 Computer Abstractions and Technology 3.10
Date: 3.10 Not all questions are of equal difficulty. Please review the entire quiz first and then budget your time carefully. Name: Course: Solutions in Red 1. [15 points] Consider two different implementations,
CPU Scheduling. Core Definitions
CPU Scheduling General rule keep the CPU busy; an idle CPU is a wasted CPU Major source of CPU idleness: I/O (or waiting for it) Many programs have a characteristic CPU I/O burst cycle alternating phases
Virtualization Performance on SGI UV 2000 using Red Hat Enterprise Linux 6.3 KVM
White Paper Virtualization Performance on SGI UV 2000 using Red Hat Enterprise Linux 6.3 KVM September, 2013 Author Sanhita Sarkar, Director of Engineering, SGI Abstract This paper describes how to implement
Identify and control performance and capacity risks. Introduction... 2
Application performance testing in VMware environments Identify and control performance and capacity risks Table of contents Introduction... 2 Performance and capacity planning techniques... 2 Rough sizing
High Performance Computing. Course Notes 2007-2008. HPC Fundamentals
High Performance Computing Course Notes 2007-2008 2008 HPC Fundamentals Introduction What is High Performance Computing (HPC)? Difficult to define - it s a moving target. Later 1980s, a supercomputer performs
JBoss Seam Performance and Scalability on Dell PowerEdge 1855 Blade Servers
JBoss Seam Performance and Scalability on Dell PowerEdge 1855 Blade Servers Dave Jaffe, PhD, Dell Inc. Michael Yuan, PhD, JBoss / RedHat June 14th, 2006 JBoss Inc. 2006 About us Dave Jaffe Works for Dell
Evaluation Methodology of Converged Cloud Environments
Krzysztof Zieliński Marcin Jarząb Sławomir Zieliński Karol Grzegorczyk Maciej Malawski Mariusz Zyśk Evaluation Methodology of Converged Cloud Environments Cloud Computing Cloud Computing enables convenient,
Multiprocessor Scheduling and Scheduling in Linux Kernel 2.6
Multiprocessor Scheduling and Scheduling in Linux Kernel 2.6 Winter Term 2008 / 2009 Jun.-Prof. Dr. André Brinkmann [email protected] Universität Paderborn PC² Agenda Multiprocessor and
Vorlesung Rechnerarchitektur 2 Seite 178 DASH
Vorlesung Rechnerarchitektur 2 Seite 178 Architecture for Shared () The -architecture is a cache coherent, NUMA multiprocessor system, developed at CSL-Stanford by John Hennessy, Daniel Lenoski, Monica
DIABLO TECHNOLOGIES MEMORY CHANNEL STORAGE AND VMWARE VIRTUAL SAN : VDI ACCELERATION
DIABLO TECHNOLOGIES MEMORY CHANNEL STORAGE AND VMWARE VIRTUAL SAN : VDI ACCELERATION A DIABLO WHITE PAPER AUGUST 2014 Ricky Trigalo Director of Business Development Virtualization, Diablo Technologies
PostgreSQL Business Intelligence & Performance Simon Riggs CTO, 2ndQuadrant PostgreSQL Major Contributor
PostgreSQL Business Intelligence & Performance Simon Riggs CTO, 2ndQuadrant PostgreSQL Major Contributor The research leading to these results has received funding from the European Union's Seventh Framework
CHAPTER 1 INTRODUCTION
1 CHAPTER 1 INTRODUCTION 1.1 MOTIVATION OF RESEARCH Multicore processors have two or more execution cores (processors) implemented on a single chip having their own set of execution and architectural recourses.
MEASURING WORKLOAD PERFORMANCE IS THE INFRASTRUCTURE A PROBLEM?
MEASURING WORKLOAD PERFORMANCE IS THE INFRASTRUCTURE A PROBLEM? Ashutosh Shinde Performance Architect [email protected] Validating if the workload generated by the load generating tools is applied
Performance Analysis of Web based Applications on Single and Multi Core Servers
Performance Analysis of Web based Applications on Single and Multi Core Servers Gitika Khare, Diptikant Pathy, Alpana Rajan, Alok Jain, Anil Rawat Raja Ramanna Centre for Advanced Technology Department
Nexenta Performance Scaling for Speed and Cost
Nexenta Performance Scaling for Speed and Cost Key Features Optimize Performance Optimize Performance NexentaStor improves performance for all workloads by adopting commodity components and leveraging
How To Test A Web Server
Performance and Load Testing Part 1 Performance & Load Testing Basics Performance & Load Testing Basics Introduction to Performance Testing Difference between Performance, Load and Stress Testing Why Performance
Client/Server Computing Distributed Processing, Client/Server, and Clusters
Client/Server Computing Distributed Processing, Client/Server, and Clusters Chapter 13 Client machines are generally single-user PCs or workstations that provide a highly userfriendly interface to the
Control 2004, University of Bath, UK, September 2004
Control, University of Bath, UK, September ID- IMPACT OF DEPENDENCY AND LOAD BALANCING IN MULTITHREADING REAL-TIME CONTROL ALGORITHMS M A Hossain and M O Tokhi Department of Computing, The University of
TRACE PERFORMANCE TESTING APPROACH. Overview. Approach. Flow. Attributes
TRACE PERFORMANCE TESTING APPROACH Overview Approach Flow Attributes INTRODUCTION Software Testing Testing is not just finding out the defects. Testing is not just seeing the requirements are satisfied.
Objectives. Chapter 5: Process Scheduling. Chapter 5: Process Scheduling. 5.1 Basic Concepts. To introduce CPU scheduling
Objectives To introduce CPU scheduling To describe various CPU-scheduling algorithms Chapter 5: Process Scheduling To discuss evaluation criteria for selecting the CPUscheduling algorithm for a particular
Performance Testing of a Cloud Service
Performance Testing of a Cloud Service Trilesh Bhurtun, Junior Consultant, Capacitas Ltd Capacitas 2012 1 Introduction Objectives Environment Tests and Results Issues Summary Agenda Capacitas 2012 2 1
CPU Scheduling Outline
CPU Scheduling Outline What is scheduling in the OS? What are common scheduling criteria? How to evaluate scheduling algorithms? What are common scheduling algorithms? How is thread scheduling different
Maximize Performance and Scalability of RADIOSS* Structural Analysis Software on Intel Xeon Processor E7 v2 Family-Based Platforms
Maximize Performance and Scalability of RADIOSS* Structural Analysis Software on Family-Based Platforms Executive Summary Complex simulations of structural and systems performance, such as car crash simulations,
Performance Modeling and Analysis of a Database Server with Write-Heavy Workload
Performance Modeling and Analysis of a Database Server with Write-Heavy Workload Manfred Dellkrantz, Maria Kihl 2, and Anders Robertsson Department of Automatic Control, Lund University 2 Department of
Cellular Computing on a Linux Cluster
Cellular Computing on a Linux Cluster Alexei Agueev, Bernd Däne, Wolfgang Fengler TU Ilmenau, Department of Computer Architecture Topics 1. Cellular Computing 2. The Experiment 3. Experimental Results
Lecture 23: Multiprocessors
Lecture 23: Multiprocessors Today s topics: RAID Multiprocessor taxonomy Snooping-based cache coherence protocol 1 RAID 0 and RAID 1 RAID 0 has no additional redundancy (misnomer) it uses an array of disks
Removing Performance Bottlenecks in Databases with Red Hat Enterprise Linux and Violin Memory Flash Storage Arrays. Red Hat Performance Engineering
Removing Performance Bottlenecks in Databases with Red Hat Enterprise Linux and Violin Memory Flash Storage Arrays Red Hat Performance Engineering Version 1.0 August 2013 1801 Varsity Drive Raleigh NC
Chapter 2 Parallel Architecture, Software And Performance
Chapter 2 Parallel Architecture, Software And Performance UCSB CS140, T. Yang, 2014 Modified from texbook slides Roadmap Parallel hardware Parallel software Input and output Performance Parallel program
HP ProLiant BL660c Gen9 and Microsoft SQL Server 2014 technical brief
Technical white paper HP ProLiant BL660c Gen9 and Microsoft SQL Server 2014 technical brief Scale-up your Microsoft SQL Server environment to new heights Table of contents Executive summary... 2 Introduction...
LCMON Network Traffic Analysis
LCMON Network Traffic Analysis Adam Black Centre for Advanced Internet Architectures, Technical Report 79A Swinburne University of Technology Melbourne, Australia [email protected] Abstract The Swinburne
Mike Chyi, Micro Focus Solution Consultant May 12, 2010
Mike Chyi, Micro Focus Solution Consultant May 12, 2010 Agenda Load Testing Overview, Best Practice: Performance Testing with Diagnostics Demo (?), Q&A Load Testing Overview What is load testing? Type
Performance evaluation
Performance evaluation Arquitecturas Avanzadas de Computadores - 2547021 Departamento de Ingeniería Electrónica y de Telecomunicaciones Facultad de Ingeniería 2015-1 Bibliography and evaluation Bibliography
Recommended hardware system configurations for ANSYS users
Recommended hardware system configurations for ANSYS users The purpose of this document is to recommend system configurations that will deliver high performance for ANSYS users across the entire range
