Rigorous Performance Testing on the Web
Grant Ellis, Senior Performance Architect, Instart Logic

Who is Instart Logic?
- Software company focused on Application Delivery
- We work with globally known brands whose business depends on performance, and make their sites and apps really fast
- Team includes big data, virtualization and web performance experts from Google, Facebook, Akamai, Cisco, Citrix, VMware, and Aster Data

Who Uses Instart Logic?

Response Time: 4.98 seconds
- How was the data collected? Aggregated? Normalized?
- What is response time? What does that mean for the users?
- Did any actual human beings see this response time?
- What devices/browsers were used? Laptop? Phone? Tablet?
- Where were the users located?

Performance Testing: Two Truths
1. Methodology matters more than results.
2. Statistical analysis can (and sometimes does) lie.
It is really easy to make great results look poor, or to make poor results look great, either deliberately or accidentally.

Table of Contents
- The Internet, The Bottleneck, and The Test: A brief history
- Last-Mile Performance Tools (It's dangerous to go alone!)
- Now I have data... Lots of data
- But, wait, there's more (data)!
- Need more? Meet the CDF.
- Tie it all together

First: A quick network primer! Need For Speed: Packet Edition, created by Raphaël Luta http://www.aptiwan.com/packetstory/

The Internet, The Bottleneck, and The Test: A brief history
The Dawn of the (World Wide) Web
- Adoption viable for commerce and business
Performance detractors:
- Weak server hardware
- Clumsy scaling technology
- Poor first-mile connectivity
Primary bottlenecks:
- Hardware
- First-mile connectivity

[Diagram: delivery path labeled Hardware, First Mile, Middle Mile, and Last Mile, with ADC and ISP; two bottlenecks marked.]
Repeatedly loads whole pages. Measured performance takes into account the page, the embedded objects, and the server latency introduced by a then-traditional three-tier architecture.

Data center scale was conquered.
Adoption on the web increased again:
- Google, Facebook, fully-baked e-commerce, others
- Governments digitized records and moved vital functions to the Web
Performance detractors:
- Middle-mile copper
- Congested switches
- Poorly maintained peering points
Primary bottlenecks:
- Middle-mile

[Diagram: delivery path labeled Hardware, First Mile, Middle Mile, and Last Mile, with ADC, ISP, and CDN; one bottleneck marked.]
Backbone products from Gomez and Keynote enable ongoing performance testing (e.g. monitoring) from multiple geographies at the same time.
Beware: Some content delivery networks have taken care to place their nodes on the same network, or even the same rack, as synthetic testing nodes. Look for unrealistically low response times in your embedded objects!

[Diagram: delivery path labeled Hardware, First Mile, Middle Mile, and Last Mile, with ADC, ISP, and CDN; one bottleneck marked.]
- Last-mile latency, packet loss
- Browser mechanics

The Application Delivery Challenge Today
[Chart: latency (ms), axis from 0 to 250, for Wired, LTE, WiFi, 4G, and 3G connections.]
Source: High Performance Browser Networking by Ilya Grigorik, Figures 7-16 and 10-6. Available for free online: http://chimera.labs.oreilly.com/books/1230000000545/index.html

Last-Mile Performance Tools (It's dangerous to go alone!)
JMeter and LoadRunner measure:
- From a single geography (usually on-premise)
- With a single browser
Keynote backbone / Gomez backbone:
- Report only on average
- Use fixed (backbone) connectivity
- Still simulate data
None of the above measure:
- Multiple devices
- Multiple connection types
- True user experience
- Impact from wireless technologies
So, we need more tools!

Synthetic Testing
Pros:
- User Experience metrics
- Open source!
- Multiple device types
- Multiple connection types (traffic shaping)
- Great reports
- Captures waterfall diagrams
Cons:
- Limited analysis tools
- Difficult to monitor performance
- Platform stability
- It's still synthetic

Real User Monitoring (RUM): boomerang.js
Pros:
- True user experience
- Easy set-up
- Great browser support
- Multiple device types
- Multiple connection types
- Open source tools available
Cons:
- Requires live traffic
- Responsive, not preemptive
- Measurement impacts results
- Safari data is limited
- Outliers can be extreme and must be removed
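The slides say RUM outliers must be removed but do not say how. A minimal sketch, assuming a simple nearest-rank percentile cutoff; the helper name `trim_outliers`, the 90th-percentile threshold, and the sample values are all illustrative assumptions, not the author's method:

```python
import math

def trim_outliers(samples, upper_pct=99.0):
    """Drop samples above the given percentile (nearest-rank method)."""
    if not samples:
        return []
    ordered = sorted(samples)
    # Nearest rank: the smallest value with at least upper_pct% of the data at or below it.
    rank = max(1, math.ceil(upper_pct * len(ordered) / 100.0))
    cutoff = ordered[rank - 1]
    return [s for s in samples if s <= cutoff]

# Made-up RUM load times in seconds; one beacon came from a stalled connection.
load_times = [2.1, 2.4, 2.2, 3.0, 2.8, 2.5, 2.3, 2.9, 2.6, 95.0]
trimmed = trim_outliers(load_times, upper_pct=90.0)
print(len(trimmed), max(trimmed))
```

Whatever cutoff is chosen, it is part of the methodology and should be reported alongside the results.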

First: New vocab for last-mile tools
Fully Loaded
- Entire page has been loaded
- Including asynchronous functions like analytics beacons
- The browser hasn't utilized the Internet connection for a while
- Generally transparent from a user's perspective
For a long time, Fully Loaded is all we had. With mature client-side technologies, the Fully Loaded metric is much less relevant:
- Does not take into account browser mechanics
- Fires after the connection falls idle, which has nothing to do with user experience!

Document Complete (or Onload)
- The page is assembled by the browser and ready for the user.
- (Almost) always visually complete
- User can use the scroll bars, click links, or search.
- The browser may still be doing things in the background.
Some sites defer loading of prominent content until after document complete. Some Front-End Optimization (FEO) packages defer script execution until document complete. In this case, an interactive site may look visually complete at document complete, but won't actually be responsive or usable until after those scripts execute!

Start Render (or Render Start)
- Browser paints something (anything) on the screen.
- May be all or most of the page, or a single image, or a single paragraph, or a single pixel.
- The moment your user knows that the web site is actually working.
- Abandonment (usually) happens before Start Render.

Summary of terms:
- Load Time: Otherwise known as Document Complete.
- First Byte: Network latency plus server latency. Transparent for users. Critical path for all browser functions. Minimize wherever possible.
- Start Render: Otherwise known as Render Start.
- Visually Complete: All visual components of the page are painted on the screen.
- Speed Index: Loosely, the average time for visual components to be painted on the screen.
- Fully Loaded: The same Fully Loaded as before. The browser stops using the connection.

BEWARE: Visually Complete is not the same as functional. Some Front-End Optimizations defer JavaScript execution to make the page look visually complete faster, but users may not be able to click links, scroll the window, or search!

Speed Index, more technically: the area above the curve when all paint events are plotted (lower is better). The same warnings around visual completeness apply: sites with great Speed Indexes are not necessarily functional as quickly as they are visible.
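The "area above the curve" description can be sketched in a few lines. This is a minimal illustration, assuming visual completeness is a step function that changes only at paint events; the `(time_ms, completeness)` input format and the two example pages are made up for illustration:

```python
def speed_index(paint_events):
    """Area above the visual-completeness curve, in the time units of the input.

    paint_events: (time, completeness) pairs sorted by time, with
    completeness in [0, 1] and the last pair reaching 1.0.
    """
    area = 0.0
    prev_time = 0.0
    prev_complete = 0.0  # nothing is painted at t = 0
    for time, complete in paint_events:
        # Between paints the screen stays at prev_complete, so the
        # unpainted fraction accrues for the whole interval.
        area += (time - prev_time) * (1.0 - prev_complete)
        prev_time, prev_complete = time, complete
    return area

# Both hypothetical pages are visually complete at 4000 ms.
early_paint = [(1000, 0.9), (4000, 1.0)]  # most content appears early
late_paint = [(1000, 0.1), (4000, 1.0)]   # most content appears late
print(speed_index(early_paint), speed_index(late_paint))
```

The two pages share the same Visually Complete time, yet the page that paints most of its content early gets a much lower (better) score, which is what Speed Index adds over a single timestamp.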

Now I have data... lots of data
Over 6,000 data points.
→ What can we do with this?

Possible interpretations

         Average   Median   Standard Deviation
blue       8.947    7.323    4.792
red        9.239    7.168    5.357
green      8.155    6.977    4.844
purple    14.104   13.109    4.397

→ Gross oversimplification. May be useful.
But look at how the graph changes with slightly different cuts.
→ Could be highly misleading!
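To see why three summary numbers can oversimplify, here is a small sketch using Python's standard `statistics` module. The two sample sets are made-up load times in seconds, not the data behind the table above:

```python
import statistics

def summarize(samples):
    """Return the three summary figures used in the table above."""
    return {
        "mean": statistics.mean(samples),
        "median": statistics.median(samples),
        "stdev": statistics.stdev(samples),
    }

# Illustrative, made-up load times in seconds.
steady = [7.0, 7.5, 8.0, 8.5, 9.0]   # everyone sees roughly the same page load
skewed = [2.0, 2.5, 3.0, 3.5, 29.0]  # most users are fast, one is very slow

for name, data in (("steady", steady), ("skewed", skewed)):
    print(name, summarize(data))
```

Both sets have the same average, yet the user experience behind them is very different; the average alone would hide that.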

But, wait! There's more (data)!
None of these representations capture the whole picture! There are hundreds of permutations of variability, with different:
- Internet connection types
- Devices
- Browsers
- Geographies
- Wireless connection quality
- Computing power
And then, there's the natural variability of the Internet.
Plots over time usually aren't that relevant for web performance:
- Oversimplification, sometimes misleading!
- Rarely actionable

We can't take all these things and distill them into one number, or even one number plotted over time.
Enter the histogram:
- The histogram expresses how many users experienced a particular page load time.
- Taller bars mean that more users saw the load time in that interval.
- Shorter bars mean that fewer users saw the load time in that interval.
- Faster transaction times are on the left side of the histogram.
- When the taller bars are on the left side, it means that more users saw a fast experience.
If you are comparing two experiences, plot the histograms on the same chart!
In the example chart, red is definitely faster than blue:
- Fast users got faster
- Medium users got faster
- Slow users got faster
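A minimal sketch of the histogram idea, assuming fixed-width buckets and a text rendering. The helper names, the 2-second bucket width, and the `blue`/`red` samples are illustrative assumptions, not the data from the slides:

```python
from collections import Counter

def histogram(samples, bucket_width=1.0):
    """Count samples per bucket; bucket k covers [k*width, (k+1)*width)."""
    counts = Counter(int(s // bucket_width) for s in samples)
    return dict(sorted(counts.items()))

def render(counts, bucket_width=1.0):
    """Draw one text bar per bucket; longer bars mean more page loads."""
    lines = []
    for bucket, n in counts.items():
        low = bucket * bucket_width
        lines.append(f"{low:5.1f}-{low + bucket_width:5.1f}s | {'#' * n}")
    return "\n".join(lines)

# Made-up page load times in seconds for two experiences.
blue = [4.2, 5.1, 5.8, 6.3, 6.9, 7.4, 9.8]
red = [2.9, 3.4, 3.8, 4.1, 4.6, 5.2, 7.1]

blue_hist = histogram(blue, bucket_width=2.0)
red_hist = histogram(red, bucket_width=2.0)
print("blue\n" + render(blue_hist, 2.0))
print("red\n" + render(red_hist, 2.0))
```

Changing `bucket_width` changes the shape of the bars, so the bucket size is itself a methodology choice worth stating.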

Need More? Meet the Cumulative Distribution Function (CDF)
We all love histograms:
- Everything is represented
- Easy to consume
But, they still have shortcomings:
- Finite granularity
- Arbitrary bucket designations
Maybe we need something a little more high-octane!

The Cumulative Distribution Function (CDF) expresses the percentage of page loads completed within a given amount of elapsed time.
- So, for blue, approximately 20% of page loads were completed in 5 seconds or less.
- Slightly less than 70% of transactions were done in 10 seconds or less.
As with histograms, two data sets are easy to compare: a better (faster) CDF is one whose curve lies to the left of and above the other.
- The red line is higher and more to the left. A greater percentage of users are done with their page load at any given time.
- The gap between the lines is the differential. At the point marked on the chart, only 80% of blue users were done with their page load. After the same amount of time, more than 90% of red users were done.
- The red curve is above and to the left of the blue curve in all cases. Red is faster for all users.
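The CDF comparison can be sketched as an empirical CDF over raw samples, with no buckets to choose. The function names and the `blue`/`red` samples below are illustrative assumptions, not the data plotted in the slides:

```python
from bisect import bisect_right

def cdf_at(samples_sorted, t):
    """Fraction of page loads that completed in t seconds or less."""
    return bisect_right(samples_sorted, t) / len(samples_sorted)

def faster_everywhere(a, b):
    """True if a's CDF is never below b's and is strictly above it somewhere.

    Empirical CDFs are step functions that change only at sample values,
    so checking every observed value is sufficient.
    """
    a_sorted, b_sorted = sorted(a), sorted(b)
    points = sorted(set(a_sorted) | set(b_sorted))
    gaps = [cdf_at(a_sorted, t) - cdf_at(b_sorted, t) for t in points]
    return all(g >= 0 for g in gaps) and any(g > 0 for g in gaps)

# Made-up page load times in seconds.
blue = [4.0, 6.0, 8.0, 10.0, 12.0]
red = [3.0, 5.0, 7.0, 9.0, 11.0]

print(cdf_at(sorted(blue), 5.0))     # share of blue page loads done within 5 s
print(cdf_at(sorted(red), 5.0))      # share of red page loads done within 5 s
print(faster_everywhere(red, blue))  # is red above and to the left throughout?
```

If the curves cross, `faster_everywhere` returns False in both directions, which signals that one experience is better for some users and worse for others.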

Tie it all together
- The Internet is a jungle.
- Methodology matters more than results.
- Statistics can lie.
- Pick your tool wisely.
- Irrelevant metrics mislead.
- Performance is never a single number.
- Powerful visualizations trump aggregate figures.
- Spreadsheets are your friend.

Thanks!
http://grant.ellis.io
grant@instartlogic.com