Statistical Confidence Calculations



Similar documents
Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY

Week 4: Standard Error and Confidence Intervals

4. Continuous Random Variables, the Pareto and Normal Distributions

MEASURES OF VARIATION

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

CALCULATIONS & STATISTICS

Constructing and Interpreting Confidence Intervals

Simple Regression Theory II 2010 Samuel L. Baker

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

Probability Distributions

99.37, 99.38, 99.38, 99.39, 99.39, 99.39, 99.39, 99.40, 99.41, cm

Statistical estimation using confidence intervals

Using R for Linear Regression

Descriptive Statistics

12: Analysis of Variance. Introduction

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

An introduction to Value-at-Risk Learning Curve September 2003

Testing Research and Statistical Hypotheses

LOGIT AND PROBIT ANALYSIS

One-Way ANOVA using SPSS SPSS ANOVA procedures found in the Compare Means analyses. Specifically, we demonstrate

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013

STATISTICA Formula Guide: Logistic Regression. Table of Contents

Signal to Noise Instrumental Excel Assignment

Chapter 5: Normal Probability Distributions - Solutions

CS 147: Computer Systems Performance Analysis

" Y. Notation and Equations for Regression Lecture 11/4. Notation:

COMPARISON MEASURES OF CENTRAL TENDENCY & VARIABILITY EXERCISE 8/5/2013. MEASURE OF CENTRAL TENDENCY: MODE (Mo) MEASURE OF CENTRAL TENDENCY: MODE (Mo)

Part 2: Analysis of Relationship Between Two Variables

5.1 Identifying the Target Parameter

Two-sample inference: Continuous data

Lecture Notes Module 1

Errata and updates for ASM Exam C/Exam 4 Manual (Sixteenth Edition) sorted by page

Northumberland Knowledge

Lesson 7 Z-Scores and Probability

Method To Solve Linear, Polynomial, or Absolute Value Inequalities:

STRUTS: Statistical Rules of Thumb. Seattle, WA

DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.

Concepts in Investments Risks and Returns (Relevant to PBE Paper II Management Accounting and Finance)

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

Descriptive Statistics

2 Sample t-test (unequal sample sizes and unequal variances)

MATHEMATICAL TRADING INDICATORS

3. What is the difference between variance and standard deviation? 5. If I add 2 to all my observations, how variance and mean will vary?

Scatter Plots with Error Bars

Chapter 5. Random variables

Balance Accuracy. FlexiWeigh

Independent t- Test (Comparing Two Means)

Algebra Practice Problems for Precalculus and Calculus

Simple linear regression

Measures of Central Tendency and Variability: Summarizing your Data for Others

Confidence Intervals for One Standard Deviation Using Standard Deviation

INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA)

6.4 Normal Distribution

Confidence Intervals in Public Health

Characterizing Digital Cameras with the Photon Transfer Curve

THE FIRST SET OF EXAMPLES USE SUMMARY DATA... EXAMPLE 7.2, PAGE 227 DESCRIBES A PROBLEM AND A HYPOTHESIS TEST IS PERFORMED IN EXAMPLE 7.

Study Guide for the Final Exam

Means, standard deviations and. and standard errors

CHAPTER THREE COMMON DESCRIPTIVE STATISTICS COMMON DESCRIPTIVE STATISTICS / 13

Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur

Module 4: Data Exploration

MATH BOOK OF PROBLEMS SERIES. New from Pearson Custom Publishing!

Overview Classes Logistic regression (5) 19-3 Building and applying logistic regression (6) 26-3 Generalizations of logistic regression (7)

TImath.com. F Distributions. Statistics

Chapter 7. Comparing Means in SPSS (t-tests) Compare Means analyses. Specifically, we demonstrate procedures for running Dependent-Sample (or

3.2. Solving quadratic equations. Introduction. Prerequisites. Learning Outcomes. Learning Style

TImath.com. Statistics. Areas in Intervals

Confidence Intervals for the Difference Between Two Means

3.4 Statistical inference for 2 populations based on two samples

Coefficient of Determination

Quadratics - Build Quadratics From Roots

SENSITIVITY ANALYSIS AND INFERENCE. Lecture 12

How To Run Statistical Tests in Excel

Good luck! BUSINESS STATISTICS FINAL EXAM INSTRUCTIONS. Name:

Chapter 4 and 5 solutions

Normal Distribution as an Approximation to the Binomial Distribution

CA200 Quantitative Analysis for Business Decisions. File name: CA200_Section_04A_StatisticsIntroduction

Estimating Weighing Uncertainty From Balance Data Sheet Specifications

Statistics. Measurement. Scales of Measurement 7/18/2012

Binomial Sampling and the Binomial Distribution

Multiple-Comparison Procedures

Statistical Process Control (SPC) Training Guide

Algebra I Vocabulary Cards

Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing

SAMPLE SIZE CONSIDERATIONS

Multivariate Analysis of Variance. The general purpose of multivariate analysis of variance (MANOVA) is to determine

Error Analysis. Table 1. Capacity Tolerances for Class A Volumetric Glassware.

This unit will lay the groundwork for later units where the students will extend this knowledge to quadratic and exponential functions.

Module 3: Correlation and Covariance

Characteristics of Binomial Distributions

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test

One-Way Analysis of Variance

Mean = (sum of the values / the number of the value) if probabilities are equal

DESCRIPTIVE STATISTICS & DATA PRESENTATION*

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

VisualCalc AdWords Dashboard Indicator Whitepaper Rev 3.2

CHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression

A guide to level 3 value added in 2015 school and college performance tables

Probability Distribution for Discrete Random Variables

Transcription:

Statistical Confidence Calculations Statistical Methodology Omniture Test&Target utilizes standard statistics to calculate confidence, confidence intervals, and lift for each campaign. The student s T test is used for these calculations. For conversions and steps, calculations for binomial distributions are used; for all other calculations a normal distribution is assumed. Conversion Data Calculations Conversion Rate Conversion rate is the number of conversions divided by the number of visitors (or visits or impressions). "#$%&()#$( "#$%&()#$*+,&"*./ 0)(),#( Confidence Confidence is the result of the Student s T Test. Confidence indicates how likely it is that if the test were repeated, the same results would be found. To calculate confidence, the standard deviation must first be determined. Standard Deviation Standard deviation is a measure of the spread or dispersion of a set of data. The standard deviation is simply a way to measure the dispersion of data. If many data points are close to the mean, then the standard deviation is small; if many data points are far from the mean, then the standard deviation is large. Standard deviation is also the square root of the variance. The conversion rate results are a binomial distribution: visitors either convert or do not convert. Thus, we use the binomial distribution formula for variance: "#$%#&() * = +,." +,/ Since standard deviation is the square root of variance, we get: "#$%&$&()*+$#+,% =./01"./2 Standard Error The standard error is the estimated standard deviation of the error. This error is the noise in the results. For the control experience: 1

"#$%&$&() *)%#)+,"( = *)%#)+./0/#)0 *)%#)+ For alternative experiences: "#$%&$&() $*# +"( =,)%#)* + $*#./0/#)0,)%#)*./0/#)0 $*# The standard error is then used to calculate the signal to noise ratio. Signal to Noise Ratio "#$%& ()#*+ =,, &. /)%.0) "1 &. Confidence This signal to noise ratio is then used to calculate confidence. Confidence is calculated with the Student s T Test. It is a 2 tailed distribution, and the degrees of freedom is equal to visitors control +visitors alt 2. In Excel this is calculated with the TDIST() function: "#$%&#()*+,)#,.$#,/012*+3"*4 ",567 67# 9:,),#/) 8/,)& 67# + :,),#/) ;/#/7 <9<= 67# Confidence refers to the likelihood that the alternative experience s performance relative to control wasnt due to noise. This determines how confident you can be that the results would be repeated if the test were re run. Confidence Intervals The confidence interval displayed in Test&Target is different than the confidence level. While the confidence level shows the likelihood that the test results were not based on noise, the confidence interval assumes a 95% confidence level and shows how much your results could vary and still be within that 95% confidence level. Essentially, this calculation describes how large the standard deviation is in an easily understood way. The confidence interval is derived from the standard deviation and the sample size (# of visitors). The smaller the standard deviation and the larger the sample size, the narrower your confidence interval. " "#$%&($)(*$+(,./01234 $ # $ %./+ 5&6&+#,6./+ & where ±1.96 is the interval on the normal distribution curve containing 95% of the area under the curve. Then, determine the high and low bounds for your conversion rate by adding and subtracting this confidence interval from your conversion rate: 2

"#$%&()*+,+, + +. "#$%#&()*+)* ), Lift Lift is the percentage difference of your tested experience vs. control. " ( "#$%& )*% ( +,%.,* % $ # ( +,%.,* & Lift Intervals Lift intervals describe the best and worst lift a tested experience could have over control. To get this information, the high and low bound of conversion rate for control is compared to the high and low bound of conversion rate for the alternative (or tested) experience. The statistical confidence has already been applied, so no additional statistical equations are involved here. The worst lift the alternative could experience would be if the control performed at its high bound and the alternative performed at its lower bound: "#$%#&(")*+,"#$.+ /0)12 3# Conversely, the best lift the alternative could experience would be if the control performed at its low bound and the alternative performed at its high bound: "#$%&()*+#,."#$% /0 1+2 3) Revenue Data Calculations Revenue Per Visitor 3

Revenue per visitor is the total sales number divided by the number of visitors (or visits or impressions). Standard Deviation Standard Error For revenue calculations, the standard error calculation is the same for the control and alternate experiences. This calculation is used for the signal/noise ratio. Standard Error of the Difference This calculates the difference in performance between the alternate experience and the control. Signal to Noise Ratio Confidence Revenue Confidence Interval Then, determine the high and low bounds for your revenue per visitor by adding and subtracting this confidence interval from your revenue per visitor number, just like for conversion rate. 4

Lift Lift is the percentage difference of your tested experience vs. control. Lift Intervals The lift intervals are calculated in the same manner as for conversion rate above. These calculations describe the derivation of all the statistical values displayed in the Test&Target reports. With these calculations, you can download the raw data directly from Test&Target or programmatically access it via the API and run your own analyses. 5