Normal Distribution.

Similar documents
Sampling Distribution And Central Limit Theorem

Properties of MLE: consistency, asymptotic normality. Fisher information.

5: Introduction to Estimation

I. Chi-squared Distributions

Case Study. Normal and t Distributions. Density Plot. Normal Distributions

Z-TEST / Z-STATISTIC: used to test hypotheses about. µ when the population standard deviation is unknown

Overview of some probability distributions.

Confidence Intervals for One Mean

Maximum Likelihood Estimators.

Chapter 7 Methods of Finding Estimators

University of California, Los Angeles Department of Statistics. Distributions related to the normal distribution

1 Computing the Standard Deviation of Sample Means

The following example will help us understand The Sampling Distribution of the Mean. C1 C2 C3 C4 C5 50 miles 84 miles 38 miles 120 miles 48 miles

Measures of Spread and Boxplots Discrete Math, Section 9.4

Chapter 6: Variance, the law of large numbers and the Monte-Carlo method

Hypothesis testing. Null and alternative hypotheses


Chapter 7 - Sampling Distributions. 1 Introduction. What is statistics? It consist of three major areas:

1. C. The formula for the confidence interval for a population mean is: x t, which was

Output Analysis (2, Chapters 10 &11 Law)

BASIC STATISTICS. f(x 1,x 2,..., x n )=f(x 1 )f(x 2 ) f(x n )= f(x i ) (1)

Parametric (theoretical) probability distributions. (Wilks, Ch. 4) Discrete distributions: (e.g., yes/no; above normal, normal, below normal)

Statistical inference: example 1. Inferential Statistics

In nite Sequences. Dr. Philippe B. Laval Kennesaw State University. October 9, 2008

One-sample test of proportions

Confidence Intervals. CI for a population mean (σ is known and n > 30 or the variable is normally distributed in the.

Incremental calculation of weighted mean and variance

Center, Spread, and Shape in Inference: Claims, Caveats, and Insights

GCSE STATISTICS. 4) How to calculate the range: The difference between the biggest number and the smallest number.

Overview. Learning Objectives. Point Estimate. Estimation. Estimating the Value of a Parameter Using Confidence Intervals

Lesson 17 Pearson s Correlation Coefficient

Determining the sample size

This document contains a collection of formulas and constants useful for SPC chart construction. It assumes you are already familiar with SPC.

Descriptive Statistics

hp calculators HP 12C Statistics - average and standard deviation Average and standard deviation concepts HP12C average and standard deviation

THE REGRESSION MODEL IN MATRIX FORM. For simple linear regression, meaning one predictor, the model is. for i = 1, 2, 3,, n

Chapter 14 Nonparametric Statistics

Confidence Intervals

Systems Design Project: Indoor Location of Wireless Devices

Section 11.3: The Integral Test

Convexity, Inequalities, and Norms

Chapter 7: Confidence Interval and Sample Size

Approximating Area under a curve with rectangles. To find the area under a curve we approximate the area using rectangles and then use limits to find

PSYCHOLOGICAL STATISTICS

Convention Paper 6764

SAMPLE QUESTIONS FOR FINAL EXAM. (1) (2) (3) (4) Find the following using the definition of the Riemann integral: (2x + 1)dx

UC Berkeley Department of Electrical Engineering and Computer Science. EE 126: Probablity and Random Processes. Solutions 9 Spring 2006

1 The Gaussian channel

Repeating Decimals are decimal numbers that have number(s) after the decimal point that repeat in a pattern.

Math C067 Sampling Distributions

Practice Problems for Test 3

Now here is the important step

Quadrat Sampling in Population Ecology

Definition. A variable X that takes on values X 1, X 2, X 3,...X k with respective frequencies f 1, f 2, f 3,...f k has mean

Exploratory Data Analysis

Week 3 Conditional probabilities, Bayes formula, WEEK 3 page 1 Expected value of a random variable

, a Wishart distribution with n -1 degrees of freedom and scale matrix.

CHAPTER 7: Central Limit Theorem: CLT for Averages (Means)

Multi-server Optimal Bandwidth Monitoring for QoS based Multimedia Delivery Anup Basu, Irene Cheng and Yinzhe Yu

Solutions to Selected Problems In: Pattern Classification by Duda, Hart, Stork

Lesson 15 ANOVA (analysis of variance)

LECTURE 13: Cross-validation

1 Correlation and Regression Analysis

Biology 171L Environment and Ecology Lab Lab 2: Descriptive Statistics, Presenting Data and Graphing Relationships

Unit 8: Inference for Proportions. Chapters 8 & 9 in IPS

Theorems About Power Series

Trigonometric Form of a Complex Number. The Complex Plane. axis. ( 2, 1) or 2 i FIGURE The absolute value of the complex number z a bi is

Soving Recurrence Relations

CHAPTER 3 DIGITAL CODING OF SIGNALS

Solving Logarithms and Exponential Equations

Example 2 Find the square root of 0. The only square root of 0 is 0 (since 0 is not positive or negative, so those choices don t exist here).

Universal coding for classes of sources

Infinite Sequences and Series

MEI Structured Mathematics. Module Summary Sheets. Statistics 2 (Version B: reference to new book)

Research Method (I) --Knowledge on Sampling (Simple Random Sampling)

Non-life insurance mathematics. Nils F. Haavardsson, University of Oslo and DNB Skadeforsikring

Modified Line Search Method for Global Optimization

Confidence intervals and hypothesis tests

Here are a couple of warnings to my students who may be here to get a copy of what happened on a day that you missed.

Our aim is to show that under reasonable assumptions a given 2π-periodic function f can be represented as convergent series

3. If x and y are real numbers, what is the simplified radical form

Lecture 4: Cauchy sequences, Bolzano-Weierstrass, and the Squeeze theorem

Vladimir N. Burkov, Dmitri A. Novikov MODELS AND METHODS OF MULTIPROJECTS MANAGEMENT

How To Solve The Homewor Problem Beautifully

CME 302: NUMERICAL LINEAR ALGEBRA FALL 2005/06 LECTURE 8

THE TWO-VARIABLE LINEAR REGRESSION MODEL

Present Values, Investment Returns and Discount Rates

Department of Computer Science, University of Otago

INVESTMENT PERFORMANCE COUNCIL (IPC)

Inference on Proportion. Chapter 8 Tests of Statistical Hypotheses. Sampling Distribution of Sample Proportion. Confidence Interval

Unbiased Estimation. Topic Introduction

Topic 5: Confidence Intervals (Chapter 9)

Basic Elements of Arithmetic Sequences and Series

Analyzing Longitudinal Data from Complex Surveys Using SUDAAN

WHEN IS THE (CO)SINE OF A RATIONAL ANGLE EQUAL TO A RATIONAL NUMBER?

4.3. The Integral and Comparison Tests

Transcription:

Normal Distributio www.icrf.l

Normal distributio I probability theory, the ormal or Gaussia distributio, is a cotiuous probability distributio that is ofte used as a first approimatio to describe realvalued radom variables that ted to cluster aroud a sigle mea value. f µ e π where parameter μ is the mea locatio of the peak ad is the variace the measure of the width of the distributio. The distributio with μ 0 ad is called the stadard ormal. www.icrf.l

The ormal distributio is cosidered the most promiet probability distributio i statistics There are several reasos for this: First, the ormal distributio is very tractable aalytically, that is, a large umber of results ivolvig this distributio ca be derived i eplicit form. Secod, the ormal distributio arises as the outcome of the cetral limit theorem, which states that uder mild coditios the sum of a large umber of radom variables is distributed approimately ormally. Fially, the bell shape of the ormal distributio make it a coveiet choice for modelig a large variety of radom variables ecoutered i practice. The ormal distributio is usually deoted by Nμ,. Commoly the letter N is writte i calligraphic fot typed as \mathcal{n} i LaTeX. Thus whe a radom variable X is distributed ormally with mea μ ad variace, we write: X ~ Ν µ, www.icrf.l 3

Stadard ormal distributio The simplest case of a ormal distributio is kow as the stadard ormal distributio, described by the probability desity fuctio φ e π Notice that for a stadard ormal distributio, μ 0 ad. The parameter μ is at the same time the mea, the media ad the mode of the ormal distributio. The parameter is called the variace; as for ay radom variable, it describes how cocetrated the distributio is aroud its mea. The square root of is called the stadard deviatio ad is the width of the desity fuctio. µ µ f ;, e π φ µ www.icrf.l 4

Cumulative distributio fuctio I The cumulative distributio fuctio cdf describes probabilities for a radom variable to fall i the itervals of the form, ]. The cdfof the stadard ormal distributio is deoted with the capital Greek letter Φ phi, ad ca be computed as a itegral of the probability desity fuctio: t / Φ e dt [ + erf π ] I mathematics, the error fuctio also called the Gauss error fuctio or probability itegral is a special fuctio o-elemetary of sigmoid shape which occurs i probability, statistics ad partial differetial equatios. It is defied as: erf e t π 0 dt www.icrf.l 5

Cumulative distributio fuctio II This itegral ca oly be epressed i terms of a special fuctio erf, called the error fuctio. The umerical methods for calculatio of the stadard ormal cdfare discussed below. For a geeric ormal radom variable with mea μ ad variace > 0 the cdfwill be equal to F ; µ, Φ µ µ [+ erf ] µ The complemet of the stadard ormal cdf, Q Φ, is referred to as the Q-fuctio, especially i egieerig tets. This represets the tail probability of the Gaussia distributio, that is the probability that a stadard ormal radom variable X is greater tha the umber. Other defiitios of the Q-fuctio, all of which are simple trasformatios of Φ, are also used occasioally. www.icrf.l 6

Stadardizig ormal radom variables It is possible to relate all ormal radom variables to the stadard ormal. For eample if X is ormal with mea μ ad variace, the Z Xµ has mea zero ad uit variace, that is Z has the stadard ormal distributio. Coversely, havig a stadard ormal radom variable Z we ca always costruct aother ormal radom variable with specific mea μ ad variace : X Z+µ This stadardizig trasformatio is coveiet as it allows oe to compute the pdfad especially the cdfof a ormal distributio havig the table of pdfad cdf values for the stadard ormal. They will be related via µ µ φ, f FX Φ www.icrf.l 7

Stadard deviatio About 68% of values draw from a ormal distributio are withi oe stadard deviatio away from the mea; about 95% of the values lie withi two stadard deviatios; ad about 99.7% are withi three stadard deviatios. This fact is kow as the 68-95-99.7 rule, or the empirical rule, or the 3-sigma rule. To be more precise, the area uder the bell curve betwee μ ad μ + is give by F µ + ; µ, F µ ; µ, Φ Φ erf Dark blue is less tha oe stadard deviatio from the mea. For the ormal distributio, this accouts for about 68% of the set, while two stadard deviatios from the mea medium ad dark blue accout for about 95%, ad three stadard deviatios light, medium, ad dark blue accout for about 99.7% www.icrf.l 8

Estimatio of parameters I It is ofte the case that we do t kow the parameters of the ormal distributio, but istead wat to estimate them. That is, havig a sample,, from a ormal Nμ, populatio we would like to lear the approimate values of parameters μad. The stadard approach to this problem is the maimum likelihood method. Maimum likelihood estimates: ˆ i, i i i ˆ µ Estimator µˆ is called the sample mea, sice it is the arithmetic mea of all observatios. The estimator ˆ is called the sample variace, sice it is the variace of the sample,, Of practical importace is the fact that the stadard error of µˆ is proportioal to /sqrtn, that is, if oe wishes to decrease the stadard error by a factor of 0, oe must icrease the umber of poits i the sample by a factor of 00. www.icrf.l 9

Estimatio of parameters II To use statistical parameters such as mea ad stadard deviatio reliably, you eed to have a good estimator for them. The maimum likelihood estimates MLEs provide oe such estimator. However, a MLE might be biased, which meas that its epected value of the parameter might ot equal the parameter beig estimated. For eample, a MLE is biased for estimatig the variace of a ormal distributio. A ubiased estimator that is commoly used to estimate the parameters of the ormal distributio is the miimum variace ubiased estimatormvue. The MVUE has the miimum variace of all ubiased estimators of a parameter. The MVUEs of parameters µad for the ormal distributio are the sample mea ad variace. The sample mea is also the MLE for µ. The followig are two commo formulas for the variace. s s i i i i with i i The first equatio is the maimum likelihood estimator for, ad the secod equatio is MVUE. www.icrf.l 0

Sigal to oise improvemet due to averagig Ideally it is assumed that: Sigal ad oise are ucorrelated. Sigal stregth is costat i the replicate measuremets. Noise is radom, with a mea of zero ad costat variace i the replicate measuremets. Uder these assumptios let the sigal stregth be deoted by Sad let the stadard deviatio of a sigle measuremet be ; this represets the oise i oe measuremet, N. If measuremets are added together the sum of sigal stregths will be *S. For the oise, the stadard error propagatio formula shows that the variace,, is additive. The variace of the sum is equal to. Hece the sigal-to-oise ratio, S/N, is give by S N S The equivalet epressio for sigal averagig is obtaied by dividig both umerator ad deomiator by. S N S Thus, i the ideal case S/N icreases with the square root of the umber of measuremets that are averaged. I practice, the assumptios may be ot be fully realized. This will result i a lower S/N improvemet tha i the ideal case, but i may cases ear-ideal S/N improvemet ca be achieved. S S www.icrf.l

Radom umbers with a Gaussia distributio i Ecel This Ecel formula computes a radom umber from a Gaussia distributio with a mea of 0.0 ad a SD of.0. NORMSINVRAND The RAND fuctio calculates a radom umber from 0 to. the NORMSINV fuctio takes a fractio betwee 0 ad ad tells you how may stadard deviatios you eed to go above or below the mea for a cumulative Gaussia distributio to cotai that fractio of the etire populatio. Multiple by the stadard deviatio ad add a mea, ad you'll have radom umbers draw from a Gaussia distributio with that mea ad SD. For eample, use this formula to sample from a Gaussia distributio with a mea of 00 ad a SD of 5: NORMSINVRAND*5+00 www.icrf.l

Normal distributio i Matlab Normal probability desity fuctios are geerated usig fuctio ormpdf. Characteristic of a ormal distributio are mea ad stadard deviatio. ormpdf, mea, std : vector of rage icludig graularity -5:0.:5; mu 3; sigma 4; pdfnormal ormpdf, mu, sigma; plot, pdfnormal; www.icrf.l 3

Radom umbers with a Gaussia ormrd- Normal radom umbers Syta R ormrdmu,sigma R ormrdmu,sigma,m,,... R ormrdmu,sigma,[m,,...] distributio i Matlab Descriptio R ormrdmu,sigma geerates radom umbers from the ormal distributio with mea parameter mu ad stadard deviatio parameter sigma. mu ad sigma ca be vectors, matrices, or multidimesioal arrays that have the same size, which is also the size of R. A scalar iput for mu or sigma is epaded to a costat array with the same dimesios as the other iput. R ormrdmu,sigma,m,,... or R ormrdmu,sigma,[m,,...] geerates a m-by--by-... array. The mu, sigma parameters ca each be scalars or arrays of the same size as R. Eamples ormrd:6,./:6.650.334 3.050 4.0879 4.8607 6.87 ormrd0,,[ 5] 0.059.797 0.64 0.877 -.446 3 ormrd[ 3;4 5 6],0.,,3 3 0.999.936.9640 4.46 5.0577 5.9864 www.icrf.l 4

Sources http://e.wikipedia.org/wiki/normal_distributio http://e.wikipedia.org/wiki/error_fuctio http://e.wikipedia.org/wiki/sigal_averagig http://www.graphpad.com/faq/viewfaq.cfm?faq966 http://www.aquaphoei.com/lecture/matlab0/page.html#0. http://www.mathworks.com/help/toolbo/stats/ormrd.html http://www.mathworks.com/help/toolbo/stats/brivz-9.html www.icrf.l 5