The most important summary descriptor of a random variable Y is the mean or expectation.

Similar documents
Lecture 3: Continuous distributions, expected value & mean, variance, the normal distribution

LOGNORMAL MODEL FOR STOCK PRICES

Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur

FEGYVERNEKI SÁNDOR, PROBABILITY THEORY AND MATHEmATICAL

Overview of Monte Carlo Simulation, Probability Review and Introduction to Matlab

Maximum Likelihood Estimation

Probability and Random Variables. Generation of random variables (r.v.)

Lecture 6: Discrete & Continuous Probability and Random Variables

Lecture 8. Confidence intervals and the central limit theorem

APPLIED MATHEMATICS ADVANCED LEVEL

The Binomial Distribution

Multiple regression - Matrices

BNG 202 Biomechanics Lab. Descriptive statistics and probability distributions I

PSTAT 120B Probability and Statistics

Chapter 4 - Lecture 1 Probability Density Functions and Cumul. Distribution Functions

1 The Brownian bridge construction

Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics

Principle of Data Reduction

Stat 704 Data Analysis I Probability Review

. (3.3) n Note that supremum (3.2) must occur at one of the observed values x i or to the left of x i.

Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model

99.37, 99.38, 99.38, 99.39, 99.39, 99.39, 99.39, 99.40, 99.41, cm

1.5 Oneway Analysis of Variance

An Introduction to Basic Statistics and Probability

A Coefficient of Variation for Skewed and Heavy-Tailed Insurance Losses. Michael R. Powers[ 1 ] Temple University and Tsinghua University

ECE302 Spring 2006 HW5 Solutions February 21,

Random variables, probability distributions, binomial random variable

Lecture 7: Continuous Random Variables

For a partition B 1,..., B n, where B i B j = for i. A = (A B 1 ) (A B 2 ),..., (A B n ) and thus. P (A) = P (A B i ) = P (A B i )P (B i )

IEOR 6711: Stochastic Models I Fall 2012, Professor Whitt, Tuesday, September 11 Normal Approximations and the Central Limit Theorem

Outline. Definitions Descriptive vs. Inferential Statistics The t-test - One-sample t-test

STAT 830 Convergence in Distribution

Hypothesis Testing for Beginners

SAS Certificate Applied Statistics and SAS Programming

CA200 Quantitative Analysis for Business Decisions. File name: CA200_Section_04A_StatisticsIntroduction

Triangular Distributions

Advanced Topics in Statistical Process Control

Notes on Continuous Random Variables

Master s Theory Exam Spring 2006

Lecture 5 : The Poisson Distribution

DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.

Pr(X = x) = f(x) = λe λx

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

MBA 611 STATISTICS AND QUANTITATIVE METHODS

5. Continuous Random Variables

VISUALIZATION OF DENSITY FUNCTIONS WITH GEOGEBRA

Maximum likelihood estimation of mean reverting processes

Data Modeling & Analysis Techniques. Probability & Statistics. Manfred Huber

m i: is the mass of each particle

Approximation of Aggregate Losses Using Simulation

Stats on the TI 83 and TI 84 Calculator

Curriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools

CHAPTER 14 NONPARAMETRIC TESTS

II. DISTRIBUTIONS distribution normal distribution. standard scores

Normality Testing in Excel

Week 4: Standard Error and Confidence Intervals

Joint Exam 1/P Sample Exam 1

Lecture 19: Chapter 8, Section 1 Sampling Distributions: Proportions

The correlation coefficient

A Primer on Mathematical Statistics and Univariate Distributions; The Normal Distribution; The GLM with the Normal Distribution

Chapter 3 RANDOM VARIATE GENERATION

What s New in Econometrics? Lecture 8 Cluster and Stratified Sampling

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

MAS108 Probability I

Descriptive Statistics

CHI-SQUARE: TESTING FOR GOODNESS OF FIT

On the Efficiency of Competitive Stock Markets Where Traders Have Diverse Information

Topic 3b: Kinetic Theory

Measures of Central Tendency and Variability: Summarizing your Data for Others

DYNAMIC LIGHT SCATTERING COMMON TERMS DEFINED

STA-201-TE. 5. Measures of relationship: correlation (5%) Correlation coefficient; Pearson r; correlation and causation; proportion of common variance

Math 431 An Introduction to Probability. Final Exam Solutions

Probability Distributions

Geostatistics Exploratory Analysis

Discrete Mathematics and Probability Theory Fall 2009 Satish Rao, David Tse Note 18. A Brief Introduction to Continuous Probability

Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY

Chapter 5 Discrete Probability Distribution. Learning objectives

Math 370, Actuarial Problemsolving Spring 2008 A.J. Hildebrand. Practice Test, 1/28/2008 (with solutions)

Wald s Identity. by Jeffery Hein. Dartmouth College, Math 100

Probability density function : An arbitrary continuous random variable X is similarly described by its probability density function f x = f X

Financial Assets Behaving Badly The Case of High Yield Bonds. Chris Kantos Newport Seminar June 2013

E3: PROBABILITY AND STATISTICS lecture notes

The Normal Distribution. Alan T. Arnholt Department of Mathematical Sciences Appalachian State University

Binomial lattice model for stock prices

Introduction to Probability

Figure 1. A typical Laboratory Thermometer graduated in C.

Statistics, Probability and Noise

PLANNING PROBLEMS OF A GAMBLING-HOUSE WITH APPLICATION TO INSURANCE BUSINESS. Stockholm

Statistical tests for SPSS

Description. Textbook. Grading. Objective

Data, Measurements, Features

Lecture Notes 1. Brief Review of Basic Probability

Financial Mathematics and Simulation MATH Spring 2011 Homework 2

Pricing and calibration in local volatility models via fast quantization

individualdifferences

EXIT TIME PROBLEMS AND ESCAPE FROM A POTENTIAL WELL

Review of Random Variables

ST 371 (IV): Discrete Random Variables

STAT 360 Probability and Statistics. Fall 2012

The Behavior of Bonds and Interest Rates. An Impossible Bond Pricing Model. 780 w Interest Rate Models

Transcription:

Important Facts about Random Variables Friday, February 01, 2013 2:01 PM Homework 1 hopefully posted over the weekend. Two more fundamental concepts about random variables. Often it is useful to summarize a random variable not in complete detail (as would be obtained from a PDF or CDF) but in terms of numbers. The most important summary descriptor of a random variable Y is the mean or expectation. The second most important statistic of a random variable is its variance: The meaning of variance is more transparent in terms of its square root, which is called the Standard deviation This is the root-mean-square deviation of the random variable about its mean. This describes, in the most basic way, the variability of the random variable. AppSDE13 Page 1

Examples of Random Variables Uniform distribution The discontinuity of the PDF plays no role because anything involving the PDF involves integration and discontinuities are harmless when integrated. Exponential distribution AppSDE13 Page 2

If one wanted to go beyond mean and standard deviation to encode, i.e., asymmetries of the probability distribution about the mean, then one could go to a third order moment, and develop a skewness from that. And one can go to arbitrarily high order moments, but doing this only makes sense in the context of particular questions and applications -- the first two moments (mean and variance) are by far the most fundamental. Moments: Actually cumulants are a more efficient way to represent higher order information relative to moments, i.e., removing redundancies. Simplest example is that the variance is the second order cumulant. Normal (Gaussian) distribution AppSDE13 Page 3

Let's start thinking about dealing with a family of random variables. The simplest setting is when these random variables are independent. Intuitively, this means the values that one of the random variables take is completely unrelated to the values that the other random variables will take. AppSDE13 Page 4

A collection of random variables Are said to be identically distributed when each random variable in the collection has the same probability distribution. In this case, we simply say they all have a common probability distribution: Or in terms of probability density Central Limit Theorem Under some mild conditions (PDFs of the random variables should decay fast enough -- Lindeberg and Lyapunov conditions) sums of many independent, identically distributed random variables can be well approximated by a Gaussian distribution. In fact, this result can be generalized to allow weak correlation between the random variables, but the conditions for this are technical (mixing, etc.). Also the random variables don't need to be quite identically distributed -- so long as their sizes are not too disparate. Simplest Model for Brownian Motion Single Brownian particle, observed at discrete moments of time. AppSDE13 Page 5

Model: What properties should the random variables Z n have? Mean zero because the solvent is isotropic. Identically distributed: time homogeneity. Independence: This relies on separation of time scales. On the one hand, the water molecules themselves have rather short mean free paths, meaning they change direction fairly rapidly And surely the time between experimental observations is long compared to this. And this would surely justify saying that the force the Brownian particle experiences during one experimental interval is independent of the next. But be careful -- the particle has inertia, and so its response (as manifested by the kick random variables) takes place over a longer time, and for micron size particles, this is So this implies that the changes in the motion to the Brownian particle are roughly independent over successive time intervals if the length of the time interval satisfies A priori, we don't have a basis for assuming the probability distribution for the random kicks Z n. But consider the following argument: Choose a time scale AppSDE13 Page 6

We could have formulated a simple model just as above, but with the time interval between successive "observations" as All the above assumptions apply to this model (Z n is mean zero and iid). And therefore by the central limit theorem, we see that Z n should be well approximated by a Gaussian distribution. AppSDE13 Page 7