Statistics with Matlab for Engineers
|
|
- Mariah McCormick
- 7 years ago
- Views:
Transcription
1 Statistics with Matlab for Engineers Paul Razafimandimby 1 1 Montanuniversität Leoben October 6, 2015 Contents 1 Introduction 1 2 Probability concepts: a short review Basic definitions Independence concept Parameters of a random variable Frequently used distribution Two important Limit Theorems in probability Inferential Statistics: Parameter Estimation Sampling concepts and distributions Parameter Estimation Introduction Roughly speaking, Statistics is the science of gaining knowledge from numerical and categorical data. It deals with the collection, analysis, interpretation and drawing conclusion from collected data. A population is basically the collection or set of all individuals under consideration in a statistical study. A sample is a part of the part or subset of the population from which information is collected. One can distinguish two branches of Statistics. 1. Descriptive Statistics is the methodology of organizing and summarizing information. This branch of statistics deals with the construction of the distribution of the sample/population (calculation of frequency), the visualization of data (graphs, charts, histograms), and the calculation of various descriptive measures (averages, standard deviation, percentiles). 1
2 2. Inferential Statistics is a science of drawing and measuring the reliability of conclusions about population based on information collected from a sample of population. Inferential statistics deals with point estimation, interval estimation and hypothesis testing which rely very much on probability theory. Descriptive and inferential statistics are interrelated in that before inferring conclusion from the statistical investigation it is necessary to organize and summarize the information collected from a sample. Moreover, the knowledge from the descriptive statistics usually suggests the appropriate method or approach to be used for the inferential statistics. In a statistical study, either it is a descriptive or inferential, the property of a population is usually described by numerical parameters. In many cases these parameters are unknown and a statistical study are very often oriented to the investigation/estimation of these parameters. For this purpose, one usually uses statistical samples to make inference about these unknown parameters. Numerical values calculated from and characterizing a statistical sample is called a statistic and they are used to make inference about the unknown parameters of the whole population. Statistics finds its applications in numerous applied sciences, among others, economics, political science, medicine. Of course, Statistics play an important role in many branches of Engineering sciences. For instance, assuming that a factory producing use the same equipment, the raw materials and the methods of production, then using statistics we can infer about the qualities of the light bulbs produced in the future. 2 Probability concepts: a short review 2.1 Basic definitions The estimation of the population parameters leads the statistician to investigate the statistic of (random) experiment whose outcome cannot be predicted with certainty and would very likely to change if the experiment is repeated. The set of all possible outcomes of a random experiment is called the sample space. Flipping a coin and rolling a die are examples of random experiment and the sample spaces are respectively Ω 1 = {T, H} and Ω 2 = {1, 2, 3, 4, 5, 6}. An example of a random experiment in Engineering is the determination of the probability of piston failure in each leg of steam-driven compressors. The sample space is Ω 3 = {0, 1} where 0 indicates a piston non-failure and 1 a failure. A random variable X is a numerical function defined on the sample space, very often we call it as the outcomes from random experiments. When rolling a die, a random variable X may represent the number of dots on the upper face. In the case of the observation of piston failure, X may represent the 2
3 number of failures of piston in a compressor. So far we have only enumerated examples of discrete random variables, i.e., random variables that take on values from a finite or countable infinite set if numbers, but there are also random variable that can take on values from an interval of the real numbers. We call the latter continuous random variables. An example of a continuous random variable is the tensile strength (in kg/m 3 ) of cement by a cement factory. An event is a subset of outcomes in the sample space, e.g. the tensile strength of cement is in the range of [40, 50]. An event itself can be the union of events, for example, the number of dots on the upper face of a die is odd. The probability measures the likelihood of an event to occur in a random experiment. It also measures the likelihood of a random variable X to takes on an observed value x or to be in the range of observed values x < y, i.e, X = x or x X y. Three important axioms of probability is given below. AXIOM 1: The probability of any event E is always between 0 and 1: P(E) [0, 1]. AXIOM 2: The probability of the sample space Ω is 1: P(Ω) = 1. AXIOM 3: For mutually disjoint event E 1,..., E k we have P(E 1 E k ) = k P(E i ). A null probability indicates that an event is impossible and an event with probability one is a sure event: obtaining 7 dots when rolling a die is an impossible event and obtaining dots in {1, 2, 3, 4, 5, 6} is a sure event. The probability distribution of a random variable X is a function describing the probabilities associated to each possible value of X. To determine the PD of a random variable we can use either the equal likelihood or the relative frequency model. When the n outcomes of a random experiment has the same likelihood/equally likely to appear then we assign to each outcome the probability value 1/n. This is the case of rolling a fair die: let X be the number of dots observed on the upper face of a fair die. The probability distribution in this experiment is f (x) = P(X = x) = 1/6, for any x {1, 2, 3, 4, 5, 6}. When the outcomes do not have the same chance to occur, then we conduct the experiment n times and denote by f the frequency of a particular event 3
4 E during our experience. In this case, we can assign to the event E the probability f /n. Another way of determine the probability of an event E is to use a probability density function (pdf) or a probability mass function (pmf) when we respectively deal with a continuous and discrete random variable. We have already seen an example of pmf. In the case of continuous random variable X with probability function f (x), the probability that the falls within the interval [x 1, x 2 ] is P(x 1 X x 2 ) = x2 x 1 f (x)dx. Note that: 1. a pmf or a pdf takes on non-negative values. 2. If the sample space of a discrete random variable X with pmf f (x) is {x 1,..., x n }, then n f (x i) = Let X be a continuous random variable with pdf f (x). If X does not takes on values within the interval [x 1, x 2 ], then P(x 1 X x 2 ) = x2 x 1 f (x)dx = Let X be a continuous random variable with pdf f (x). Then, f (x)dx = 1. In many realistic situation it is more practical to use the cumulative distribution function F(x) which is defined by F(x) = P(X x) = x f (r)dr, where f (x) is the probability density function of the continuous random variable X. 2.2 Independence concept Two events E 1 and E 2 are independent if the occurrence of E 1 does not affect the occurrence of E 2, and vice verse. In mathematical term, they are independent if their joint probability is equal to the product of their probabilities, i.e., P(E 1 E 2 ) = P(E 1 ) P(E 2 ). This definition can be generalized to any number of events. 4
5 In a similar way, we define the independence of two random variables Y and Y. Let E 1 and E 2 be any sets in the range of the random variables X and Y, respectively. Then, X and Y are independent iff P[(X E 1 ) (Y E 2 )] = P(X E 1 )P(Y E 2 ). In other words, two random variables X and Y are independent if their joint probability density (mass) function is the product of the pdf/pfm: f (x, y) = f (x)g(y), where f (x) and g(y) are the pdf/pmf of X and Y, respectively. 2.3 Parameters of a random variable We will now give various formula and interpretation of various parameters associated to a random variable X. Mean and variance of a random variable The mean of a discrete random variable X is µ = E(X) = x i f (x i ). It provides a central tendency of the distribution: we would expect that the average of many observed values of a random variable will be close to the mean. Assuming that µ <, then its variance is σ 2 = V(X) = E[(X µ) 2 ] = (x i µ) 2 f (x i ). One important parameter associated to a random variable is its standard deviation denoted by σ and defined by σ = V(X). The variance or standard deviation measures the dispersion of a distribution. An observed value of a random variable having small standard deviation is more likely to be closer the mean µ. For a continuous random variable X with pdf f (x), we have the following formula for the mean and the variance µ =E(X) = σ 2 =V(X) = x f (x)dx, (x µ) 2 f (x)dx. 5
6 r-th moment and r-th central moment of a random variable With the definition of the mean above, we define the r-th moment and the r-th central moment of a random as follows Skewness µ r =E(X r ), µ r =E[(X µ) r ]. The coefficient of skewness γ 1, which is associated to the third central moment µ 3, is used to measure the asymmetry or skewness and is given by γ 1 = µ 3 µ A negative (resp. positive) coefficient of skewness means that the distribution is skewed to the left (resp. to teh right). For a symmetric distribution, we have γ 3 = 0. (N.B., γ 3 = 0 does not in general imply that a distribution is symmetric.) Kurtosis The Kurtosis measure the peakedness/flatness of a distribution near its center. It also measures the departure of the distribution from normality. Its formula is given by γ 2 = µ 4 µ 2. 2 if γ 2 > 3, then the distribution have more values in the vicinity of the mean (more peaked than the normal distribution). A kurtosis less than 3 indicates that the distribution is flatter than the normal. 2.4 Frequently used distribution Binomial distribution Assume that the sample space of an experiment contains only two elements, say {0, 1}. In this case, we can define a probability mass function as follows f (0) =P(X = 0) = 1 p, f (1) =P(X = 1) = p, where p is the probability of an outcome X = 1. A random variable whose pmf is defined as above is called a Bernoulli random variable. 6
7 When repeating this experiment for n independent trials, we obtain a Binomial random variable X where X denotes the number of 1 in these n trials. The pmf for X is given by f (x; n, p) = P(X = x) = ( n x ) p x (1 p) n x ; x = 0, 1, 2,..., n, where ( n x ) = n! x!(n x)! and x! the factorial of a non-negative integer x. Straightforward calculation showed that E(X) =np, V(X) =np(1 p). Example: A manufacturer of light bulbs finds that on average 5% are defective. To monitor the manufacturing process, they take a random sample size of 100. If the sample contains more than five defective light bulbs, then the production must be stopped. What is the probability that the process is stopped? Poisson distribution A discrete random variable X is a Poisson random variable with parameter λ > 0 iff its pmf is We have f (x; λ) = P(X = x) = e λ λx E(X) = λ, V(X) = λ. ; x = 0, 1,.... x! Example: What is the probability that a page have at least 2 typos if the typographical errors per page follows the Poisson distribution with parameter λ = 0.25? Uniform distribution One of the most important distributions is the uniform distribution for continuous random variable. A continuous random variable X with values on a interval (a, b) follows the uniform distribution iff its pdf is given by We have f (x; a, b) = 1 b a ; a < x < b. E(X) = a + b 2, V(X) = (b a)
8 Normal distribution A continuous random variable X follows a normal or Gaussian distribution and we denote X N(µ, σ 2 ) iff its pdf is defined by where f (x; µ, σ 2 ) = 1 σ (x µ)2 exp{ 2π 2σ 2 }, x (, ), µ (, ), σ 2 > 0. A normal distribution is determined by its parameters µ and σ 2 and We have the following properties: (N1) lim x f (x; µ, σ 2 ) = 0, E(X) =µ, V(X) =σ 2. (N2) The pdf f (x; µ, σ 2 ) attains its maximum value at x = µ. (N3) The pdf f (x; µ, σ 2 ) is symmetric about the mean µ. A random variable X such that X N(0, 1) is called a standard normal random variable. Normal distribution is frequently used in statistics and engineering. Exponential distribution A continuous random variable follows an exponential distribution with parameter λ iff its pdf is defined by We have f (x; λ) = λe λx ; X 0, λ > 0. E(X) = 1 λ, V(X) = 1 λ 2. Exponential random variables are used to describe (i) the time between arrivals of telephone calls: in this case, λ is a rate with units of arrivals per time period; (ii) the time until a machine part fails and λ is failure rate. Example: The time between arrivals of telephone calls at a switchboard follows an exponential distribution with a mean 12 seconds. What is the probability that the time between arrivals is 10 seconds or less? 8
9 The Gamma and Chi-Square distributions In this part we will review a generalization of the exponential distribution. A random variable X follows the Gamma distribution with parameters λ > 0 and t > 0 iff its pdf is given by f (x; λ, t) = λe λx (λx) t 1 ; x 0, Γ(t) where the Gamma function Γ is defined by We have Γ(x) = 0 t x 1 e t dt, x > 0. E(X) = t λ, V(X) = t λ 2. When t is a positive integer the Gamma distribution can be used to model the amount of time one has to wiat until t events have occurred, if the inter-arrival times are exponentially distributed. The Gamma distribution is called a Chi-Square distribution with ν degrees of freedom when λ = 0.5 and t = ν/2 where ν is a positive integer. The pdf of a Chi-Square random variable is defined by f (x; ν) = 1 Γ (ν/2) ( 1 2 ) ν 2 x ν 2 1 e 1 2 x ; x 0. The Gamma distribution is a generalization of the exponential distribution in the sense that the former reduces to the latter when t = 1. Student s t distribution This kind of distribution is also frequently used in inferential statistics, especially when the sample size is small (usually less than 30). The pdf of a t distribution with degree of freedom nu is defined by We have f (x; ν) = 1 Γ( ν+1 πν Γ( ν 2 ) 2 ) ) (ν+1)/2 (1 + x2. ν E(X) =0, ν 2, V(X) = ν ν 2, ν 3. 9
10 The pdf of a t distribution is symmetric and bell-shaped and its is centered at 0, however, in contrast to the normal distribution it has a havier tails and a larger spread. We should notice that one can define a t distribution with ν degrees of freedom from a standard normal random variable Z and a chi-square random variable U by setting X = Z. νu 2.5 Two important Limit Theorems in probability In statistics we usually want to estimate the unknown mean of a given population. For this purpose we randomly choose a sample from the population and calculate its mean µ and its variance σ 2. We repeat this experiments n times by assuming that the trials are mutually independent and identically distributed. In this case we have formed n independent and identically distributed random variables X 1,... X 2. Two important theorems in probability give the behaviour of the mean S n = n X i n when n is becoming bigger and bigger. Theorem 2.1 (Law of Large Number). As n gets bigger and bigger, S n will get closer and closer to the theoretical mean µ. The next theorem gives the behaviour of the distribution of S n as n gets bigger. Theorem 2.2 (Central Limit Theorem). As n gets bigger and bigger, S n will be approximately normally distributed with mean µ and variance σ 2 /n. These theorem has many versions, but we simplify their statements so that they are accessible to non-mathematician. 3 Inferential Statistics: Parameter Estimation 3.1 Sampling concepts and distributions In what follows a random sample of size n is a sequence of independent and identically distributed (iid) X 1,..., X n, i.e., the X i -s are independent and they have a common probability density/mass f (x). As we have defined in the previous section, a population parameter (mean, variance, quantiles,... ) is in many instances unknown and the goal of inferential statistics is to use a random sample to estimate or make a statement about a unknown population parameter. A statistic is a function observed (known) random variables which is used as a point estimate for a population parameter, 10
11 to obtain a confidence interval for a parameter, as a test statistic in hypothesis testing. Before we move to the main subject of this section let us define several statistics frequently encountered in many applications. Sample Mean and Sample Variance The sample mean of a random sample of size n is given by X = 1 n The sample variance is defined by S 2 = 1 n 1 n X i. n (X i X) 2. The sample standard deviation is the square root of the sample variance. Sample Moments The r-th sample moment is defined by M r = 1 n The r-th central moment is given by M r = 1 n n n Xi r. (X i X) r. The following are the sample coefficient skewness and kurtosis γ 1 = M 3 M 3 2 2, γ 2 = M 4 M2 2. Note that as X 1,..., X n are random variables the above quantities are also random variables. The above statistics can be used to estimate the population parameters, for instance, X, S 2, γ 1 and γ 2 can be used to estimate the the mean, variance, the skewness coefficient and kurtosis of the population. But, since we are working with sample which are much smaller than the actual population, it is very likely that there are some errors in our estimate. To study the 11
12 efficiency of our estimate and to manage the uncertainty of our estimate, we must know the distribution of the statistic we use (and only then we can perform statistical hypothesis test and calculate confidence intervals.) For instance, if we know that our sample is normal, then, by a classical theorem in probability, its mean X is also normal. Anyway, whatever the distribution of the random sample is, when the sample is big enough (of size bigger than 30) the CLT theorem tells us that the mean will be approximately normally distributed. 3.2 Parameter Estimation One can use two types of methods to estimate a population parameter. A point estimation deals with the calculation from the sample a single value (point estimate) that, with high probability, will be close to the unknown population parameter. An interval estimation is a procedure which return a range of values (or interval) around the point estimate that, with a certain degree of confidence, will contain the population parameter. Point estimation A point estimator T for a parameter θ is function from all possible values of the sample data X i, i = 1,..., n. One of the main interest in point estimation is the measure of the performance of the estimation. To assess the estimators we have four criteria: bias, mean square error, efficiency, and standard error. 1. Bias: The bias measures the average error we have made in estimating θ by T, i.e., bias(t) = E(T θ). The estimator T is said to be unbiased if bias(t) = 0, i.e, E(T) = θ. 2. Mean Squared Error: The MSE of the estimator T is defined by MSE(T) = E[(T θ) 2 ]. It is a straightforward task to check that MSE(T) = V(T) + [bias(t)] Relative Efficiency: this criteria is used to compare estimators. Assume we have two estimators T 1 and T 2 for the same parameter θ. Then, the relative efficiency of T 1 to T 2 is defined by eff(t 1, T 2 ) = MSE(T 1) MSE(T 2 ). When eff(t 1, T 2 ) > 1, i.e., MSE(T 2 ) > MSE(T 1 ), then T 1 is more efficient than T 2. 12
13 4. Standard Error is the square root of the variance of the estimator T. We have said that the sample mean X is an estimator of the population mean µ. But how precise is this estimation? By CLT, we have V( X) = 1 n σ2, thus the standard error ot X is SE( X) = σ n. If the standard deviation σ is unknown, then we can derive an estimate of the standard error ŜE( X) by using an estimator of σ. Method of Moments 13
Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)
Summary of Formulas and Concepts Descriptive Statistics (Ch. 1-4) Definitions Population: The complete set of numerical information on a particular quantity in which an investigator is interested. We assume
More informationBNG 202 Biomechanics Lab. Descriptive statistics and probability distributions I
BNG 202 Biomechanics Lab Descriptive statistics and probability distributions I Overview The overall goal of this short course in statistics is to provide an introduction to descriptive and inferential
More informationProbability and Statistics Vocabulary List (Definitions for Middle School Teachers)
Probability and Statistics Vocabulary List (Definitions for Middle School Teachers) B Bar graph a diagram representing the frequency distribution for nominal or discrete data. It consists of a sequence
More informationAn Introduction to Basic Statistics and Probability
An Introduction to Basic Statistics and Probability Shenek Heyward NCSU An Introduction to Basic Statistics and Probability p. 1/4 Outline Basic probability concepts Conditional probability Discrete Random
More informationRandom variables, probability distributions, binomial random variable
Week 4 lecture notes. WEEK 4 page 1 Random variables, probability distributions, binomial random variable Eample 1 : Consider the eperiment of flipping a fair coin three times. The number of tails that
More informationE3: PROBABILITY AND STATISTICS lecture notes
E3: PROBABILITY AND STATISTICS lecture notes 2 Contents 1 PROBABILITY THEORY 7 1.1 Experiments and random events............................ 7 1.2 Certain event. Impossible event............................
More informationStatistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013
Statistics I for QBIC Text Book: Biostatistics, 10 th edition, by Daniel & Cross Contents and Objectives Chapters 1 7 Revised: August 2013 Chapter 1: Nature of Statistics (sections 1.1-1.6) Objectives
More informationImportant Probability Distributions OPRE 6301
Important Probability Distributions OPRE 6301 Important Distributions... Certain probability distributions occur with such regularity in real-life applications that they have been given their own names.
More informationData Modeling & Analysis Techniques. Probability & Statistics. Manfred Huber 2011 1
Data Modeling & Analysis Techniques Probability & Statistics Manfred Huber 2011 1 Probability and Statistics Probability and statistics are often used interchangeably but are different, related fields
More information4. Continuous Random Variables, the Pareto and Normal Distributions
4. Continuous Random Variables, the Pareto and Normal Distributions A continuous random variable X can take any value in a given range (e.g. height, weight, age). The distribution of a continuous random
More informationCA200 Quantitative Analysis for Business Decisions. File name: CA200_Section_04A_StatisticsIntroduction
CA200 Quantitative Analysis for Business Decisions File name: CA200_Section_04A_StatisticsIntroduction Table of Contents 4. Introduction to Statistics... 1 4.1 Overview... 3 4.2 Discrete or continuous
More informationExploratory Data Analysis
Exploratory Data Analysis Johannes Schauer johannes.schauer@tugraz.at Institute of Statistics Graz University of Technology Steyrergasse 17/IV, 8010 Graz www.statistics.tugraz.at February 12, 2008 Introduction
More informationInstitute of Actuaries of India Subject CT3 Probability and Mathematical Statistics
Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics For 2015 Examinations Aim The aim of the Probability and Mathematical Statistics subject is to provide a grounding in
More informationDescriptive Statistics. Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion
Descriptive Statistics Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion Statistics as a Tool for LIS Research Importance of statistics in research
More informationWhat is Statistics? Lecture 1. Introduction and probability review. Idea of parametric inference
0. 1. Introduction and probability review 1.1. What is Statistics? What is Statistics? Lecture 1. Introduction and probability review There are many definitions: I will use A set of principle and procedures
More informationFor a partition B 1,..., B n, where B i B j = for i. A = (A B 1 ) (A B 2 ),..., (A B n ) and thus. P (A) = P (A B i ) = P (A B i )P (B i )
Probability Review 15.075 Cynthia Rudin A probability space, defined by Kolmogorov (1903-1987) consists of: A set of outcomes S, e.g., for the roll of a die, S = {1, 2, 3, 4, 5, 6}, 1 1 2 1 6 for the roll
More informationProbability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur
Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur Module No. #01 Lecture No. #15 Special Distributions-VI Today, I am going to introduce
More informationSums of Independent Random Variables
Chapter 7 Sums of Independent Random Variables 7.1 Sums of Discrete Random Variables In this chapter we turn to the important question of determining the distribution of a sum of independent random variables
More informationA Primer on Mathematical Statistics and Univariate Distributions; The Normal Distribution; The GLM with the Normal Distribution
A Primer on Mathematical Statistics and Univariate Distributions; The Normal Distribution; The GLM with the Normal Distribution PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 4: September
More informationMASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 5 9/17/2008 RANDOM VARIABLES
MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 5 9/17/2008 RANDOM VARIABLES Contents 1. Random variables and measurable functions 2. Cumulative distribution functions 3. Discrete
More informationMBA 611 STATISTICS AND QUANTITATIVE METHODS
MBA 611 STATISTICS AND QUANTITATIVE METHODS Part I. Review of Basic Statistics (Chapters 1-11) A. Introduction (Chapter 1) Uncertainty: Decisions are often based on incomplete information from uncertain
More informationChapter 5. Random variables
Random variables random variable numerical variable whose value is the outcome of some probabilistic experiment; we use uppercase letters, like X, to denote such a variable and lowercase letters, like
More informationPeople have thought about, and defined, probability in different ways. important to note the consequences of the definition:
PROBABILITY AND LIKELIHOOD, A BRIEF INTRODUCTION IN SUPPORT OF A COURSE ON MOLECULAR EVOLUTION (BIOL 3046) Probability The subject of PROBABILITY is a branch of mathematics dedicated to building models
More informationFairfield Public Schools
Mathematics Fairfield Public Schools AP Statistics AP Statistics BOE Approved 04/08/2014 1 AP STATISTICS Critical Areas of Focus AP Statistics is a rigorous course that offers advanced students an opportunity
More informationST 371 (IV): Discrete Random Variables
ST 371 (IV): Discrete Random Variables 1 Random Variables A random variable (rv) is a function that is defined on the sample space of the experiment and that assigns a numerical variable to each possible
More informationA review of the portions of probability useful for understanding experimental design and analysis.
Chapter 3 Review of Probability A review of the portions of probability useful for understanding experimental design and analysis. The material in this section is intended as a review of the topic of probability
More informationThe sample space for a pair of die rolls is the set. The sample space for a random number between 0 and 1 is the interval [0, 1].
Probability Theory Probability Spaces and Events Consider a random experiment with several possible outcomes. For example, we might roll a pair of dice, flip a coin three times, or choose a random real
More informationPROBABILITY AND SAMPLING DISTRIBUTIONS
PROBABILITY AND SAMPLING DISTRIBUTIONS SEEMA JAGGI AND P.K. BATRA Indian Agricultural Statistics Research Institute Library Avenue, New Delhi - 0 0 seema@iasri.res.in. Introduction The concept of probability
More informationHow To Write A Data Analysis
Mathematics Probability and Statistics Curriculum Guide Revised 2010 This page is intentionally left blank. Introduction The Mathematics Curriculum Guide serves as a guide for teachers when planning instruction
More informationProbability density function : An arbitrary continuous random variable X is similarly described by its probability density function f x = f X
Week 6 notes : Continuous random variables and their probability densities WEEK 6 page 1 uniform, normal, gamma, exponential,chi-squared distributions, normal approx'n to the binomial Uniform [,1] random
More informationChapter 4. Probability and Probability Distributions
Chapter 4. robability and robability Distributions Importance of Knowing robability To know whether a sample is not identical to the population from which it was selected, it is necessary to assess the
More informationChapter 3: DISCRETE RANDOM VARIABLES AND PROBABILITY DISTRIBUTIONS. Part 3: Discrete Uniform Distribution Binomial Distribution
Chapter 3: DISCRETE RANDOM VARIABLES AND PROBABILITY DISTRIBUTIONS Part 3: Discrete Uniform Distribution Binomial Distribution Sections 3-5, 3-6 Special discrete random variable distributions we will cover
More informationDefinition: Suppose that two random variables, either continuous or discrete, X and Y have joint density
HW MATH 461/561 Lecture Notes 15 1 Definition: Suppose that two random variables, either continuous or discrete, X and Y have joint density and marginal densities f(x, y), (x, y) Λ X,Y f X (x), x Λ X,
More informationMath/Stats 425 Introduction to Probability. 1. Uncertainty and the axioms of probability
Math/Stats 425 Introduction to Probability 1. Uncertainty and the axioms of probability Processes in the real world are random if outcomes cannot be predicted with certainty. Example: coin tossing, stock
More information6.4 Normal Distribution
Contents 6.4 Normal Distribution....................... 381 6.4.1 Characteristics of the Normal Distribution....... 381 6.4.2 The Standardized Normal Distribution......... 385 6.4.3 Meaning of Areas under
More informationIntroduction to Probability
Introduction to Probability EE 179, Lecture 15, Handout #24 Probability theory gives a mathematical characterization for experiments with random outcomes. coin toss life of lightbulb binary data sequence
More information5.1 Identifying the Target Parameter
University of California, Davis Department of Statistics Summer Session II Statistics 13 August 20, 2012 Date of latest update: August 20 Lecture 5: Estimation with Confidence intervals 5.1 Identifying
More informationMATH4427 Notebook 2 Spring 2016. 2 MATH4427 Notebook 2 3. 2.1 Definitions and Examples... 3. 2.2 Performance Measures for Estimators...
MATH4427 Notebook 2 Spring 2016 prepared by Professor Jenny Baglivo c Copyright 2009-2016 by Jenny A. Baglivo. All Rights Reserved. Contents 2 MATH4427 Notebook 2 3 2.1 Definitions and Examples...................................
More informationBasics of Statistical Machine Learning
CS761 Spring 2013 Advanced Machine Learning Basics of Statistical Machine Learning Lecturer: Xiaojin Zhu jerryzhu@cs.wisc.edu Modern machine learning is rooted in statistics. You will find many familiar
More informationChapter 4 Lecture Notes
Chapter 4 Lecture Notes Random Variables October 27, 2015 1 Section 4.1 Random Variables A random variable is typically a real-valued function defined on the sample space of some experiment. For instance,
More informationNotes on Continuous Random Variables
Notes on Continuous Random Variables Continuous random variables are random quantities that are measured on a continuous scale. They can usually take on any value over some interval, which distinguishes
More informationYou flip a fair coin four times, what is the probability that you obtain three heads.
Handout 4: Binomial Distribution Reading Assignment: Chapter 5 In the previous handout, we looked at continuous random variables and calculating probabilities and percentiles for those type of variables.
More information1 Prior Probability and Posterior Probability
Math 541: Statistical Theory II Bayesian Approach to Parameter Estimation Lecturer: Songfeng Zheng 1 Prior Probability and Posterior Probability Consider now a problem of statistical inference in which
More informationUNIT I: RANDOM VARIABLES PART- A -TWO MARKS
UNIT I: RANDOM VARIABLES PART- A -TWO MARKS 1. Given the probability density function of a continuous random variable X as follows f(x) = 6x (1-x) 0
More informationProbability Theory. Florian Herzog. A random variable is neither random nor variable. Gian-Carlo Rota, M.I.T..
Probability Theory A random variable is neither random nor variable. Gian-Carlo Rota, M.I.T.. Florian Herzog 2013 Probability space Probability space A probability space W is a unique triple W = {Ω, F,
More informationDescriptive Statistics
Y520 Robert S Michael Goal: Learn to calculate indicators and construct graphs that summarize and describe a large quantity of values. Using the textbook readings and other resources listed on the web
More informationCharacteristics of Binomial Distributions
Lesson2 Characteristics of Binomial Distributions In the last lesson, you constructed several binomial distributions, observed their shapes, and estimated their means and standard deviations. In Investigation
More informationDongfeng Li. Autumn 2010
Autumn 2010 Chapter Contents Some statistics background; ; Comparing means and proportions; variance. Students should master the basic concepts, descriptive statistics measures and graphs, basic hypothesis
More information3.4. The Binomial Probability Distribution. Copyright Cengage Learning. All rights reserved.
3.4 The Binomial Probability Distribution Copyright Cengage Learning. All rights reserved. The Binomial Probability Distribution There are many experiments that conform either exactly or approximately
More informationMaster s Theory Exam Spring 2006
Spring 2006 This exam contains 7 questions. You should attempt them all. Each question is divided into parts to help lead you through the material. You should attempt to complete as much of each problem
More informationMATH 10: Elementary Statistics and Probability Chapter 5: Continuous Random Variables
MATH 10: Elementary Statistics and Probability Chapter 5: Continuous Random Variables Tony Pourmohamad Department of Mathematics De Anza College Spring 2015 Objectives By the end of this set of slides,
More informationBASIC STATISTICAL METHODS FOR GENOMIC DATA ANALYSIS
BASIC STATISTICAL METHODS FOR GENOMIC DATA ANALYSIS SEEMA JAGGI Indian Agricultural Statistics Research Institute Library Avenue, New Delhi-110 012 seema@iasri.res.in Genomics A genome is an organism s
More informationQuantitative Methods for Finance
Quantitative Methods for Finance Module 1: The Time Value of Money 1 Learning how to interpret interest rates as required rates of return, discount rates, or opportunity costs. 2 Learning how to explain
More informationTHE CENTRAL LIMIT THEOREM TORONTO
THE CENTRAL LIMIT THEOREM DANIEL RÜDT UNIVERSITY OF TORONTO MARCH, 2010 Contents 1 Introduction 1 2 Mathematical Background 3 3 The Central Limit Theorem 4 4 Examples 4 4.1 Roulette......................................
More informationSTT315 Chapter 4 Random Variables & Probability Distributions KM. Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables
Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables Discrete vs. continuous random variables Examples of continuous distributions o Uniform o Exponential o Normal Recall: A random
More information5/31/2013. 6.1 Normal Distributions. Normal Distributions. Chapter 6. Distribution. The Normal Distribution. Outline. Objectives.
The Normal Distribution C H 6A P T E R The Normal Distribution Outline 6 1 6 2 Applications of the Normal Distribution 6 3 The Central Limit Theorem 6 4 The Normal Approximation to the Binomial Distribution
More informationStandard Deviation Estimator
CSS.com Chapter 905 Standard Deviation Estimator Introduction Even though it is not of primary interest, an estimate of the standard deviation (SD) is needed when calculating the power or sample size of
More informationLecture 3: Continuous distributions, expected value & mean, variance, the normal distribution
Lecture 3: Continuous distributions, expected value & mean, variance, the normal distribution 8 October 2007 In this lecture we ll learn the following: 1. how continuous probability distributions differ
More informationLecture 1 Introduction Properties of Probability Methods of Enumeration Asrat Temesgen Stockholm University
Lecture 1 Introduction Properties of Probability Methods of Enumeration Asrat Temesgen Stockholm University 1 Chapter 1 Probability 1.1 Basic Concepts In the study of statistics, we consider experiments
More informationReview of Random Variables
Chapter 1 Review of Random Variables Updated: January 16, 2015 This chapter reviews basic probability concepts that are necessary for the modeling and statistical analysis of financial data. 1.1 Random
More informationNonparametric adaptive age replacement with a one-cycle criterion
Nonparametric adaptive age replacement with a one-cycle criterion P. Coolen-Schrijner, F.P.A. Coolen Department of Mathematical Sciences University of Durham, Durham, DH1 3LE, UK e-mail: Pauline.Schrijner@durham.ac.uk
More information99.37, 99.38, 99.38, 99.39, 99.39, 99.39, 99.39, 99.40, 99.41, 99.42 cm
Error Analysis and the Gaussian Distribution In experimental science theory lives or dies based on the results of experimental evidence and thus the analysis of this evidence is a critical part of the
More informationRandom variables P(X = 3) = P(X = 3) = 1 8, P(X = 1) = P(X = 1) = 3 8.
Random variables Remark on Notations 1. When X is a number chosen uniformly from a data set, What I call P(X = k) is called Freq[k, X] in the courseware. 2. When X is a random variable, what I call F ()
More informationIEOR 6711: Stochastic Models I Fall 2012, Professor Whitt, Tuesday, September 11 Normal Approximations and the Central Limit Theorem
IEOR 6711: Stochastic Models I Fall 2012, Professor Whitt, Tuesday, September 11 Normal Approximations and the Central Limit Theorem Time on my hands: Coin tosses. Problem Formulation: Suppose that I have
More information1.1 Introduction, and Review of Probability Theory... 3. 1.1.1 Random Variable, Range, Types of Random Variables... 3. 1.1.2 CDF, PDF, Quantiles...
MATH4427 Notebook 1 Spring 2016 prepared by Professor Jenny Baglivo c Copyright 2009-2016 by Jenny A. Baglivo. All Rights Reserved. Contents 1 MATH4427 Notebook 1 3 1.1 Introduction, and Review of Probability
More informationCHAPTER 6: Continuous Uniform Distribution: 6.1. Definition: The density function of the continuous random variable X on the interval [A, B] is.
Some Continuous Probability Distributions CHAPTER 6: Continuous Uniform Distribution: 6. Definition: The density function of the continuous random variable X on the interval [A, B] is B A A x B f(x; A,
More informationJoint Exam 1/P Sample Exam 1
Joint Exam 1/P Sample Exam 1 Take this practice exam under strict exam conditions: Set a timer for 3 hours; Do not stop the timer for restroom breaks; Do not look at your notes. If you believe a question
More informationWhy Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization. Learning Goals. GENOME 560, Spring 2012
Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization GENOME 560, Spring 2012 Data are interesting because they help us understand the world Genomics: Massive Amounts
More informationCurriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools 2009-2010
Curriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools 2009-2010 Week 1 Week 2 14.0 Students organize and describe distributions of data by using a number of different
More informationProbability and statistics; Rehearsal for pattern recognition
Probability and statistics; Rehearsal for pattern recognition Václav Hlaváč Czech Technical University in Prague Faculty of Electrical Engineering, Department of Cybernetics Center for Machine Perception
More informationLecture 5 : The Poisson Distribution
Lecture 5 : The Poisson Distribution Jonathan Marchini November 10, 2008 1 Introduction Many experimental situations occur in which we observe the counts of events within a set unit of time, area, volume,
More informationATM 552 Notes: Review of Statistics Page 1
ATM 552 Notes: Review of Statistics Page 1 Class Notes: ATM 552 Objective Analysis 1. Review of Basic Statistics We will review a few features of statistics that come up frequently in objective analysis
More informationMAS131: Introduction to Probability and Statistics Semester 1: Introduction to Probability Lecturer: Dr D J Wilkinson
MAS131: Introduction to Probability and Statistics Semester 1: Introduction to Probability Lecturer: Dr D J Wilkinson Statistics is concerned with making inferences about the way the world is, based upon
More informationA Tutorial on Probability Theory
Paola Sebastiani Department of Mathematics and Statistics University of Massachusetts at Amherst Corresponding Author: Paola Sebastiani. Department of Mathematics and Statistics, University of Massachusetts,
More informationPractice problems for Homework 11 - Point Estimation
Practice problems for Homework 11 - Point Estimation 1. (10 marks) Suppose we want to select a random sample of size 5 from the current CS 3341 students. Which of the following strategies is the best:
More informationSimulation Exercises to Reinforce the Foundations of Statistical Thinking in Online Classes
Simulation Exercises to Reinforce the Foundations of Statistical Thinking in Online Classes Simcha Pollack, Ph.D. St. John s University Tobin College of Business Queens, NY, 11439 pollacks@stjohns.edu
More informationMaximum Likelihood Estimation
Math 541: Statistical Theory II Lecturer: Songfeng Zheng Maximum Likelihood Estimation 1 Maximum Likelihood Estimation Maximum likelihood is a relatively simple method of constructing an estimator for
More informationCOMMON CORE STATE STANDARDS FOR
COMMON CORE STATE STANDARDS FOR Mathematics (CCSSM) High School Statistics and Probability Mathematics High School Statistics and Probability Decisions or predictions are often based on data numbers in
More informationBayesian Analysis for the Social Sciences
Bayesian Analysis for the Social Sciences Simon Jackman Stanford University http://jackman.stanford.edu/bass November 9, 2012 Simon Jackman (Stanford) Bayesian Analysis for the Social Sciences November
More informationDescriptive statistics Statistical inference statistical inference, statistical induction and inferential statistics
Descriptive statistics is the discipline of quantitatively describing the main features of a collection of data. Descriptive statistics are distinguished from inferential statistics (or inductive statistics),
More informationSTAT 830 Convergence in Distribution
STAT 830 Convergence in Distribution Richard Lockhart Simon Fraser University STAT 830 Fall 2011 Richard Lockhart (Simon Fraser University) STAT 830 Convergence in Distribution STAT 830 Fall 2011 1 / 31
More informationSection 5 Part 2. Probability Distributions for Discrete Random Variables
Section 5 Part 2 Probability Distributions for Discrete Random Variables Review and Overview So far we ve covered the following probability and probability distribution topics Probability rules Probability
More informationSCHOOL OF ENGINEERING & BUILT ENVIRONMENT. Mathematics
SCHOOL OF ENGINEERING & BUILT ENVIRONMENT Mathematics Probability and Probability Distributions 1. Introduction 2. Probability 3. Basic rules of probability 4. Complementary events 5. Addition Law for
More informationLecture Notes Module 1
Lecture Notes Module 1 Study Populations A study population is a clearly defined collection of people, animals, plants, or objects. In psychological research, a study population usually consists of a specific
More informationSTAT 315: HOW TO CHOOSE A DISTRIBUTION FOR A RANDOM VARIABLE
STAT 315: HOW TO CHOOSE A DISTRIBUTION FOR A RANDOM VARIABLE TROY BUTLER 1. Random variables and distributions We are often presented with descriptions of problems involving some level of uncertainty about
More informationECE302 Spring 2006 HW3 Solutions February 2, 2006 1
ECE302 Spring 2006 HW3 Solutions February 2, 2006 1 Solutions to HW3 Note: Most of these solutions were generated by R. D. Yates and D. J. Goodman, the authors of our textbook. I have added comments in
More informationPrinciple of Data Reduction
Chapter 6 Principle of Data Reduction 6.1 Introduction An experimenter uses the information in a sample X 1,..., X n to make inferences about an unknown parameter θ. If the sample size n is large, then
More informationMath 461 Fall 2006 Test 2 Solutions
Math 461 Fall 2006 Test 2 Solutions Total points: 100. Do all questions. Explain all answers. No notes, books, or electronic devices. 1. [105+5 points] Assume X Exponential(λ). Justify the following two
More informationLecture 6: Discrete & Continuous Probability and Random Variables
Lecture 6: Discrete & Continuous Probability and Random Variables D. Alex Hughes Math Camp September 17, 2015 D. Alex Hughes (Math Camp) Lecture 6: Discrete & Continuous Probability and Random September
More informationTHE BINOMIAL DISTRIBUTION & PROBABILITY
REVISION SHEET STATISTICS 1 (MEI) THE BINOMIAL DISTRIBUTION & PROBABILITY The main ideas in this chapter are Probabilities based on selecting or arranging objects Probabilities based on the binomial distribution
More informationProbability Distributions
CHAPTER 6 Probability Distributions Calculator Note 6A: Computing Expected Value, Variance, and Standard Deviation from a Probability Distribution Table Using Lists to Compute Expected Value, Variance,
More informationII. DISTRIBUTIONS distribution normal distribution. standard scores
Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,
More informationDensity Curve. A density curve is the graph of a continuous probability distribution. It must satisfy the following properties:
Density Curve A density curve is the graph of a continuous probability distribution. It must satisfy the following properties: 1. The total area under the curve must equal 1. 2. Every point on the curve
More informationDescription. Textbook. Grading. Objective
EC151.02 Statistics for Business and Economics (MWF 8:00-8:50) Instructor: Chiu Yu Ko Office: 462D, 21 Campenalla Way Phone: 2-6093 Email: kocb@bc.edu Office Hours: by appointment Description This course
More informationLecture 19: Chapter 8, Section 1 Sampling Distributions: Proportions
Lecture 19: Chapter 8, Section 1 Sampling Distributions: Proportions Typical Inference Problem Definition of Sampling Distribution 3 Approaches to Understanding Sampling Dist. Applying 68-95-99.7 Rule
More informationChapter 3 RANDOM VARIATE GENERATION
Chapter 3 RANDOM VARIATE GENERATION In order to do a Monte Carlo simulation either by hand or by computer, techniques must be developed for generating values of random variables having known distributions.
More informationFoundation of Quantitative Data Analysis
Foundation of Quantitative Data Analysis Part 1: Data manipulation and descriptive statistics with SPSS/Excel HSRS #10 - October 17, 2013 Reference : A. Aczel, Complete Business Statistics. Chapters 1
More informationSKEWNESS. Measure of Dispersion tells us about the variation of the data set. Skewness tells us about the direction of variation of the data set.
SKEWNESS All about Skewness: Aim Definition Types of Skewness Measure of Skewness Example A fundamental task in many statistical analyses is to characterize the location and variability of a data set.
More informationMATH 140 Lab 4: Probability and the Standard Normal Distribution
MATH 140 Lab 4: Probability and the Standard Normal Distribution Problem 1. Flipping a Coin Problem In this problem, we want to simualte the process of flipping a fair coin 1000 times. Note that the outcomes
More informationNormal distribution. ) 2 /2σ. 2π σ
Normal distribution The normal distribution is the most widely known and used of all distributions. Because the normal distribution approximates many natural phenomena so well, it has developed into a
More information