Exercise 2: Numerical Analysis and Simulation using Matlab

Size: px
Start display at page:

Download "Exercise 2: Numerical Analysis and Simulation using Matlab"

Transcription

1 Exercise 2: Numerical Analysis and Simulation using Matlab Part I Time costs of computing (10 marks) Write a Matlab script that performs the following: 1. Using the rand command, generate a vector of 500 observations uniformly distributed between 0 and 10000; call this vector a. 2. Again using the rand command, generate another vector of 500 observations uniformly distributed between 0 and 1; call this vector b. 3. Repeat the following operations for iterations: i. The sum of a and b ii. The difference of a and b iii. The scalar product of a and b iv. The scalar division of a and b v. The square root of a vi. The exponential function of a. vii. The sine of a viii. The tangent of a Look up the profile command in the Matlab help files. Use the profile command to compare the computing time of these operations. Express the time costs of each operation as a ratio relative to i.; report these ratios in a table. Suggested solution The M-file ops.m executes steps 1 to 3 as follows: T = 10000; n = 500; rand('state',1); a = 10000*rand(n,1); b = rand(n,1); for i = 1:T a+b; a-b; a.*b; a./b; sqrt(a); exp(a); sin(a); tan(a); Econ 353 Spring 2006 Page 1 of 13

2 The Matlab profiler traces the sequence of calls to functions and scripts and analyses how much time is spent executing specific operations. The profile command includes an option for reporting the detail for built-in functions and operators like addition and multiplication. Running the commands >> profile on -detail operator >> ops >> profile report opsreport produces a report in HTML format. The report states how much time was spent on the following lines: 7: for i = 1:T % 8: a+b; % 9: a-b; % 10: a.*b; % 11: a./b; % 12: sqrt(a); % 13: exp(a); % 14: sin(a); % 15: tan(a); % 16: Expressing the computing times as ratios: Operation Index a+b 1.00 a-b 1.58 a.*b 1.08 a./b 1.92 sqrt(a) exp(a) sin(a) tan(a) Thus we can see that the last 4 operations are an order of magnitude more costly relative to the elementary arithmetic operations. Econ 353 Spring 2006 Page 2 of 13

3 Part II Stopping rules in iterative methods (15 marks) In Lab 1, you looked at a Matlab implementation of the simple Walrasian iterative. Modify the code in walras1.m to consider two different stopping rules: Rule 1: Stop if p k p k+1 / (1 + p k ) ε Rule 2: Stop if p k p k+1 ε (1 β*) where β* = max j=1,,k p k+1-j - p k+1 / p k-j - p k (Here,. means absolute value of.) For each rule, report the final value of the iterative and the number of iterations for ε = 10-2, 10-4, 10-6, Define a suitable accuracy measure and use it to evaluate each rule-ε combination: which is the most accurate? Based on your results, provide an estimate for the iterative s rate of (linear) convergence. Suggested solution See the M-files walras_rule1.m and walras_rule2.m for the suggested Matlab implementations. The main loops of these M-files appear below. [Marking guide: I would suggest allocating 10 marks to the quality of the Matlab implementation and reported results, 5 marks for the analysis.] Results for Rule 1: Value of ε Final no. of iterations, k Final value of the iterative, p k Excess demand evaluated at final p k [walras_rule1.m] for k=1:maxit if k>maxit maxit_reached = 1; break E_k = 0.5*p_k^(-0.2) + 0.5*p_k^(-0.5)-1; p_k1 = p_k + lambda * E_k; % excess demand at p(k) if abs(p_k-p_k1) <= tol*(1+abs(p_k)) break else p_k = p_k1; Econ 353 Spring 2006 Page 3 of 13

4 Results for Rule 2: Value of ε Final no. of iterations, k Final value of the iterative, p k Excess demand evaluated at final p k [walras_rule2.m] for k=1:maxit if k>maxit maxit_reached = 1; break E_k = 0.5*p(k)^(-0.2) + 0.5*p(k)^(-0.5)-1; % excess demand at p(k) p(k+1) = p(k) + lambda * E_k; beta_star = 0; beta = zeros(1,k); if k>1 for j=1:k-1 beta(k+1-j) = abs(p(k+1-j)-p(k+1))/abs(p(k-j)-p(k)); beta_star = max(beta); if abs(p(k)-p(k+1)) <= tol*(1-beta_star) break As discussed in class, a natural way to measure accuracy is the difference of the excess demand from zero; this is an example of forward error analysis. By this measure, it is clear that Rule 2 with ε = 10-8 produces the most accurate result. Or we could use the fact that for the given excess demand function we know the analytical solution is p* = 1 and thus calculate the true error as p k 1. For a linearly convergent iterative, we defined the rate of convergence as the limit of the sequence β k = p k+1 - p* / p k - p* as k where p* is the limiting value of the iterative. In walras_rule2.m, estimation of β* produces a vector of values p k+1-j - p k+1 / p k-j - p k which we can view as estimates of β k, j = 1,,k. If p k is close to p*, these β k estimates should approach the limiting rate of convergence as j decreases. A plot of the β k estimates for ε =10-8, k = 48 suggests that the limiting rate of convergence 0.65: Econ 353 Spring 2006 Page 4 of 13

5 beta Iteration Econ 353 Spring 2006 Page 5 of 13

6 Part III A Monte Carlo experiment (20 marks) One of the main tasks of econometrics is to study the sampling distribution of an estimator or test statistic. For instance, we may be interested in the bias the expected difference from the population mean of an estimator in repeated samples. There are powerful theorems (e.g. central limit theorems) that characterize the sampling distributions of estimators as the sample size ts to infinity. With finite samples, however, these asymptotic theorems are often uninformative or misleading. Monte Carlo simulation is a method for studying the finite-sample distribution of an estimator or test statistic. It uses simulated experimental data to evaluate the performance of the estimator or test procedure. The basic steps in a Monte Carlo experiment are the following: 1. Specify a model for the data-generating process (DGP), i.e. make assumptions about functional relationships among the variables, probability distributions, and the true values of the associated parameters. 2. Generate R data sets (samples) by simulating R random draws from the DGP. Typically, the sampling process is based on a computer-generated sequence of pseudo-random numbers. 3. Calculate the statistic of interest for each data set. The R calculated values represent the sampling distribution of the statistic. 4. Calculate the desired sampling measure for the statistic. E.g. The bias of the statistic is estimated as the average deviation from the true mean taken over the simulated sampling distribution. Your task is to write a Matlab program that performs Monte Carlo experiments. In developing your program, observe the following coding guidelines: - Declare and initialize the main variables before performing operations on them. - Where possible, use parameters rather than hard-coded numbers. - Use vectorized code rather than loops when it is efficient to do so. - Include concise and informative comments throughout. A. Preparation Consider the following DGP: y t = β + v t ; t = 1,2,,T where β is a constant and v t is indepently and identically distributed (i.i.d.) as Normal with mean 0 and variance σ Write the pseudo-code for a Monte Carlo experiment that estimates the bias of a given estimator of β. Use the following notation: Econ 353 Spring 2006 Page 6 of 13

7 R T y rt b r B(y) randn() number of samples generated by the experiment number of observations in each sample t-th observation in sample r value of the estimator for sample r the estimator for β as a function of the input vector y a function that returns a value drawn at random from a Normal distribution with mean 0 and variance 1 The bias estimate is to be computed as the average of b r β over the R samples. Assume the estimates b r are stored in an R-by-1 vector named BDist. You may declare additional scalar, vector, or matrix variables as needed. Use the symbol to denote assignment and use = to denote a test for equality. (Refer to the pseudo-code example from Session 3 as a style guide.) Suggested solution [Marking guide: I would suggest allocating 4 marks for the pseudo-code, 8 marks for the Matlab implementation, 8 marks for the analysis. The bonus question is worth 5 marks.] For r = 1, 2,, R: For t = 1, 2,, T: y rt β + σ randn() // need to multiply by σ to get Var(v rt ) = σ 2. y sample (y r1,y r2,,y rt ) // collect the observations for sample r in a vector b r B(y sample ).. bias ( r=1,..,r b r )/R β 2. Implement your pseudo-code in Matlab. Begin by creating two M-files: mc.m A script file that implements the Monte Carlo simulation steps. B.m A function file that implements the estimator B(y). The function should take a vector as input and return a scalar value. (Leave B.m as an empty stub for now; the specific estimation rules will be defined later.) Your main script should call the function B.m to obtain estimates of β. Use the built-in Matlab function randn() to draw values of v t. Include the following line of code to initialize the Matlab random number generator: randn('state',1); Econ 353 Spring 2006 Page 7 of 13

8 Suggested solution: % mc.m % % A simple Monte Carlo simulation for estimating the mean. % Assumes a normal distribution for the error term. % % Ming Kang % February 2006 clear all; % Set parameters R = 100; T = round(10^4.5); beta = 2; sigma2 = 4; % Initialize variables BDist = zeros(r,1); y = zeros(t,1); % Set random-number generator randn('state',1); % Simulation step for r = 1:R y = beta + sqrt(sigma2)*randn(t,1); BDist(r) = B(y); % Calculate sampling statistics mu = mean(bdist); bias = mu - beta; mse = mean((bdist - beta).^2); hist(bdist,20); % B.m % function b = B(y) b = mean(y); % b = median(y); Econ 353 Spring 2006 Page 8 of 13

9 B. Analysis 1. Explain how the Monte Carlo algorithm is affected by each of the three types of numerical error discussed in class. Which do you think is the greatest source of error and why? Describe the effect of R on the accuracy of the algorithm. 2. Estimate β with the sample mean: that is, B.m should compute the average of the values in the input vector. Set β = 2, σ 2 = 4, R = 100, T = 25 and run the Monte Carlo experiment. Use the hist command to plot a histogram of BDist (use bins of size 1) and comment on its shape: is the distribution symmetric, smooth, bell-shaped, etc. Report the mean and variance of BDist. 3. Repeat Question 2 but this time estimate β with the sample median; that is, B.m should compute the median (i.e. the 50 th percentile) of the values in the input vector. Compare your results with Question 1 and comment on any differences. 4. Repeat the experiments in Questions 2 and 3 for T = 10 k ; k = 2.0, 2.5, 3.0,, 5.0. For each value of T, estimate the mean-squared error (MSE) of each estimator: that is, calculate the average value of (b r -β) 2 over the R estimates. Use the loglog command to plot the MSE of both estimators against T and comment on any tr that may be apparent. Compute the ratio of the MSE for the sample median to the MSE for the sample mean; record the values for each T in a table. What happens to the MSE ratio as T grows large? 5. (Bonus question) Consider the same DGP but now suppose v t is drawn from the Student s-t distribution instead of the Normal distribution. The Student s t-distribution deps on a degrees-of-freedom parameter, d = 1,2,. The procedure for sampling from the Student s-t distribution with d degrees of freedom is the following: i. Sample d+1 values from the standard Normal distribution (mean 0, variance 1); call these z n ; n = 1,2,,d+1 ii. Compute the ratio: t = z 1 d d + 1 d n= 1 z 2 n Modify your code for mc.m so that the v t s are generated using this procedure. Repeat the steps in Question 4 and fill in the table below with values for the MSE ratio. Compare your results for T = 25 with those from Question 4; comment on the effects of d and T. Degrees of freedom (d) No. of obs. (T) Econ 353 Spring 2006 Page 9 of 13

10 Suggested solution: 1. The three sources of numeric error discussed in class were modeling error, approximation error, and round-off error. With Monte Carlo experiments, modeling error is negligible by design: the experimenter chooses the parameters, distributions, etc. associated with the data generating process and thus there is an exact correspondence between the statistical model and the truth. (Whether the model generates realistic data is not of direct interest in this problem.) Approximation error is present because the simulated sampling distribution is an estimate; we only observe R discrete values of the sample statistic where in fact the statistic has a continuous distribution over an infinity of values. (There is also approximation error in how the normal random variable is generated from a sequence of pseudo-random numbers but this is less significant.) Roundoff error is probably small given that the range of computed numbers is not extreme and also averaging ts to smooth out these errors. Therefore, approximation error seems to be the most significant source of numeric error. The parameter R controls the number of replicated draws from the sampling distribution; the higher is R, the better is the discrete approximation to the true sampling distribution. 2. Results for the sample mean with T = 25: Here, BDist has mean and variance As illustrated in the histogram, the sampling distribution is rough but it appears to be single-peaked and not noticeably skewed. Econ 353 Spring 2006 Page 10 of 13

11 3. Results for the sample median with T=25: Now BDist has mean and variance Thus the sample median appears to have greater bias and higher variance. The distribution as illustrated in the histogram is noticeably choppier and more spread out than before. 4. Results: log 10 (T) MSE for the mean MSE for the median MSE ratio (median/mean) For both estimators, the loglog plot suggests log 10 (MSE) is a linear function of log 10 (T) with negative slope, which suggests that MSE declines exponentially with T. The position of the tr is lower for the sample mean; that is, the MSE for the sample median is always greater than for the sample mean. The MSE ratio is positive for all T; the ratio fluctuates as T increases, no increasing or decreasing tr is apparent. Econ 353 Spring 2006 Page 11 of 13

12 mse for mean (solid), median (dash) T 5. Results for R = 100: Degrees of freedom (d) No. of obs. (T) Notice that the median has lower MSE than the mean in some cases which contrasts with the earlier result that the MSE ratio was always positive. No clear trs are apparent except for the case of T=25 in which the MSE ratio increases with d; this makes sense as the t-distribution actually looks more like the normal distribution as d gets large. Repeating the experiment (not required) with R = 1000 confirms this result: (R = 1000) Degrees of freedom (d) No. of obs. (T) Econ 353 Spring 2006 Page 12 of 13

13 The new code: % mc2.m % % A simple Monte Carlo simulation for estimating the mean. % Assumes a t-distribution for the error term. % % Ming Kang % February 2006 clear all; % Set parameters R = 1000; T = 100; beta = 2; d = 3; % Initialize variables BDist1 = zeros(r,1); BDist2 = zeros(r,1); y = zeros(t,1); % sample mean % sample median % Set random-number generator randn('state',1); % Simulation step for r = 1:R numer = randn(t,1); denom = sqrt(sum(randn(t,d).^2,2)); z = numer./denom; y = beta + z; BDist1(r) = mean(y); BDist2(r) = median(y); % Calculate sampling statistics mse1 = mean((bdist1 - beta).^2); mse2 = mean((bdist2 - beta).^2); mse_r = mse2/mse1 Econ 353 Spring 2006 Page 13 of 13

Chapter 3 RANDOM VARIATE GENERATION

Chapter 3 RANDOM VARIATE GENERATION Chapter 3 RANDOM VARIATE GENERATION In order to do a Monte Carlo simulation either by hand or by computer, techniques must be developed for generating values of random variables having known distributions.

More information

Lecture 2: Descriptive Statistics and Exploratory Data Analysis

Lecture 2: Descriptive Statistics and Exploratory Data Analysis Lecture 2: Descriptive Statistics and Exploratory Data Analysis Further Thoughts on Experimental Design 16 Individuals (8 each from two populations) with replicates Pop 1 Pop 2 Randomly sample 4 individuals

More information

AP Physics 1 and 2 Lab Investigations

AP Physics 1 and 2 Lab Investigations AP Physics 1 and 2 Lab Investigations Student Guide to Data Analysis New York, NY. College Board, Advanced Placement, Advanced Placement Program, AP, AP Central, and the acorn logo are registered trademarks

More information

Generating Random Numbers Variance Reduction Quasi-Monte Carlo. Simulation Methods. Leonid Kogan. MIT, Sloan. 15.450, Fall 2010

Generating Random Numbers Variance Reduction Quasi-Monte Carlo. Simulation Methods. Leonid Kogan. MIT, Sloan. 15.450, Fall 2010 Simulation Methods Leonid Kogan MIT, Sloan 15.450, Fall 2010 c Leonid Kogan ( MIT, Sloan ) Simulation Methods 15.450, Fall 2010 1 / 35 Outline 1 Generating Random Numbers 2 Variance Reduction 3 Quasi-Monte

More information

The Effects of Start Prices on the Performance of the Certainty Equivalent Pricing Policy

The Effects of Start Prices on the Performance of the Certainty Equivalent Pricing Policy BMI Paper The Effects of Start Prices on the Performance of the Certainty Equivalent Pricing Policy Faculty of Sciences VU University Amsterdam De Boelelaan 1081 1081 HV Amsterdam Netherlands Author: R.D.R.

More information

Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model

Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model 1 September 004 A. Introduction and assumptions The classical normal linear regression model can be written

More information

Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization. Learning Goals. GENOME 560, Spring 2012

Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization. Learning Goals. GENOME 560, Spring 2012 Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization GENOME 560, Spring 2012 Data are interesting because they help us understand the world Genomics: Massive Amounts

More information

GLMs: Gompertz s Law. GLMs in R. Gompertz s famous graduation formula is. or log µ x is linear in age, x,

GLMs: Gompertz s Law. GLMs in R. Gompertz s famous graduation formula is. or log µ x is linear in age, x, Computing: an indispensable tool or an insurmountable hurdle? Iain Currie Heriot Watt University, Scotland ATRC, University College Dublin July 2006 Plan of talk General remarks The professional syllabus

More information

4. Continuous Random Variables, the Pareto and Normal Distributions

4. Continuous Random Variables, the Pareto and Normal Distributions 4. Continuous Random Variables, the Pareto and Normal Distributions A continuous random variable X can take any value in a given range (e.g. height, weight, age). The distribution of a continuous random

More information

A few useful MATLAB functions

A few useful MATLAB functions A few useful MATLAB functions Renato Feres - Math 350 Fall 2012 1 Uniform random numbers The Matlab command rand is based on a popular deterministic algorithm called multiplicative congruential method.

More information

99.37, 99.38, 99.38, 99.39, 99.39, 99.39, 99.39, 99.40, 99.41, 99.42 cm

99.37, 99.38, 99.38, 99.39, 99.39, 99.39, 99.39, 99.40, 99.41, 99.42 cm Error Analysis and the Gaussian Distribution In experimental science theory lives or dies based on the results of experimental evidence and thus the analysis of this evidence is a critical part of the

More information

MBA 611 STATISTICS AND QUANTITATIVE METHODS

MBA 611 STATISTICS AND QUANTITATIVE METHODS MBA 611 STATISTICS AND QUANTITATIVE METHODS Part I. Review of Basic Statistics (Chapters 1-11) A. Introduction (Chapter 1) Uncertainty: Decisions are often based on incomplete information from uncertain

More information

MEASURES OF VARIATION

MEASURES OF VARIATION NORMAL DISTRIBTIONS MEASURES OF VARIATION In statistics, it is important to measure the spread of data. A simple way to measure spread is to find the range. But statisticians want to know if the data are

More information

Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics

Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics For 2015 Examinations Aim The aim of the Probability and Mathematical Statistics subject is to provide a grounding in

More information

Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur

Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur Module No. #01 Lecture No. #15 Special Distributions-VI Today, I am going to introduce

More information

Centre for Central Banking Studies

Centre for Central Banking Studies Centre for Central Banking Studies Technical Handbook No. 4 Applied Bayesian econometrics for central bankers Andrew Blake and Haroon Mumtaz CCBS Technical Handbook No. 4 Applied Bayesian econometrics

More information

3: Summary Statistics

3: Summary Statistics 3: Summary Statistics Notation Let s start by introducing some notation. Consider the following small data set: 4 5 30 50 8 7 4 5 The symbol n represents the sample size (n = 0). The capital letter X denotes

More information

Lecture Notes Module 1

Lecture Notes Module 1 Lecture Notes Module 1 Study Populations A study population is a clearly defined collection of people, animals, plants, or objects. In psychological research, a study population usually consists of a specific

More information

Overview of Monte Carlo Simulation, Probability Review and Introduction to Matlab

Overview of Monte Carlo Simulation, Probability Review and Introduction to Matlab Monte Carlo Simulation: IEOR E4703 Fall 2004 c 2004 by Martin Haugh Overview of Monte Carlo Simulation, Probability Review and Introduction to Matlab 1 Overview of Monte Carlo Simulation 1.1 Why use simulation?

More information

FEGYVERNEKI SÁNDOR, PROBABILITY THEORY AND MATHEmATICAL

FEGYVERNEKI SÁNDOR, PROBABILITY THEORY AND MATHEmATICAL FEGYVERNEKI SÁNDOR, PROBABILITY THEORY AND MATHEmATICAL STATIsTICs 4 IV. RANDOm VECTORs 1. JOINTLY DIsTRIBUTED RANDOm VARIABLEs If are two rom variables defined on the same sample space we define the joint

More information

Permutation Tests for Comparing Two Populations

Permutation Tests for Comparing Two Populations Permutation Tests for Comparing Two Populations Ferry Butar Butar, Ph.D. Jae-Wan Park Abstract Permutation tests for comparing two populations could be widely used in practice because of flexibility of

More information

Assumptions. Assumptions of linear models. Boxplot. Data exploration. Apply to response variable. Apply to error terms from linear model

Assumptions. Assumptions of linear models. Boxplot. Data exploration. Apply to response variable. Apply to error terms from linear model Assumptions Assumptions of linear models Apply to response variable within each group if predictor categorical Apply to error terms from linear model check by analysing residuals Normality Homogeneity

More information

Measuring Line Edge Roughness: Fluctuations in Uncertainty

Measuring Line Edge Roughness: Fluctuations in Uncertainty Tutor6.doc: Version 5/6/08 T h e L i t h o g r a p h y E x p e r t (August 008) Measuring Line Edge Roughness: Fluctuations in Uncertainty Line edge roughness () is the deviation of a feature edge (as

More information

Betting with the Kelly Criterion

Betting with the Kelly Criterion Betting with the Kelly Criterion Jane June 2, 2010 Contents 1 Introduction 2 2 Kelly Criterion 2 3 The Stock Market 3 4 Simulations 5 5 Conclusion 8 1 Page 2 of 9 1 Introduction Gambling in all forms,

More information

NCSS Statistical Software

NCSS Statistical Software Chapter 06 Introduction This procedure provides several reports for the comparison of two distributions, including confidence intervals for the difference in means, two-sample t-tests, the z-test, the

More information

Review Jeopardy. Blue vs. Orange. Review Jeopardy

Review Jeopardy. Blue vs. Orange. Review Jeopardy Review Jeopardy Blue vs. Orange Review Jeopardy Jeopardy Round Lectures 0-3 Jeopardy Round $200 How could I measure how far apart (i.e. how different) two observations, y 1 and y 2, are from each other?

More information

Analysis of Bayesian Dynamic Linear Models

Analysis of Bayesian Dynamic Linear Models Analysis of Bayesian Dynamic Linear Models Emily M. Casleton December 17, 2010 1 Introduction The main purpose of this project is to explore the Bayesian analysis of Dynamic Linear Models (DLMs). The main

More information

Means, standard deviations and. and standard errors

Means, standard deviations and. and standard errors CHAPTER 4 Means, standard deviations and standard errors 4.1 Introduction Change of units 4.2 Mean, median and mode Coefficient of variation 4.3 Measures of variation 4.4 Calculating the mean and standard

More information

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number 1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number A. 3(x - x) B. x 3 x C. 3x - x D. x - 3x 2) Write the following as an algebraic expression

More information

This unit will lay the groundwork for later units where the students will extend this knowledge to quadratic and exponential functions.

This unit will lay the groundwork for later units where the students will extend this knowledge to quadratic and exponential functions. Algebra I Overview View unit yearlong overview here Many of the concepts presented in Algebra I are progressions of concepts that were introduced in grades 6 through 8. The content presented in this course

More information

Java Modules for Time Series Analysis

Java Modules for Time Series Analysis Java Modules for Time Series Analysis Agenda Clustering Non-normal distributions Multifactor modeling Implied ratings Time series prediction 1. Clustering + Cluster 1 Synthetic Clustering + Time series

More information

6 Scalar, Stochastic, Discrete Dynamic Systems

6 Scalar, Stochastic, Discrete Dynamic Systems 47 6 Scalar, Stochastic, Discrete Dynamic Systems Consider modeling a population of sand-hill cranes in year n by the first-order, deterministic recurrence equation y(n + 1) = Ry(n) where R = 1 + r = 1

More information

NEW YORK STATE TEACHER CERTIFICATION EXAMINATIONS

NEW YORK STATE TEACHER CERTIFICATION EXAMINATIONS NEW YORK STATE TEACHER CERTIFICATION EXAMINATIONS TEST DESIGN AND FRAMEWORK September 2014 Authorized for Distribution by the New York State Education Department This test design and framework document

More information

1.5 Oneway Analysis of Variance

1.5 Oneway Analysis of Variance Statistics: Rosie Cornish. 200. 1.5 Oneway Analysis of Variance 1 Introduction Oneway analysis of variance (ANOVA) is used to compare several means. This method is often used in scientific or medical experiments

More information

The Steepest Descent Algorithm for Unconstrained Optimization and a Bisection Line-search Method

The Steepest Descent Algorithm for Unconstrained Optimization and a Bisection Line-search Method The Steepest Descent Algorithm for Unconstrained Optimization and a Bisection Line-search Method Robert M. Freund February, 004 004 Massachusetts Institute of Technology. 1 1 The Algorithm The problem

More information

Mean, Median, Standard Deviation Prof. McGahagan Stat 1040

Mean, Median, Standard Deviation Prof. McGahagan Stat 1040 Mean, Median, Standard Deviation Prof. McGahagan Stat 1040 Mean = arithmetic average, add all the values and divide by the number of values. Median = 50 th percentile; sort the data and choose the middle

More information

Algebra 2 Chapter 1 Vocabulary. identity - A statement that equates two equivalent expressions.

Algebra 2 Chapter 1 Vocabulary. identity - A statement that equates two equivalent expressions. Chapter 1 Vocabulary identity - A statement that equates two equivalent expressions. verbal model- A word equation that represents a real-life problem. algebraic expression - An expression with variables.

More information

Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY

Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY 1. Introduction Besides arriving at an appropriate expression of an average or consensus value for observations of a population, it is important to

More information

Introduction to Matlab

Introduction to Matlab Introduction to Matlab Social Science Research Lab American University, Washington, D.C. Web. www.american.edu/provost/ctrl/pclabs.cfm Tel. x3862 Email. SSRL@American.edu Course Objective This course provides

More information

CS 103X: Discrete Structures Homework Assignment 3 Solutions

CS 103X: Discrete Structures Homework Assignment 3 Solutions CS 103X: Discrete Structures Homework Assignment 3 s Exercise 1 (20 points). On well-ordering and induction: (a) Prove the induction principle from the well-ordering principle. (b) Prove the well-ordering

More information

Lecture 5 : The Poisson Distribution

Lecture 5 : The Poisson Distribution Lecture 5 : The Poisson Distribution Jonathan Marchini November 10, 2008 1 Introduction Many experimental situations occur in which we observe the counts of events within a set unit of time, area, volume,

More information

Financial Econometrics MFE MATLAB Introduction. Kevin Sheppard University of Oxford

Financial Econometrics MFE MATLAB Introduction. Kevin Sheppard University of Oxford Financial Econometrics MFE MATLAB Introduction Kevin Sheppard University of Oxford October 21, 2013 2007-2013 Kevin Sheppard 2 Contents Introduction i 1 Getting Started 1 2 Basic Input and Operators 5

More information

SIMULATION STUDIES IN STATISTICS WHAT IS A SIMULATION STUDY, AND WHY DO ONE? What is a (Monte Carlo) simulation study, and why do one?

SIMULATION STUDIES IN STATISTICS WHAT IS A SIMULATION STUDY, AND WHY DO ONE? What is a (Monte Carlo) simulation study, and why do one? SIMULATION STUDIES IN STATISTICS WHAT IS A SIMULATION STUDY, AND WHY DO ONE? What is a (Monte Carlo) simulation study, and why do one? Simulations for properties of estimators Simulations for properties

More information

BNG 202 Biomechanics Lab. Descriptive statistics and probability distributions I

BNG 202 Biomechanics Lab. Descriptive statistics and probability distributions I BNG 202 Biomechanics Lab Descriptive statistics and probability distributions I Overview The overall goal of this short course in statistics is to provide an introduction to descriptive and inferential

More information

E3: PROBABILITY AND STATISTICS lecture notes

E3: PROBABILITY AND STATISTICS lecture notes E3: PROBABILITY AND STATISTICS lecture notes 2 Contents 1 PROBABILITY THEORY 7 1.1 Experiments and random events............................ 7 1.2 Certain event. Impossible event............................

More information

Exploratory Data Analysis

Exploratory Data Analysis Exploratory Data Analysis Johannes Schauer johannes.schauer@tugraz.at Institute of Statistics Graz University of Technology Steyrergasse 17/IV, 8010 Graz www.statistics.tugraz.at February 12, 2008 Introduction

More information

Vector and Matrix Norms

Vector and Matrix Norms Chapter 1 Vector and Matrix Norms 11 Vector Spaces Let F be a field (such as the real numbers, R, or complex numbers, C) with elements called scalars A Vector Space, V, over the field F is a non-empty

More information

1 The Brownian bridge construction

1 The Brownian bridge construction The Brownian bridge construction The Brownian bridge construction is a way to build a Brownian motion path by successively adding finer scale detail. This construction leads to a relatively easy proof

More information

Nonparametric adaptive age replacement with a one-cycle criterion

Nonparametric adaptive age replacement with a one-cycle criterion Nonparametric adaptive age replacement with a one-cycle criterion P. Coolen-Schrijner, F.P.A. Coolen Department of Mathematical Sciences University of Durham, Durham, DH1 3LE, UK e-mail: Pauline.Schrijner@durham.ac.uk

More information

Below is a very brief tutorial on the basic capabilities of Excel. Refer to the Excel help files for more information.

Below is a very brief tutorial on the basic capabilities of Excel. Refer to the Excel help files for more information. Excel Tutorial Below is a very brief tutorial on the basic capabilities of Excel. Refer to the Excel help files for more information. Working with Data Entering and Formatting Data Before entering data

More information

STT315 Chapter 4 Random Variables & Probability Distributions KM. Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables

STT315 Chapter 4 Random Variables & Probability Distributions KM. Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables Discrete vs. continuous random variables Examples of continuous distributions o Uniform o Exponential o Normal Recall: A random

More information

Georgia Department of Education Kathy Cox, State Superintendent of Schools 7/19/2005 All Rights Reserved 1

Georgia Department of Education Kathy Cox, State Superintendent of Schools 7/19/2005 All Rights Reserved 1 Accelerated Mathematics 3 This is a course in precalculus and statistics, designed to prepare students to take AB or BC Advanced Placement Calculus. It includes rational, circular trigonometric, and inverse

More information

Descriptive Statistics

Descriptive Statistics Y520 Robert S Michael Goal: Learn to calculate indicators and construct graphs that summarize and describe a large quantity of values. Using the textbook readings and other resources listed on the web

More information

u = [ 2 4 5] has one row with three components (a 3 v = [2 4 5] has three rows separated by semicolons (a 3 w = 2:5 generates the row vector w = [ 2 3

u = [ 2 4 5] has one row with three components (a 3 v = [2 4 5] has three rows separated by semicolons (a 3 w = 2:5 generates the row vector w = [ 2 3 MATLAB Tutorial You need a small numb e r of basic commands to start using MATLAB. This short tutorial describes those fundamental commands. You need to create vectors and matrices, to change them, and

More information

Basics of Statistical Machine Learning

Basics of Statistical Machine Learning CS761 Spring 2013 Advanced Machine Learning Basics of Statistical Machine Learning Lecturer: Xiaojin Zhu jerryzhu@cs.wisc.edu Modern machine learning is rooted in statistics. You will find many familiar

More information

a 11 x 1 + a 12 x 2 + + a 1n x n = b 1 a 21 x 1 + a 22 x 2 + + a 2n x n = b 2.

a 11 x 1 + a 12 x 2 + + a 1n x n = b 1 a 21 x 1 + a 22 x 2 + + a 2n x n = b 2. Chapter 1 LINEAR EQUATIONS 1.1 Introduction to linear equations A linear equation in n unknowns x 1, x,, x n is an equation of the form a 1 x 1 + a x + + a n x n = b, where a 1, a,..., a n, b are given

More information

Geostatistics Exploratory Analysis

Geostatistics Exploratory Analysis Instituto Superior de Estatística e Gestão de Informação Universidade Nova de Lisboa Master of Science in Geospatial Technologies Geostatistics Exploratory Analysis Carlos Alberto Felgueiras cfelgueiras@isegi.unl.pt

More information

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4) Summary of Formulas and Concepts Descriptive Statistics (Ch. 1-4) Definitions Population: The complete set of numerical information on a particular quantity in which an investigator is interested. We assume

More information

Lesson 4 Measures of Central Tendency

Lesson 4 Measures of Central Tendency Outline Measures of a distribution s shape -modality and skewness -the normal distribution Measures of central tendency -mean, median, and mode Skewness and Central Tendency Lesson 4 Measures of Central

More information

Gamma Distribution Fitting

Gamma Distribution Fitting Chapter 552 Gamma Distribution Fitting Introduction This module fits the gamma probability distributions to a complete or censored set of individual or grouped data values. It outputs various statistics

More information

Least Squares Estimation

Least Squares Estimation Least Squares Estimation SARA A VAN DE GEER Volume 2, pp 1041 1045 in Encyclopedia of Statistics in Behavioral Science ISBN-13: 978-0-470-86080-9 ISBN-10: 0-470-86080-4 Editors Brian S Everitt & David

More information

The Method of Least Squares

The Method of Least Squares The Method of Least Squares Steven J. Miller Mathematics Department Brown University Providence, RI 0292 Abstract The Method of Least Squares is a procedure to determine the best fit line to data; the

More information

Fairfield Public Schools

Fairfield Public Schools Mathematics Fairfield Public Schools AP Statistics AP Statistics BOE Approved 04/08/2014 1 AP STATISTICS Critical Areas of Focus AP Statistics is a rigorous course that offers advanced students an opportunity

More information

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING In this lab you will explore the concept of a confidence interval and hypothesis testing through a simulation problem in engineering setting.

More information

TEC H N I C A L R E P O R T

TEC H N I C A L R E P O R T I N S P E C T A TEC H N I C A L R E P O R T Master s Thesis Determination of Safety Factors in High-Cycle Fatigue - Limitations and Possibilities Robert Peterson Supervisors: Magnus Dahlberg Christian

More information

Appendix 3 IB Diploma Programme Course Outlines

Appendix 3 IB Diploma Programme Course Outlines Appendix 3 IB Diploma Programme Course Outlines The following points should be addressed when preparing course outlines for each IB Diploma Programme subject to be taught. Please be sure to use IBO nomenclature

More information

7 Gaussian Elimination and LU Factorization

7 Gaussian Elimination and LU Factorization 7 Gaussian Elimination and LU Factorization In this final section on matrix factorization methods for solving Ax = b we want to take a closer look at Gaussian elimination (probably the best known method

More information

Week 4: Standard Error and Confidence Intervals

Week 4: Standard Error and Confidence Intervals Health Sciences M.Sc. Programme Applied Biostatistics Week 4: Standard Error and Confidence Intervals Sampling Most research data come from subjects we think of as samples drawn from a larger population.

More information

DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.

DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses. DESCRIPTIVE STATISTICS The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses. DESCRIPTIVE VS. INFERENTIAL STATISTICS Descriptive To organize,

More information

Numerical Matrix Analysis

Numerical Matrix Analysis Numerical Matrix Analysis Lecture Notes #10 Conditioning and / Peter Blomgren, blomgren.peter@gmail.com Department of Mathematics and Statistics Dynamical Systems Group Computational Sciences Research

More information

Descriptive Statistics and Measurement Scales

Descriptive Statistics and Measurement Scales Descriptive Statistics 1 Descriptive Statistics and Measurement Scales Descriptive statistics are used to describe the basic features of the data in a study. They provide simple summaries about the sample

More information

Current Standard: Mathematical Concepts and Applications Shape, Space, and Measurement- Primary

Current Standard: Mathematical Concepts and Applications Shape, Space, and Measurement- Primary Shape, Space, and Measurement- Primary A student shall apply concepts of shape, space, and measurement to solve problems involving two- and three-dimensional shapes by demonstrating an understanding of:

More information

INDIRECT INFERENCE (prepared for: The New Palgrave Dictionary of Economics, Second Edition)

INDIRECT INFERENCE (prepared for: The New Palgrave Dictionary of Economics, Second Edition) INDIRECT INFERENCE (prepared for: The New Palgrave Dictionary of Economics, Second Edition) Abstract Indirect inference is a simulation-based method for estimating the parameters of economic models. Its

More information

The right edge of the box is the third quartile, Q 3, which is the median of the data values above the median. Maximum Median

The right edge of the box is the third quartile, Q 3, which is the median of the data values above the median. Maximum Median CONDENSED LESSON 2.1 Box Plots In this lesson you will create and interpret box plots for sets of data use the interquartile range (IQR) to identify potential outliers and graph them on a modified box

More information

Statistics 104: Section 6!

Statistics 104: Section 6! Page 1 Statistics 104: Section 6! TF: Deirdre (say: Dear-dra) Bloome Email: dbloome@fas.harvard.edu Section Times Thursday 2pm-3pm in SC 109, Thursday 5pm-6pm in SC 705 Office Hours: Thursday 6pm-7pm SC

More information

Probability and Statistics Vocabulary List (Definitions for Middle School Teachers)

Probability and Statistics Vocabulary List (Definitions for Middle School Teachers) Probability and Statistics Vocabulary List (Definitions for Middle School Teachers) B Bar graph a diagram representing the frequency distribution for nominal or discrete data. It consists of a sequence

More information

Exercise 1.12 (Pg. 22-23)

Exercise 1.12 (Pg. 22-23) Individuals: The objects that are described by a set of data. They may be people, animals, things, etc. (Also referred to as Cases or Records) Variables: The characteristics recorded about each individual.

More information

AP STATISTICS REVIEW (YMS Chapters 1-8)

AP STATISTICS REVIEW (YMS Chapters 1-8) AP STATISTICS REVIEW (YMS Chapters 1-8) Exploring Data (Chapter 1) Categorical Data nominal scale, names e.g. male/female or eye color or breeds of dogs Quantitative Data rational scale (can +,,, with

More information

Please follow the directions once you locate the Stata software in your computer. Room 114 (Business Lab) has computers with Stata software

Please follow the directions once you locate the Stata software in your computer. Room 114 (Business Lab) has computers with Stata software STATA Tutorial Professor Erdinç Please follow the directions once you locate the Stata software in your computer. Room 114 (Business Lab) has computers with Stata software 1.Wald Test Wald Test is used

More information

15.062 Data Mining: Algorithms and Applications Matrix Math Review

15.062 Data Mining: Algorithms and Applications Matrix Math Review .6 Data Mining: Algorithms and Applications Matrix Math Review The purpose of this document is to give a brief review of selected linear algebra concepts that will be useful for the course and to develop

More information

MATH BOOK OF PROBLEMS SERIES. New from Pearson Custom Publishing!

MATH BOOK OF PROBLEMS SERIES. New from Pearson Custom Publishing! MATH BOOK OF PROBLEMS SERIES New from Pearson Custom Publishing! The Math Book of Problems Series is a database of math problems for the following courses: Pre-algebra Algebra Pre-calculus Calculus Statistics

More information

6.4 Normal Distribution

6.4 Normal Distribution Contents 6.4 Normal Distribution....................... 381 6.4.1 Characteristics of the Normal Distribution....... 381 6.4.2 The Standardized Normal Distribution......... 385 6.4.3 Meaning of Areas under

More information

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( ) Chapter 340 Principal Components Regression Introduction is a technique for analyzing multiple regression data that suffer from multicollinearity. When multicollinearity occurs, least squares estimates

More information

Introduction to General and Generalized Linear Models

Introduction to General and Generalized Linear Models Introduction to General and Generalized Linear Models General Linear Models - part I Henrik Madsen Poul Thyregod Informatics and Mathematical Modelling Technical University of Denmark DK-2800 Kgs. Lyngby

More information

MATH 304 Linear Algebra Lecture 9: Subspaces of vector spaces (continued). Span. Spanning set.

MATH 304 Linear Algebra Lecture 9: Subspaces of vector spaces (continued). Span. Spanning set. MATH 304 Linear Algebra Lecture 9: Subspaces of vector spaces (continued). Span. Spanning set. Vector space A vector space is a set V equipped with two operations, addition V V (x,y) x + y V and scalar

More information

In order to describe motion you need to describe the following properties.

In order to describe motion you need to describe the following properties. Chapter 2 One Dimensional Kinematics How would you describe the following motion? Ex: random 1-D path speeding up and slowing down In order to describe motion you need to describe the following properties.

More information

Simple Regression Theory II 2010 Samuel L. Baker

Simple Regression Theory II 2010 Samuel L. Baker SIMPLE REGRESSION THEORY II 1 Simple Regression Theory II 2010 Samuel L. Baker Assessing how good the regression equation is likely to be Assignment 1A gets into drawing inferences about how close the

More information

Data Structures and Algorithms

Data Structures and Algorithms Data Structures and Algorithms Computational Complexity Escola Politècnica Superior d Alcoi Universitat Politècnica de València Contents Introduction Resources consumptions: spatial and temporal cost Costs

More information

MATH. ALGEBRA I HONORS 9 th Grade 12003200 ALGEBRA I HONORS

MATH. ALGEBRA I HONORS 9 th Grade 12003200 ALGEBRA I HONORS * Students who scored a Level 3 or above on the Florida Assessment Test Math Florida Standards (FSA-MAFS) are strongly encouraged to make Advanced Placement and/or dual enrollment courses their first choices

More information

Big Ideas in Mathematics

Big Ideas in Mathematics Big Ideas in Mathematics which are important to all mathematics learning. (Adapted from the NCTM Curriculum Focal Points, 2006) The Mathematics Big Ideas are organized using the PA Mathematics Standards

More information

Review of basic statistics and the simplest forecasting model: the sample mean

Review of basic statistics and the simplest forecasting model: the sample mean Review of basic statistics and the simplest forecasting model: the sample mean Robert Nau Fuqua School of Business, Duke University August 2014 Most of what you need to remember about basic statistics

More information

SAN DIEGO COMMUNITY COLLEGE DISTRICT CITY COLLEGE ASSOCIATE DEGREE COURSE OUTLINE

SAN DIEGO COMMUNITY COLLEGE DISTRICT CITY COLLEGE ASSOCIATE DEGREE COURSE OUTLINE MATH 098 CIC Approval: BOT APPROVAL: STATE APPROVAL: EFFECTIVE TERM: SAN DIEGO COMMUNITY COLLEGE DISTRICT CITY COLLEGE ASSOCIATE DEGREE COURSE OUTLINE SECTION I SUBJECT AREA AND COURSE NUMBER: Mathematics

More information

Multivariate Normal Distribution

Multivariate Normal Distribution Multivariate Normal Distribution Lecture 4 July 21, 2011 Advanced Multivariate Statistical Methods ICPSR Summer Session #2 Lecture #4-7/21/2011 Slide 1 of 41 Last Time Matrices and vectors Eigenvalues

More information

In mathematics, there are four attainment targets: using and applying mathematics; number and algebra; shape, space and measures, and handling data.

In mathematics, there are four attainment targets: using and applying mathematics; number and algebra; shape, space and measures, and handling data. MATHEMATICS: THE LEVEL DESCRIPTIONS In mathematics, there are four attainment targets: using and applying mathematics; number and algebra; shape, space and measures, and handling data. Attainment target

More information

CALCULATIONS & STATISTICS

CALCULATIONS & STATISTICS CALCULATIONS & STATISTICS CALCULATION OF SCORES Conversion of 1-5 scale to 0-100 scores When you look at your report, you will notice that the scores are reported on a 0-100 scale, even though respondents

More information

LOGNORMAL MODEL FOR STOCK PRICES

LOGNORMAL MODEL FOR STOCK PRICES LOGNORMAL MODEL FOR STOCK PRICES MICHAEL J. SHARPE MATHEMATICS DEPARTMENT, UCSD 1. INTRODUCTION What follows is a simple but important model that will be the basis for a later study of stock prices as

More information

Using simulation to calculate the NPV of a project

Using simulation to calculate the NPV of a project Using simulation to calculate the NPV of a project Marius Holtan Onward Inc. 5/31/2002 Monte Carlo simulation is fast becoming the technology of choice for evaluating and analyzing assets, be it pure financial

More information

Algebra I Credit Recovery

Algebra I Credit Recovery Algebra I Credit Recovery COURSE DESCRIPTION: The purpose of this course is to allow the student to gain mastery in working with and evaluating mathematical expressions, equations, graphs, and other topics,

More information

096 Professional Readiness Examination (Mathematics)

096 Professional Readiness Examination (Mathematics) 096 Professional Readiness Examination (Mathematics) Effective after October 1, 2013 MI-SG-FLD096M-02 TABLE OF CONTENTS PART 1: General Information About the MTTC Program and Test Preparation OVERVIEW

More information

CURVE FITTING LEAST SQUARES APPROXIMATION

CURVE FITTING LEAST SQUARES APPROXIMATION CURVE FITTING LEAST SQUARES APPROXIMATION Data analysis and curve fitting: Imagine that we are studying a physical system involving two quantities: x and y Also suppose that we expect a linear relationship

More information