Simple linear regression

Size: px
Start display at page:

Download "Simple linear regression"

Transcription

1 Simple liear regressio Tro Aders Moger 3..7 Example 6: Populatio proportios Oe sample X Assume X ~ Bi(, P, so that P ˆ is a frequecy. P The ~ N(, P( P / (approximately, for large P Thus ~ N(, ( / (approximately, for large ( Thus P Z / < P Cofidece iterval for P Z < ˆ + ( α P Zα / (, + Z α / α / ( α Example 6 (Hypothesis testig Example 7: Differeces betwee populatio proportios-two samples Hypotheses: H :PP H :P P Test statistic P ~ N(, P ( P uder H, for large Reject H if P < P ( P Z α /,or if P P ( P > Z α / Assume X ~ Bi(, P ad X ~ Bi(, P, so that ˆ X P ad ˆ X P are frequecies ˆ P ( P P ~ N (, P ( P P ( P The + Cofidece iterval for P -P ± Z α / ( ( + (approximately

2 Example 7 (Hypothesis testig Hypotheses: H :P P H :P P ~ Test statistic ( ˆ ˆ ( ˆ P P P + where ˆ ˆ ˆ P + P P + Reject H if N > ˆ ˆ ˆ ( P P ( P + Z α / (, Spotaous abortios amog surgical urses ad other urses Wat to test if there is differece betwee the proportios of abortios i the two groups H : P op.urses P others H : P op.urses P others No. iterviewed No. pregacies No. abortios Percet abortios Surgical urses Other urses Calculatio: P.78 P Total o. abortios + 3 p.86 Total o. pregacies z ( (.86.4 P-value.444.%, reject H o 5%- sig.level (ca t do this i SPSS 95% cofidece iterval for P -P : ˆ ˆ ˆ ( ˆ ˆ ˆ P ( P P P ( P P ±.96 * + (.5,.9 Repetitio: Testig: Idetify data; cotiuous->t-tests; proportios- >Normal approx. to biomial dist. If cotious: oe-sample, matched pairs, two idepedet samples? Assumptios: Are data ormally distributed? If two id. samples, equal variaces i both groups? FormulateH ad H (H is always o differece, o effect of treatmet etc., choose sig. level (α5% Calculate test statistic

3 Iferece: Test statistic usually stadardized; (estimator-expected value of estimator uder H /(estimated stadard error Gives you a locatio o the x-axis i a distributio Compare this value to the value at the.5%-percetile ad 97.5%-percetile of the distributio If smaller tha the.5%-percetile or larger tha the 97.5%-percetile, reject H P-value: Area i the tails of the distributio below value of test statistic+area above value of test-statistic (twosided testig If smaller tha.5, reject H If cofidece iterval for mea or mea differece (depeds o test what you use does ot iclude H value from, reject H Last week: Looked at cotiuous, ormally distributed variables Used t-tests to see if there was sigificat differece betwee meas i two groups How strog is the relatioship betwee two such variables? Correlatio What if oe wats to study the relatioship betwee several such variables? Liear regressio Coectio betwee variables kostad kostad år areal We would like to study coectio betwee x ad y! Data from the first obligatory assigmet: Birth weight ad smokig Childre of 89 wome Low birth weight is a medical risk factor Does mother s smokig status have ay ifluece o the birth weight? Also iterested i relatioship with other variables: Mother s age, mother s weight, high blood pressure, ethicity etc. 3

4 Is birth weight ormally distributed? Q-Q plot (check Normality plots with tests uder plots: Histogram From explore i SPSS Normal Q-Q Plot of birthweight 5 3 Frequecy 5 Expected Normal Mea 944,656 Std. Dev. 79,4 N 89,, 3, 4, 5, birthweight Observed Value Tests for ormality: Pearsos correlatio coefficiet r Tests of Normality birthweight Kolmogorov-Smirov(a Statistic,43 89 Sig.,(* * This is a lower boud of the true sigificace. a Liljefors Sigificace Correctio df Shapiro-Wilk Statistic The ull hypothesis is that the data are ormal. Large p- value idicates ormal distributio. For large samples, the p-value teds to be low. The graphical methods are more importat,99 df 89 Sig.,438 Measures the liear relatioship betwee variables r: All data lie o a icreasig straight lie r-: All data lie o a decreasig straight lie r: No liear relatioship I liear regressio, ofte use R (r as a measure of the explaatory power of the model R close to meas that the observatios are close to the lie, r close to meas that there is o liear relatioship betwee the observatios 4

5 Testig for correlatio It is also possible to test whether a sample correlatio r is large eough to idicate a ozero populatio correlatio Test statistic: r ~ t r Note: The test oly works for ormal distributios ad liear correlatios: Always also ivestigate scatter plot! Pearsos correlatio coefficiet i SPSS: Aalyze->Correlate->bivariate Check Pearso Tests if r is sigificatly differet from Null hypothesis is that r The variables have to be ormally distributed Idepedece betwee observatios Example: Correlatio from SPSS: 5, birthweight 4, 3,,, birthweight weight i pouds Correlatios Pearso Correlatio Sig. (-tailed N Pearso Correlatio Sig. (-tailed N weight i birthweight pouds,86*, 89 89,86*, *. Correlatio is sigificat at the.5 level (-tailed., 5,, 5,, 5, weight i pouds 5

6 If the data are ot ormally distributed: Spearmas rak correlatio, r s Spearma correlatio: Measures all mootoous relatioships, ot oly liear oes No distributio assumptios r s is betwee - ad, similar to Pearsos correlatio coefficiet I SPSS: Aalyze->Correlate->bivariate Check Spearma Also provides a test o whether r s is differet from Correlatios Spearma's rho birthweight Correlatio Coefficiet Sig. (-tailed N weight i pouds Correlatio Coefficiet Sig. (-tailed N **. Correlatio is sigificat at the. level (-tailed. weight i birthweight pouds,,48**., 89 89,48**,, Liear regressio Coectio betwee variables Wish to fit a lie as close to the observed data (two ormally distributed varaibles as possible Example: Birth weighta+b*mother s weight I SPSS: Aalyze->Regressio->Liear Click Statistics ad check Cofidece iterval for B Choose oe variable as depedet (Birth weight as depedet, ad oe variable (mother s weight as idepedet Importat to kow which variable is your depedet variable! kostad areal Fit a lie! 6

7 The stadard simple regressio model We defie a model Yi β + βxi + ε i where ε i are idepedet, ormally distributed, with equal variace σ We ca the use data to estimate the model parameters, ad to make statemets about their ucertaity What ca you do with a fitted lie? Iterpolatio Extrapolatio (sometimes dagerous! Iterpret the parameters of the lie How to defie the lie that fits best? How to compute the lie fit with the least squares method? The sum of the squares of the errors miimized Least squares method! Note: May other ways to fit the lie ca be imagied Let (x, y, (x, y,...,(x, y deote the poits i the plae. Fid a ad b so that ya+bx fit the poits by miimizig Solutio: y + ( a + bx y + + ( a + bx y ( a + bxi yi i S ( a + bx L b xi yi ( xi ( yi xi yi ( xi ( xi xi xi xy yi b xi a y bx where x x, i y yi ad all sums are doe for i,...,. 7

8 y How do you get this aswer? Differetiate S with respect to a og b, ad set the result to S ( a + bx i y i a i S ( a + bxi yi xi b We get: i a + b( x i yi ( xi + b( xi xi yi a This is two equatios with two ukows, ad the solutio of these give the aswer. y agaist x x agaist y Liear regressio of y agaist x does ot give the same result as the opposite. Regressio of y agaist x Regressio of x agaist y x Aaylzig the variace What is the logic behid R? Defie SSE: Error sum of squares ( a+ bxi yi SSR: Regressio sum of squares ( a+ bxi y SST: Total sum of squares ( yi y We ca show that SST SSR + SSE SSR SSE Defie R corr( x, y R SST SST is the coefficiet of determiatio y y a+ bx ˆi i SST yi y x x i ε SSE y yˆ i i i SSR yˆi y 8

9 Assumptios Usually check that the depedet variable is ormally distributed More formally, the residuals, i.e. the distace from each observatio to the lie, should be ormally distributed I SPSS: I liear regressio, click Statistics. Uder residuals check casewise diagostics, ad you will get outliers larger tha 3 or less tha -3 i a separate table. I liear regressio, also click Plots. Uder stadardized residuals plots, check Histogram ad Normal probability plot. Choose *Zresid as y-variable ad *Zpred as x-variable Example: Regressio of birth weight with mother s weight as idepedet variable Model Model Model Summary b Adjusted Std. Error of R R Square R Square the Estimate,86 a,35,9 78,47 a. Predictors: (Costat, weight i pouds b. Depedet Variable: birthweight Model Regressio Residual Total ANOVA b Sum of Squares df Mea Square F Sig ,3 6,686, a , a. Predictors: (Costat, weight i pouds b. Depedet Variable: birthweight (Costat weight i pouds a. Depedet Variable: birthweight Coefficiets a Ustadardized Stadardized Coefficiets Coefficiets 95% Cofidece Iterval for B B Std. Error Beta t Sig. Lower Boud Upper Boud 369,67 8,43,374, 99,4 8,34 4,49,73,86,586,,5 7,89 Residuals: Check of assumptios: Casewise Diagostics a Histogram Predicted Case Number Std. Residual birthweight Value Residual -3,5 79, 9,837-9,8 a. Depedet Variable: birthweight 3 Depedet Variable: birthweight Residuals Statistics a 5 Miimum Maximum Mea Std. Deviatio N Predicted Value 74,3 3476, ,656 35, Residual -9,8 75,59, 76, Std. Predicted Value -,69 3,93,, 89 Std. Residual -3,5,89,, a. Depedet Variable: birthweight Frequecy Regressio Stadardized Residual Mea 6,77E-7 Std. Dev.,997 N 89 9

10 Check of assumptios cot d: Check of assumptios cot d: Normal P-P Plot of Regressio Stadardized Residual Scatterplot, Depedet Variable: birthweight 3 Depedet Variable: birthweight Expected Cum Prob,8,6,4, Regressio Stadardized Residual ,,,,4,6,8, Observed Cum Prob Regressio Stadardized Predicted Value Iterpretatio: Have fitted the lie Birth weight *mother s weight If mother s weight icreases by pouds, what is the predicted impact o ifat s birth weight? 4.49*89 grams What s the predicted birth weight of a ifat with a 5 poud mother? *5334 grams Ifluece of extreme observatios NOTE: The result of a regressio aalysis is very much iflueced by poits with extreme values, i either the x or the y directio. Always ivestigate visually, ad determie if outliers are actually erroeous observatios

11 But how to aswer questios like: Give that a positive slope (b has bee estimated: Does it give a reproducible idicatio that there is a positive tred, or is it a result of radom variatio? What is a cofidece iterval for the estimated slope? What is the predictio, with ucertaity, at a ew x value? Cofidece itervals for simple regressio I a simple regressio model, β a estimates b estimates β ˆ σ SSE /( Also, ( b β / Sb ~ t ˆ σ where Sb ( sx of b estimates So a cofidece iterval for by b± t, α /Sb σ estimates variace β is give Hypothesis testig for simple regressio Choose hypotheses: H : β H: β Test statistic: b/ S ~ b t RejectH if b S t α or / b <, / b/ Sb > t, α /

1 Correlation and Regression Analysis

1 Correlation and Regression Analysis 1 Correlatio ad Regressio Aalysis I this sectio we will be ivestigatig the relatioship betwee two cotiuous variable, such as height ad weight, the cocetratio of a ijected drug ad heart rate, or the cosumptio

More information

Inference on Proportion. Chapter 8 Tests of Statistical Hypotheses. Sampling Distribution of Sample Proportion. Confidence Interval

Inference on Proportion. Chapter 8 Tests of Statistical Hypotheses. Sampling Distribution of Sample Proportion. Confidence Interval Chapter 8 Tests of Statistical Hypotheses 8. Tests about Proportios HT - Iferece o Proportio Parameter: Populatio Proportio p (or π) (Percetage of people has o health isurace) x Statistic: Sample Proportio

More information

PSYCHOLOGICAL STATISTICS

PSYCHOLOGICAL STATISTICS UNIVERSITY OF CALICUT SCHOOL OF DISTANCE EDUCATION B Sc. Cousellig Psychology (0 Adm.) IV SEMESTER COMPLEMENTARY COURSE PSYCHOLOGICAL STATISTICS QUESTION BANK. Iferetial statistics is the brach of statistics

More information

Lesson 17 Pearson s Correlation Coefficient

Lesson 17 Pearson s Correlation Coefficient Outlie Measures of Relatioships Pearso s Correlatio Coefficiet (r) -types of data -scatter plots -measure of directio -measure of stregth Computatio -covariatio of X ad Y -uique variatio i X ad Y -measurig

More information

Hypothesis testing. Null and alternative hypotheses

Hypothesis testing. Null and alternative hypotheses Hypothesis testig Aother importat use of samplig distributios is to test hypotheses about populatio parameters, e.g. mea, proportio, regressio coefficiets, etc. For example, it is possible to stipulate

More information

Z-TEST / Z-STATISTIC: used to test hypotheses about. µ when the population standard deviation is unknown

Z-TEST / Z-STATISTIC: used to test hypotheses about. µ when the population standard deviation is unknown Z-TEST / Z-STATISTIC: used to test hypotheses about µ whe the populatio stadard deviatio is kow ad populatio distributio is ormal or sample size is large T-TEST / T-STATISTIC: used to test hypotheses about

More information

Chapter 14 Nonparametric Statistics

Chapter 14 Nonparametric Statistics Chapter 14 Noparametric Statistics A.K.A. distributio-free statistics! Does ot deped o the populatio fittig ay particular type of distributio (e.g, ormal). Sice these methods make fewer assumptios, they

More information

5: Introduction to Estimation

5: Introduction to Estimation 5: Itroductio to Estimatio Cotets Acroyms ad symbols... 1 Statistical iferece... Estimatig µ with cofidece... 3 Samplig distributio of the mea... 3 Cofidece Iterval for μ whe σ is kow before had... 4 Sample

More information

One-sample test of proportions

One-sample test of proportions Oe-sample test of proportios The Settig: Idividuals i some populatio ca be classified ito oe of two categories. You wat to make iferece about the proportio i each category, so you draw a sample. Examples:

More information

MEI Structured Mathematics. Module Summary Sheets. Statistics 2 (Version B: reference to new book)

MEI Structured Mathematics. Module Summary Sheets. Statistics 2 (Version B: reference to new book) MEI Mathematics i Educatio ad Idustry MEI Structured Mathematics Module Summary Sheets Statistics (Versio B: referece to ew book) Topic : The Poisso Distributio Topic : The Normal Distributio Topic 3:

More information

1. C. The formula for the confidence interval for a population mean is: x t, which was

1. C. The formula for the confidence interval for a population mean is: x t, which was s 1. C. The formula for the cofidece iterval for a populatio mea is: x t, which was based o the sample Mea. So, x is guarateed to be i the iterval you form.. D. Use the rule : p-value

More information

Non-life insurance mathematics. Nils F. Haavardsson, University of Oslo and DNB Skadeforsikring

Non-life insurance mathematics. Nils F. Haavardsson, University of Oslo and DNB Skadeforsikring No-life isurace mathematics Nils F. Haavardsso, Uiversity of Oslo ad DNB Skadeforsikrig Mai issues so far Why does isurace work? How is risk premium defied ad why is it importat? How ca claim frequecy

More information

THE REGRESSION MODEL IN MATRIX FORM. For simple linear regression, meaning one predictor, the model is. for i = 1, 2, 3,, n

THE REGRESSION MODEL IN MATRIX FORM. For simple linear regression, meaning one predictor, the model is. for i = 1, 2, 3,, n We will cosider the liear regressio model i matrix form. For simple liear regressio, meaig oe predictor, the model is i = + x i + ε i for i =,,,, This model icludes the assumptio that the ε i s are a sample

More information

THE TWO-VARIABLE LINEAR REGRESSION MODEL

THE TWO-VARIABLE LINEAR REGRESSION MODEL THE TWO-VARIABLE LINEAR REGRESSION MODEL Herma J. Bieres Pesylvaia State Uiversity April 30, 202. Itroductio Suppose you are a ecoomics or busiess maor i a college close to the beach i the souther part

More information

GCSE STATISTICS. 4) How to calculate the range: The difference between the biggest number and the smallest number.

GCSE STATISTICS. 4) How to calculate the range: The difference between the biggest number and the smallest number. GCSE STATISTICS You should kow: 1) How to draw a frequecy diagram: e.g. NUMBER TALLY FREQUENCY 1 3 5 ) How to draw a bar chart, a pictogram, ad a pie chart. 3) How to use averages: a) Mea - add up all

More information

Confidence Intervals for One Mean

Confidence Intervals for One Mean Chapter 420 Cofidece Itervals for Oe Mea Itroductio This routie calculates the sample size ecessary to achieve a specified distace from the mea to the cofidece limit(s) at a stated cofidece level for a

More information

Determining the sample size

Determining the sample size Determiig the sample size Oe of the most commo questios ay statisticia gets asked is How large a sample size do I eed? Researchers are ofte surprised to fid out that the aswer depeds o a umber of factors

More information

Chapter 7 Methods of Finding Estimators

Chapter 7 Methods of Finding Estimators Chapter 7 for BST 695: Special Topics i Statistical Theory. Kui Zhag, 011 Chapter 7 Methods of Fidig Estimators Sectio 7.1 Itroductio Defiitio 7.1.1 A poit estimator is ay fuctio W( X) W( X1, X,, X ) of

More information

0.7 0.6 0.2 0 0 96 96.5 97 97.5 98 98.5 99 99.5 100 100.5 96.5 97 97.5 98 98.5 99 99.5 100 100.5

0.7 0.6 0.2 0 0 96 96.5 97 97.5 98 98.5 99 99.5 100 100.5 96.5 97 97.5 98 98.5 99 99.5 100 100.5 Sectio 13 Kolmogorov-Smirov test. Suppose that we have a i.i.d. sample X 1,..., X with some ukow distributio P ad we would like to test the hypothesis that P is equal to a particular distributio P 0, i.e.

More information

Chapter 7: Confidence Interval and Sample Size

Chapter 7: Confidence Interval and Sample Size Chapter 7: Cofidece Iterval ad Sample Size Learig Objectives Upo successful completio of Chapter 7, you will be able to: Fid the cofidece iterval for the mea, proportio, ad variace. Determie the miimum

More information

Case Study. Normal and t Distributions. Density Plot. Normal Distributions

Case Study. Normal and t Distributions. Density Plot. Normal Distributions Case Study Normal ad t Distributios Bret Halo ad Bret Larget Departmet of Statistics Uiversity of Wiscosi Madiso October 11 13, 2011 Case Study Body temperature varies withi idividuals over time (it ca

More information

Mann-Whitney U 2 Sample Test (a.k.a. Wilcoxon Rank Sum Test)

Mann-Whitney U 2 Sample Test (a.k.a. Wilcoxon Rank Sum Test) No-Parametric ivariate Statistics: Wilcoxo-Ma-Whitey 2 Sample Test 1 Ma-Whitey 2 Sample Test (a.k.a. Wilcoxo Rak Sum Test) The (Wilcoxo-) Ma-Whitey (WMW) test is the o-parametric equivalet of a pooled

More information

Practice Problems for Test 3

Practice Problems for Test 3 Practice Problems for Test 3 Note: these problems oly cover CIs ad hypothesis testig You are also resposible for kowig the samplig distributio of the sample meas, ad the Cetral Limit Theorem Review all

More information

Confidence intervals and hypothesis tests

Confidence intervals and hypothesis tests Chapter 2 Cofidece itervals ad hypothesis tests This chapter focuses o how to draw coclusios about populatios from sample data. We ll start by lookig at biary data (e.g., pollig), ad lear how to estimate

More information

OMG! Excessive Texting Tied to Risky Teen Behaviors

OMG! Excessive Texting Tied to Risky Teen Behaviors BUSIESS WEEK: EXECUTIVE EALT ovember 09, 2010 OMG! Excessive Textig Tied to Risky Tee Behaviors Kids who sed more tha 120 a day more likely to try drugs, alcohol ad sex, researchers fid TUESDAY, ov. 9

More information

Now here is the important step

Now here is the important step LINEST i Excel The Excel spreadsheet fuctio "liest" is a complete liear least squares curve fittig routie that produces ucertaity estimates for the fit values. There are two ways to access the "liest"

More information

15.075 Exam 3. Instructor: Cynthia Rudin TA: Dimitrios Bisias. November 22, 2011

15.075 Exam 3. Instructor: Cynthia Rudin TA: Dimitrios Bisias. November 22, 2011 15.075 Exam 3 Istructor: Cythia Rudi TA: Dimitrios Bisias November 22, 2011 Gradig is based o demostratio of coceptual uderstadig, so you eed to show all of your work. Problem 1 A compay makes high-defiitio

More information

I. Chi-squared Distributions

I. Chi-squared Distributions 1 M 358K Supplemet to Chapter 23: CHI-SQUARED DISTRIBUTIONS, T-DISTRIBUTIONS, AND DEGREES OF FREEDOM To uderstad t-distributios, we first eed to look at aother family of distributios, the chi-squared distributios.

More information

Unit 8: Inference for Proportions. Chapters 8 & 9 in IPS

Unit 8: Inference for Proportions. Chapters 8 & 9 in IPS Uit 8: Iferece for Proortios Chaters 8 & 9 i IPS Lecture Outlie Iferece for a Proortio (oe samle) Iferece for Two Proortios (two samles) Cotigecy Tables ad the χ test Iferece for Proortios IPS, Chater

More information

Output Analysis (2, Chapters 10 &11 Law)

Output Analysis (2, Chapters 10 &11 Law) B. Maddah ENMG 6 Simulatio 05/0/07 Output Aalysis (, Chapters 10 &11 Law) Comparig alterative system cofiguratio Sice the output of a simulatio is radom, the comparig differet systems via simulatio should

More information

Chapter 7 - Sampling Distributions. 1 Introduction. What is statistics? It consist of three major areas:

Chapter 7 - Sampling Distributions. 1 Introduction. What is statistics? It consist of three major areas: Chapter 7 - Samplig Distributios 1 Itroductio What is statistics? It cosist of three major areas: Data Collectio: samplig plas ad experimetal desigs Descriptive Statistics: umerical ad graphical summaries

More information

, a Wishart distribution with n -1 degrees of freedom and scale matrix.

, a Wishart distribution with n -1 degrees of freedom and scale matrix. UMEÅ UNIVERSITET Matematisk-statistiska istitutioe Multivariat dataaalys D MSTD79 PA TENTAMEN 004-0-9 LÖSNINGSFÖRSLAG TILL TENTAMEN I MATEMATISK STATISTIK Multivariat dataaalys D, 5 poäg.. Assume that

More information

Confidence Intervals. CI for a population mean (σ is known and n > 30 or the variable is normally distributed in the.

Confidence Intervals. CI for a population mean (σ is known and n > 30 or the variable is normally distributed in the. Cofidece Itervals A cofidece iterval is a iterval whose purpose is to estimate a parameter (a umber that could, i theory, be calculated from the populatio, if measuremets were available for the whole populatio).

More information

Overview. Learning Objectives. Point Estimate. Estimation. Estimating the Value of a Parameter Using Confidence Intervals

Overview. Learning Objectives. Point Estimate. Estimation. Estimating the Value of a Parameter Using Confidence Intervals Overview Estimatig the Value of a Parameter Usig Cofidece Itervals We apply the results about the sample mea the problem of estimatio Estimatio is the process of usig sample data estimate the value of

More information

Confidence Intervals for Linear Regression Slope

Confidence Intervals for Linear Regression Slope Chapter 856 Cofidece Iterval for Liear Regreio Slope Itroductio Thi routie calculate the ample ize eceary to achieve a pecified ditace from the lope to the cofidece limit at a tated cofidece level for

More information

Math C067 Sampling Distributions

Math C067 Sampling Distributions Math C067 Samplig Distributios Sample Mea ad Sample Proportio Richard Beigel Some time betwee April 16, 2007 ad April 16, 2007 Examples of Samplig A pollster may try to estimate the proportio of voters

More information

TI-83, TI-83 Plus or TI-84 for Non-Business Statistics

TI-83, TI-83 Plus or TI-84 for Non-Business Statistics TI-83, TI-83 Plu or TI-84 for No-Buie Statitic Chapter 3 Eterig Data Pre [STAT] the firt optio i already highlighted (:Edit) o you ca either pre [ENTER] or. Make ure the curor i i the lit, ot o the lit

More information

Confidence Intervals

Confidence Intervals Cofidece Itervals Cofidece Itervals are a extesio of the cocept of Margi of Error which we met earlier i this course. Remember we saw: The sample proportio will differ from the populatio proportio by more

More information

University of California, Los Angeles Department of Statistics. Distributions related to the normal distribution

University of California, Los Angeles Department of Statistics. Distributions related to the normal distribution Uiversity of Califoria, Los Ageles Departmet of Statistics Statistics 100B Istructor: Nicolas Christou Three importat distributios: Distributios related to the ormal distributio Chi-square (χ ) distributio.

More information

CHAPTER 7: Central Limit Theorem: CLT for Averages (Means)

CHAPTER 7: Central Limit Theorem: CLT for Averages (Means) CHAPTER 7: Cetral Limit Theorem: CLT for Averages (Meas) X = the umber obtaied whe rollig oe six sided die oce. If we roll a six sided die oce, the mea of the probability distributio is X P(X = x) Simulatio:

More information

Outline. Determine Confidence Interval. EEC 686/785 Modeling & Performance Evaluation of Computer Systems. Confidence Interval for The Mean.

Outline. Determine Confidence Interval. EEC 686/785 Modeling & Performance Evaluation of Computer Systems. Confidence Interval for The Mean. EEC 686/785 Modelig & Performace Evaluatio of Computer Systems Lecture 9 Departmet of Electrical ad Computer Egieerig Clevelad State Uiversity webig@ieee.org (based o Dr. Raj jai s lecture otes) Outlie

More information

Measures of Spread and Boxplots Discrete Math, Section 9.4

Measures of Spread and Boxplots Discrete Math, Section 9.4 Measures of Spread ad Boxplots Discrete Math, Sectio 9.4 We start with a example: Example 1: Comparig Mea ad Media Compute the mea ad media of each data set: S 1 = {4, 6, 8, 10, 1, 14, 16} S = {4, 7, 9,

More information

1 Computing the Standard Deviation of Sample Means

1 Computing the Standard Deviation of Sample Means Computig the Stadard Deviatio of Sample Meas Quality cotrol charts are based o sample meas ot o idividual values withi a sample. A sample is a group of items, which are cosidered all together for our aalysis.

More information

Topic 5: Confidence Intervals (Chapter 9)

Topic 5: Confidence Intervals (Chapter 9) Topic 5: Cofidece Iterval (Chapter 9) 1. Itroductio The two geeral area of tatitical iferece are: 1) etimatio of parameter(), ch. 9 ) hypothei tetig of parameter(), ch. 10 Let X be ome radom variable with

More information

Maximum Likelihood Estimators.

Maximum Likelihood Estimators. Lecture 2 Maximum Likelihood Estimators. Matlab example. As a motivatio, let us look at oe Matlab example. Let us geerate a radom sample of size 00 from beta distributio Beta(5, 2). We will lear the defiitio

More information

A Test of Normality. 1 n S 2 3. n 1. Now introduce two new statistics. The sample skewness is defined as:

A Test of Normality. 1 n S 2 3. n 1. Now introduce two new statistics. The sample skewness is defined as: A Test of Normality Textbook Referece: Chapter. (eighth editio, pages 59 ; seveth editio, pages 6 6). The calculatio of p values for hypothesis testig typically is based o the assumptio that the populatio

More information

Descriptive Statistics

Descriptive Statistics Descriptive Statistics We leared to describe data sets graphically. We ca also describe a data set umerically. Measures of Locatio Defiitio The sample mea is the arithmetic average of values. We deote

More information

Lesson 15 ANOVA (analysis of variance)

Lesson 15 ANOVA (analysis of variance) Outlie Variability -betwee group variability -withi group variability -total variability -F-ratio Computatio -sums of squares (betwee/withi/total -degrees of freedom (betwee/withi/total -mea square (betwee/withi

More information

Hypothesis testing using complex survey data

Hypothesis testing using complex survey data Hypotesis testig usig complex survey data A Sort Course preseted by Peter Ly, Uiversity of Essex i associatio wit te coferece of te Europea Survey Researc Associatio Prague, 5 Jue 007 1 1. Objective: Simple

More information

The following example will help us understand The Sampling Distribution of the Mean. C1 C2 C3 C4 C5 50 miles 84 miles 38 miles 120 miles 48 miles

The following example will help us understand The Sampling Distribution of the Mean. C1 C2 C3 C4 C5 50 miles 84 miles 38 miles 120 miles 48 miles The followig eample will help us uderstad The Samplig Distributio of the Mea Review: The populatio is the etire collectio of all idividuals or objects of iterest The sample is the portio of the populatio

More information

Center, Spread, and Shape in Inference: Claims, Caveats, and Insights

Center, Spread, and Shape in Inference: Claims, Caveats, and Insights Ceter, Spread, ad Shape i Iferece: Claims, Caveats, ad Isights Dr. Nacy Pfeig (Uiversity of Pittsburgh) AMATYC November 2008 Prelimiary Activities 1. I would like to produce a iterval estimate for the

More information

Sampling Distribution And Central Limit Theorem

Sampling Distribution And Central Limit Theorem () Samplig Distributio & Cetral Limit Samplig Distributio Ad Cetral Limit Samplig distributio of the sample mea If we sample a umber of samples (say k samples where k is very large umber) each of size,

More information

Biology 171L Environment and Ecology Lab Lab 2: Descriptive Statistics, Presenting Data and Graphing Relationships

Biology 171L Environment and Ecology Lab Lab 2: Descriptive Statistics, Presenting Data and Graphing Relationships Biology 171L Eviromet ad Ecology Lab Lab : Descriptive Statistics, Presetig Data ad Graphig Relatioships Itroductio Log lists of data are ofte ot very useful for idetifyig geeral treds i the data or the

More information

Statistical inference: example 1. Inferential Statistics

Statistical inference: example 1. Inferential Statistics Statistical iferece: example 1 Iferetial Statistics POPULATION SAMPLE A clothig store chai regularly buys from a supplier large quatities of a certai piece of clothig. Each item ca be classified either

More information

Parametric (theoretical) probability distributions. (Wilks, Ch. 4) Discrete distributions: (e.g., yes/no; above normal, normal, below normal)

Parametric (theoretical) probability distributions. (Wilks, Ch. 4) Discrete distributions: (e.g., yes/no; above normal, normal, below normal) 6 Parametric (theoretical) probability distributios. (Wilks, Ch. 4) Note: parametric: assume a theoretical distributio (e.g., Gauss) No-parametric: o assumptio made about the distributio Advatages of assumig

More information

Normal Distribution.

Normal Distribution. Normal Distributio www.icrf.l Normal distributio I probability theory, the ormal or Gaussia distributio, is a cotiuous probability distributio that is ofte used as a first approimatio to describe realvalued

More information

SAMPLE QUESTIONS FOR FINAL EXAM. (1) (2) (3) (4) Find the following using the definition of the Riemann integral: (2x + 1)dx

SAMPLE QUESTIONS FOR FINAL EXAM. (1) (2) (3) (4) Find the following using the definition of the Riemann integral: (2x + 1)dx SAMPLE QUESTIONS FOR FINAL EXAM REAL ANALYSIS I FALL 006 3 4 Fid the followig usig the defiitio of the Riema itegral: a 0 x + dx 3 Cosider the partitio P x 0 3, x 3 +, x 3 +,......, x 3 3 + 3 of the iterval

More information

Analyzing Longitudinal Data from Complex Surveys Using SUDAAN

Analyzing Longitudinal Data from Complex Surveys Using SUDAAN Aalyzig Logitudial Data from Complex Surveys Usig SUDAAN Darryl Creel Statistics ad Epidemiology, RTI Iteratioal, 312 Trotter Farm Drive, Rockville, MD, 20850 Abstract SUDAAN: Software for the Statistical

More information

Data Analysis and Statistical Behaviors of Stock Market Fluctuations

Data Analysis and Statistical Behaviors of Stock Market Fluctuations 44 JOURNAL OF COMPUTERS, VOL. 3, NO. 0, OCTOBER 2008 Data Aalysis ad Statistical Behaviors of Stock Market Fluctuatios Ju Wag Departmet of Mathematics, Beijig Jiaotog Uiversity, Beijig 00044, Chia Email:

More information

Research Method (I) --Knowledge on Sampling (Simple Random Sampling)

Research Method (I) --Knowledge on Sampling (Simple Random Sampling) Research Method (I) --Kowledge o Samplig (Simple Radom Samplig) 1. Itroductio to samplig 1.1 Defiitio of samplig Samplig ca be defied as selectig part of the elemets i a populatio. It results i the fact

More information

Example: Probability ($1 million in S&P 500 Index will decline by more than 20% within a

Example: Probability ($1 million in S&P 500 Index will decline by more than 20% within a Value at Risk For a give portfolio, Value-at-Risk (VAR) is defied as the umber VAR such that: Pr( Portfolio loses more tha VAR withi time period t)

More information

Properties of MLE: consistency, asymptotic normality. Fisher information.

Properties of MLE: consistency, asymptotic normality. Fisher information. Lecture 3 Properties of MLE: cosistecy, asymptotic ormality. Fisher iformatio. I this sectio we will try to uderstad why MLEs are good. Let us recall two facts from probability that we be used ofte throughout

More information

hp calculators HP 12C Statistics - average and standard deviation Average and standard deviation concepts HP12C average and standard deviation

hp calculators HP 12C Statistics - average and standard deviation Average and standard deviation concepts HP12C average and standard deviation HP 1C Statistics - average ad stadard deviatio Average ad stadard deviatio cocepts HP1C average ad stadard deviatio Practice calculatig averages ad stadard deviatios with oe or two variables HP 1C Statistics

More information

A modified Kolmogorov-Smirnov test for normality

A modified Kolmogorov-Smirnov test for normality MPRA Muich Persoal RePEc Archive A modified Kolmogorov-Smirov test for ormality Zvi Drezer ad Ofir Turel ad Dawit Zerom Califoria State Uiversity-Fullerto 22. October 2008 Olie at http://mpra.ub.ui-mueche.de/14385/

More information

A Mathematical Perspective on Gambling

A Mathematical Perspective on Gambling A Mathematical Perspective o Gamblig Molly Maxwell Abstract. This paper presets some basic topics i probability ad statistics, icludig sample spaces, probabilistic evets, expectatios, the biomial ad ormal

More information

STATISTICAL METHODS FOR BUSINESS

STATISTICAL METHODS FOR BUSINESS STATISTICAL METHODS FOR BUSINESS UNIT 7: INFERENTIAL TOOLS. DISTRIBUTIONS ASSOCIATED WITH SAMPLING 7.1.- Distributios associated with the samplig process. 7.2.- Iferetial processes ad relevat distributios.

More information

Chapter 5: Basic Linear Regression

Chapter 5: Basic Linear Regression Chapter 5: Basic Liear Regressio 1. Why Regressio Aalysis Has Domiated Ecoometrics By ow we have focused o formig estimates ad tests for fairly simple cases ivolvig oly oe variable at a time. But the core

More information

Forecasting. Forecasting Application. Practical Forecasting. Chapter 7 OVERVIEW KEY CONCEPTS. Chapter 7. Chapter 7

Forecasting. Forecasting Application. Practical Forecasting. Chapter 7 OVERVIEW KEY CONCEPTS. Chapter 7. Chapter 7 Forecastig Chapter 7 Chapter 7 OVERVIEW Forecastig Applicatios Qualitative Aalysis Tred Aalysis ad Projectio Busiess Cycle Expoetial Smoothig Ecoometric Forecastig Judgig Forecast Reliability Choosig the

More information

STA 2023 Practice Questions Exam 2 Chapter 7- sec 9.2. Case parameter estimator standard error Estimate of standard error

STA 2023 Practice Questions Exam 2 Chapter 7- sec 9.2. Case parameter estimator standard error Estimate of standard error STA 2023 Practice Questios Exam 2 Chapter 7- sec 9.2 Formulas Give o the test: Case parameter estimator stadard error Estimate of stadard error Samplig Distributio oe mea x s t (-1) oe p ( 1 p) CI: prop.

More information

TESTING FOR GRANGER CAUSALITY BETWEEN RENEWABLE ENERGY CONSUMPTION, GDP, CO 2 EMISSION, AND FOSSIL FUEL PRICES IN THE USA.

TESTING FOR GRANGER CAUSALITY BETWEEN RENEWABLE ENERGY CONSUMPTION, GDP, CO 2 EMISSION, AND FOSSIL FUEL PRICES IN THE USA. TESTING FOR GRANGER CAUSALITY BETWEEN RENEWABLE ENERGY CONSUMPTION, GDP, CO 2 EMISSION, AND FOSSIL FUEL PRICES IN THE USA. MustafaYasiYeice Mustafa.yeice@udc.edu Valboa Bejleri vbejleri@udc.edu Uiversity

More information

Institute of Actuaries of India Subject CT1 Financial Mathematics

Institute of Actuaries of India Subject CT1 Financial Mathematics Istitute of Actuaries of Idia Subject CT1 Fiacial Mathematics For 2014 Examiatios Subject CT1 Fiacial Mathematics Core Techical Aim The aim of the Fiacial Mathematics subject is to provide a groudig i

More information

TI-89, TI-92 Plus or Voyage 200 for Non-Business Statistics

TI-89, TI-92 Plus or Voyage 200 for Non-Business Statistics Chapter 3 TI-89, TI-9 Plu or Voyage 00 for No-Buie Statitic Eterig Data Pre [APPS], elect FlahApp the pre [ENTER]. Highlight Stat/Lit Editor the pre [ENTER]. Pre [ENTER] agai to elect the mai folder. (Note:

More information

AP Calculus AB 2006 Scoring Guidelines Form B

AP Calculus AB 2006 Scoring Guidelines Form B AP Calculus AB 6 Scorig Guidelies Form B The College Board: Coectig Studets to College Success The College Board is a ot-for-profit membership associatio whose missio is to coect studets to college success

More information

3. If x and y are real numbers, what is the simplified radical form

3. If x and y are real numbers, what is the simplified radical form lgebra II Practice Test Objective:.a. Which is equivalet to 98 94 4 49?. Which epressio is aother way to write 5 4? 5 5 4 4 4 5 4 5. If ad y are real umbers, what is the simplified radical form of 5 y

More information

Exploratory Data Analysis

Exploratory Data Analysis 1 Exploratory Data Aalysis Exploratory data aalysis is ofte the rst step i a statistical aalysis, for it helps uderstadig the mai features of the particular sample that a aalyst is usig. Itelliget descriptios

More information

7. Concepts in Probability, Statistics and Stochastic Modelling

7. Concepts in Probability, Statistics and Stochastic Modelling 7. Cocepts i Probability, Statistics ad Stochastic Modellig 1. Itroductio 169. Probability Cocepts ad Methods 170.1. Radom Variables ad Distributios 170.. Expectatio 173.3. Quatiles, Momets ad Their Estimators

More information

Multi-server Optimal Bandwidth Monitoring for QoS based Multimedia Delivery Anup Basu, Irene Cheng and Yinzhe Yu

Multi-server Optimal Bandwidth Monitoring for QoS based Multimedia Delivery Anup Basu, Irene Cheng and Yinzhe Yu Multi-server Optimal Badwidth Moitorig for QoS based Multimedia Delivery Aup Basu, Iree Cheg ad Yizhe Yu Departmet of Computig Sciece U. of Alberta Architecture Applicatio Layer Request receptio -coectio

More information

HCL Dynamic Spiking Protocol

HCL Dynamic Spiking Protocol ELI LILLY AND COMPANY TIPPECANOE LABORATORIES LAFAYETTE, IN Revisio 2.0 TABLE OF CONTENTS REVISION HISTORY... 2. REVISION.0... 2.2 REVISION 2.0... 2 2 OVERVIEW... 3 3 DEFINITIONS... 5 4 EQUIPMENT... 7

More information

CONTROL CHART BASED ON A MULTIPLICATIVE-BINOMIAL DISTRIBUTION

CONTROL CHART BASED ON A MULTIPLICATIVE-BINOMIAL DISTRIBUTION www.arpapress.com/volumes/vol8issue2/ijrras_8_2_04.pdf CONTROL CHART BASED ON A MULTIPLICATIVE-BINOMIAL DISTRIBUTION Elsayed A. E. Habib Departmet of Statistics ad Mathematics, Faculty of Commerce, Beha

More information

This document contains a collection of formulas and constants useful for SPC chart construction. It assumes you are already familiar with SPC.

This document contains a collection of formulas and constants useful for SPC chart construction. It assumes you are already familiar with SPC. SPC Formulas ad Tables 1 This documet cotais a collectio of formulas ad costats useful for SPC chart costructio. It assumes you are already familiar with SPC. Termiology Geerally, a bar draw over a symbol

More information

Overview of some probability distributions.

Overview of some probability distributions. Lecture Overview of some probability distributios. I this lecture we will review several commo distributios that will be used ofte throughtout the class. Each distributio is usually described by its probability

More information

Incremental calculation of weighted mean and variance

Incremental calculation of weighted mean and variance Icremetal calculatio of weighted mea ad variace Toy Fich faf@cam.ac.uk dot@dotat.at Uiversity of Cambridge Computig Service February 009 Abstract I these otes I eplai how to derive formulae for umerically

More information

Quadrat Sampling in Population Ecology

Quadrat Sampling in Population Ecology Quadrat Samplig i Populatio Ecology Backgroud Estimatig the abudace of orgaisms. Ecology is ofte referred to as the "study of distributio ad abudace". This beig true, we would ofte like to kow how may

More information

Basic Data Analysis Principles. Acknowledgments

Basic Data Analysis Principles. Acknowledgments CEB - Basic Data Aalysis Priciples Basic Data Aalysis Priciples What to do oce you get the data Whe we reaso about quatitative evidece, certai methods for displayig ad aalyzig data are better tha others.

More information

In nite Sequences. Dr. Philippe B. Laval Kennesaw State University. October 9, 2008

In nite Sequences. Dr. Philippe B. Laval Kennesaw State University. October 9, 2008 I ite Sequeces Dr. Philippe B. Laval Keesaw State Uiversity October 9, 2008 Abstract This had out is a itroductio to i ite sequeces. mai de itios ad presets some elemetary results. It gives the I ite Sequeces

More information

Chapter 6: Variance, the law of large numbers and the Monte-Carlo method

Chapter 6: Variance, the law of large numbers and the Monte-Carlo method Chapter 6: Variace, the law of large umbers ad the Mote-Carlo method Expected value, variace, ad Chebyshev iequality. If X is a radom variable recall that the expected value of X, E[X] is the average value

More information

Sequences and Series

Sequences and Series CHAPTER 9 Sequeces ad Series 9.. Covergece: Defiitio ad Examples Sequeces The purpose of this chapter is to itroduce a particular way of geeratig algorithms for fidig the values of fuctios defied by their

More information

Chapter 5 Unit 1. IET 350 Engineering Economics. Learning Objectives Chapter 5. Learning Objectives Unit 1. Annual Amount and Gradient Functions

Chapter 5 Unit 1. IET 350 Engineering Economics. Learning Objectives Chapter 5. Learning Objectives Unit 1. Annual Amount and Gradient Functions Chapter 5 Uit Aual Amout ad Gradiet Fuctios IET 350 Egieerig Ecoomics Learig Objectives Chapter 5 Upo completio of this chapter you should uderstad: Calculatig future values from aual amouts. Calculatig

More information

Comparative Study On Estimate House Price Using Statistical And Neural Network Model

Comparative Study On Estimate House Price Using Statistical And Neural Network Model Comparative Study O Estimate House Price Usig Statistical Ad Neural Network Model Azme Bi Khamis, Nur Khalidah Khalilah Biti Kamarudi Abstract: This study was coducted to compare the performace betwee

More information

Allele frequency estimation in the human ABO blood group system

Allele frequency estimation in the human ABO blood group system Allele frequecy estimatio i the huma AB blood group system Pedro J.N. Silva Faculdade de Ciecias da Uiversidade de Lisboa Campo Grade, C, 4o. piso P-1700 LISBA PRTUGAL Pedro.Silva@fc.ul.pt 00 Table of

More information

3. Greatest Common Divisor - Least Common Multiple

3. Greatest Common Divisor - Least Common Multiple 3 Greatest Commo Divisor - Least Commo Multiple Defiitio 31: The greatest commo divisor of two atural umbers a ad b is the largest atural umber c which divides both a ad b We deote the greatest commo gcd

More information

Hypergeometric Distributions

Hypergeometric Distributions 7.4 Hypergeometric Distributios Whe choosig the startig lie-up for a game, a coach obviously has to choose a differet player for each positio. Similarly, whe a uio elects delegates for a covetio or you

More information

Actuarial Models for Valuation of Critical Illness Insurance Products

Actuarial Models for Valuation of Critical Illness Insurance Products INTERNATIONAL JOURNAL OF MATHEMATICAL MODELS AND METHODS IN APPLIED SCIENCES Volume 9, 015 Actuarial Models for Valuatio of Critical Illess Isurace Products P. Jidrová, V. Pacáková Abstract Critical illess

More information

A GUIDE TO LEVEL 3 VALUE ADDED IN 2013 SCHOOL AND COLLEGE PERFORMANCE TABLES

A GUIDE TO LEVEL 3 VALUE ADDED IN 2013 SCHOOL AND COLLEGE PERFORMANCE TABLES A GUIDE TO LEVEL 3 VALUE ADDED IN 2013 SCHOOL AND COLLEGE PERFORMANCE TABLES Cotets Page No. Summary Iterpretig School ad College Value Added Scores 2 What is Value Added? 3 The Learer Achievemet Tracker

More information

Learning objectives. Duc K. Nguyen - Corporate Finance 21/10/2014

Learning objectives. Duc K. Nguyen - Corporate Finance 21/10/2014 1 Lecture 3 Time Value of Moey ad Project Valuatio The timelie Three rules of time travels NPV of a stream of cash flows Perpetuities, auities ad other special cases Learig objectives 2 Uderstad the time-value

More information

NATIONAL SENIOR CERTIFICATE GRADE 12

NATIONAL SENIOR CERTIFICATE GRADE 12 NATIONAL SENIOR CERTIFICATE GRADE MATHEMATICS P EXEMPLAR 04 MARKS: 50 TIME: 3 hours This questio paper cosists of 8 pages ad iformatio sheet. Please tur over Mathematics/P DBE/04 NSC Grade Eemplar INSTRUCTIONS

More information

Definition. A variable X that takes on values X 1, X 2, X 3,...X k with respective frequencies f 1, f 2, f 3,...f k has mean

Definition. A variable X that takes on values X 1, X 2, X 3,...X k with respective frequencies f 1, f 2, f 3,...f k has mean 1 Social Studies 201 October 13, 2004 Note: The examples i these otes may be differet tha used i class. However, the examples are similar ad the methods used are idetical to what was preseted i class.

More information

Data-Enhanced Predictive Modeling for Sales Targeting

Data-Enhanced Predictive Modeling for Sales Targeting Data-Ehaced Predictive Modelig for Sales Targetig Saharo Rosset Richard D. Lawrece Abstract We describe ad aalyze the idea of data-ehaced predictive modelig (DEM). The term ehaced here refers to the case

More information

SECTION 1.5 : SUMMATION NOTATION + WORK WITH SEQUENCES

SECTION 1.5 : SUMMATION NOTATION + WORK WITH SEQUENCES SECTION 1.5 : SUMMATION NOTATION + WORK WITH SEQUENCES Read Sectio 1.5 (pages 5 9) Overview I Sectio 1.5 we lear to work with summatio otatio ad formulas. We will also itroduce a brief overview of sequeces,

More information

Doing Multiple Regression with SPSS. In this case, we are interested in the Analyze options so we choose that menu. If gives us a number of choices:

Doing Multiple Regression with SPSS. In this case, we are interested in the Analyze options so we choose that menu. If gives us a number of choices: Doing Multiple Regression with SPSS Multiple Regression for Data Already in Data Editor Next we want to specify a multiple regression analysis for these data. The menu bar for SPSS offers several options:

More information