STATISTICS AND STANDARD DEVIATION

Size: px
Start display at page:

Download "STATISTICS AND STANDARD DEVIATION"

Transcription

1 STATISTICS AND STANDARD DEVIATION

2 Statistics and Standard Deviation STSD-A Objectives... STSD 1 STSD-B Calculating Mean... STSD STSD-C Definition of Variance and Standard Deviation... STSD 4 STSD-D Calculating Standard Deviation... STSD 5 STSD-E Coefficient of Variation... STSD 7 STSD-F Normal Distribution and z-scores... STSD 8 STSD-G Chebshev s Theorem... STSD 15 STSD-H Correlation and Scatterplots... STSD 16 STSD-I Correlation Coefficient and Regression Equation... STSD 1 STSD-J Summar... STSD 5 STSD-K Review Eercise... STSD 7 STSD-L Appendi z-score Values Table... STSD 9 STSD-Y Inde... STSD 30 STSD-Z Solutions... STSD 3 STSD-A Objectives To calculate the mean and standard deviation of lists, tables and grouped data To determine the correlation co-efficient To calculate z-scores To use normal distributions to determine proportions and values To use Chebshev s theorem To determine correlation between sets of data To construct scatterplots and lines of best fit To calculate correlation coefficient and regression equation for data sets. STSD 1

3 STSD-B Calculating Mean The mean is a measure of central tendenc. It is the value usuall described as the average. The mean is determined b summing all of the numbers and dividing the result b the number of values. The mean of a population of N values (scores) is defined as the sum of all the scores, of the population,, divided b the number of scores, N. The population mean is represented b the Greek letter μ (mu) and calculated b using μ. N Often it is not possible to obtain data from an entire population. In such cases, a sample of the population is taken. The mean of a sample of n items drawn from the population is defined in the same wa and is denoted b, pronounced bar and calculated using Eample STSD-B1 Calculate the mean of the following student test results percentages.. n 9% 66% 99% 75% 69% 51% 89% 75% 54% 45% 69% n write out formula add together all scores divide b number of scores The mean of the student test results is 71.7 % (rounded to d.p.). When calculating the mean from a frequenc distribution table, it is necessar to multipl each score b its frequenc and sum these values. This result is then divided b the sum of the frequencies. The formula for the mean calculated from a frequenc table is f f Calculations using this formula are often simplified b setting up a table as shown below. Eample STSD-B Calculate the mean number of pins knocked down from the frequenc table. Pins () Frequenc (f) f Total f 60 f 40 mean Note: It is rare for an eact number to result from a mean calculation. f f The mean number of pins knocked down was 7 pins. STSD

4 If the frequenc distrubution table has grouped data, intervals, it is necessar to use the mid-value of the interval in mean calculations. The mid-value for an interval is calculated b adding the upper and lower boundaries of the interval and dividing the result b two. mid value: upper + lower Eample STSD-B3 Calculate the mean height of students from the frequenc table. Height (cm) mid-value () Frequenc(f) f Σ f 0 Σ f 315 mean f f cm Eercise STSD-B1 Calculate the mean of the following data sets. (a) Hocke goals scored. 5, 4, 3,,, 1, 0, 0, 1,, 3 The mean height is cm. (b) Points scored in basketball games (d) Babies weights Points Scored Frequenc (f) Bab Weight (kg) Freq (f) () Total 10 Total 15 (c) Number of tping errors (e) ATM withdrawals Tping (f) Withdrawals (f) errors ($) Total Total 30 STSD 3

5 STSD-C Definition of Variance and Standard Deviation To further describe data sets, measures of spread or dispersion are used. One of the most commonl used measures is standard deviation. This value gives information on how the values of the data set are varing, or deviating, from the mean of the data set. Deviations are calculated b subtracting the mean,, from each of the sample values,, i.e. deviation. As some values are less than the mean, negative deviations will result, and for values greater than the mean positive deviations will be obtained. B simpl adding the values of the deviations from the mean, the positive and negative values will cancel to result in a value of zero. B squaring each of the deviations, the problem of positive and negative values is avoided. To calculate the standard deviation, the deviations are squared. These values are summed, divided b the appropriate number of values and then finall the square root is taken of this result, to counteract the initial squaring of the deviation. The standard deviation of a population, σ, of N data items is defined b the following formula. ( μ ) Σ σ where μ is the population mean. N For a sample of n data items the standard deviation, s, is defined b, s Σ ( ) n 1 where is the sample mean. NOTE: When calculating the sample standard deviation we divide b (n 1) not N. The reason for this is comple but it does give a more accurate measurement for the variance of a sample. Standard deviation is measured in the same units as the mean. It is usual to assume that data is from a sample, unless it is stated that a population is being used. To assist in calculations data should be set up in a table and the following headings used: μ OR ( μ ) OR ( ) Eample STSD-C1 Determine the standard deviation of the following student test results percentages. 9% 66% 99% 75% 69% 51% 89% 75% 54% 45% 69% ( ) ( 0.7) Σ 784 Σ( ) Σ n 11 ( ) Σ s n The standard deviation of the test results is approimatel 17.6%. STSD 4

6 The variance is the average of the squared deviations when the data given represents the population. The lower case Greek letter sigma squared, σ, is used to represent the population variance. ( μ ) σ where μ is the population mean, and N is the population size. N The sample variance, which is denoted b s, is defined as s ( ) n 1 where is the sample mean, and n is the sample size. As variance is measured in squared units, it is more useful to use standard deviation, the square root of variance, as a measure of dispersion. STSD-D Calculating Standard Deviation The previousl mentioned formulae for standard deviation of a population, σ and a sample standard deviation, s, ( μ ) ( ) Σ Σ σ s N n 1 can be manipulated to obtain the following formula which are easier to use for calculations. These are commonl called computational formulae. ( Σ) ( Σ) Σ Σ N n σ s N n 1 To perform calculations again it is necessar to set up a table. The table heading in this case will be: Eample STSD-D1 Determine the standard deviation of the following student test results percentages. 9% 66% 99% 75% 69% 51% 89% 75% 54% 45% 69% Σ 784 Σ ( Σ) Σ n s n NOTE: This is approimatel the same value as calculated previousl. This value will actuall be more accurate as it onl uses rounding in the final calculation step. The standard deviation of the test scores is approimatel 17.6%. STSD 5

7 When data is presented in a frequenc table the following computational formulae for populations standard deviation, σ, and sample standard deviation, s, can be used. ( Σf) ( Σf) Σf Σf Σf Σf σ s Σf Σf 1 If the data is presented in a grouped or interval manner, the mid-values are used as with the calculation of the mean. The table heading for calculations will include. f f f Eamples STSD-D Calculate the standard deviations for each of the following data sets. (a) Number of pins knocked down in ten-pin bowling matches Pins () f f f Σ f 60 Σ f 40 Σ f 384 s Σf ( Σf) Σf Σf The standard deviation of the number of pins knocked down is approimatel.41 pins. (b) Heights of students Heights f f f Σ f 0 Σ f 315 Σ f s Σf ( Σf) Σf Σf The standard deviation of the heights is approimatel 38.33cm. STSD 6

8 Eercise STSD-D1 Calculate the standard deviations for each of the following data sets. (a) Hocke goals scored. 5, 4, 3,,, 1, 0, 0, 1,, 3 (b) Points scored in basketball games. (d) Babies weights Points Scored Frequenc (f) Bab Weight (kg) Freq (f) () Total 10 Total 15 (c) Number of tping errors. (e) ATM withdrawals STSD-E Tping (f) Withdrawals (f) errors ($) Total Total 30 Co-efficient of Variation Without an understanding of the relative size of the standard deviation compared to the original data, the standard deviation is somewhat meaningless for use with the comparison of data sets. To address this problem the coefficient of variation is used. The coefficient of variation, CV, gives the standard deviation as a percentage of the mean of the data set. s CV 100% CV σ 100% μ for a sample for a population Eample STSD-E1 Calculate the coefficient of variation for the following data set. The price, in cents, of a stock over five trading das was 5, 58, 55, 57, Σ 81 Σ 1583 s.77 CV 100% 100% 4.93% 56. n ( Σ) Σ n s n The coefficient of variation for the stock prices is 4.93%. The prices have not showed a large variation over the five das of trading STSD 7

9 The coefficient of variation is often used to compare the variabilit of two data sets. It allows comparison regardless of the units of measurement used for each set of data. The larger the coefficient of variation, the more the data varies. Eample STSD-E The results of two tests are shown below. Compare the variabilit of these data sets. Test 1 (out of 15 marks): 9 s Test (out of 50 marks): 7 s 8 s s CVtest1 100% CVtest 100% 8 100%.% 100% 9.6% 9 7 The results in the second test show a great variation than those in the first test. Eercise STSD-E1 1. Calculate the coefficient of variation for each of the following data sets. (a) Stock prices: 8, 10, 9, 10, 11 (b) Test results: 10, 5, 8, 9,, 1, 5, 7, 5, 8. Compare the variation of the following data sets. (a) Data set A: 35, 38, 34, 36, 38, 35, 36, 37, 36 Data set B: 36, 0, 45, 40, 5, 46, 6, 6, 3 (b) Bo s Heights: 141.6cm s 15.1cm Girl s Heights: 143.7cm s 8.4cm STSD-F Normal Distribution and z-scores Another use of the standard deviation is to convert data to a standard score or z-score. The z-score indicates the number of standard deviations a raw score deviates from the mean of the data set and in which direction, i.e. is the value greater or less than the mean? The following formula allows a raw score,, from a data set to be converted to its equivalent standard value, z, in a new data set with a mean of zero and a standard deviation of one. A z-score can be positive or negative: μ z sample z population s σ positive z-score raw score greater than the mean negative z-score raw score less than the mean. STSD 8

10 Eamples STSD-F1 1. Given the scores 4, 7, 8, 1, 5 determine the z-score for each raw score. 5 5 n ( ) 8 64 s n 1 1 n Σ 5 Σ 155 raw score z-score meaning z standard deviations below the mean z standard deviations above the mean z standard deviations above the mean z standard deviations below the mean z at the mean. Given a data set with a mean of 10 and a standard deviation of, determine the z-score for each of the following raw scores, is 1 standard deviations below the mean. z z z 3 10 is 0 standard deviations from the mean, it is equal to the mean. 16 is 3 standard deviations above the mean. The z-scores also allow comparisons of scores from different sources with different means and/or standard deviations. Eample STSD-F Jenn obtained results of 48 in her English eam and 75 in her Histor eam. Compare her results in the different subjects considering: English eam : class mean was 45 and the standard deviation.5 Histor eam : class mean was 78 and the standard deviation z English z Histor Jenn s English result is 1.33 standard deviations above the class mean, while her Histor was 1.5 standard deviations below the class mean. STSD 9

11 It is also possible to determine a raw score for a given z-score, i.e. it is possible to find a value that is a specified number of standard deviations from a mean. The z-score formula is transformed to + s z sample μ + σ z population Eamples STSD-F3 A data set has a mean of 10 and a standard deviation of. Find a value that is: (i) 3 standard deviations above the mean z 3 + s z is three standard deviations above the mean. (ii) standard deviations below the mean z + s z is two standard deviations below the mean. Eercise STSD-F1 1. Given the scores 56, 8, 74, 69, 94 determine the z-score for each raw score.. Given a data set with a mean of 54 and a standard deviation of 3., determine the z-score for each of the following raw scores,. (i) 81 (ii) Peter obtained results of 63 in Maths and 58 in Geograph. For Maths the class mean was 58 and the standard deviation 3.4, and for Geograph the class mean was 55 and the standard deviation.3. Compared to the rest of the class did Peter do better in Maths or Geograph? 4. A data set has a mean of 54 and a standard deviation of 3.. Find a value that is: (i) standard deviations below the mean. (ii) 1.5 standard deviations above the mean. The distribution of z-scores is described b the standard normal curve. Normal distributions are smmetric about the mean, with scores more concentrated in the centre of the distribution than in the tails. Normal distributions are often described as bell shaped. mean Data collected on heights, weights, reading abilities, memor and IQ scores often are approimated b normal distributions. The standard normal distribution is a normal distribution with a mean of zero and a standard deviation of one STSD 10

12 For normall distributed data 50% of the data is below the mean and 50% above the mean. 50% 50% In a normal distribution, the distance between the mean and a given z-score corresponds to a proportion of the total area under the curve, and hence can be related to a proportion of a population. The total area under a normal distribution curve is taken as equal to 1 or 100%. The values of the proportions, written as decimals, for various z-scores are provide in a statistical table, Normal Distribution Areas (see Appendi). The values in the Normal Distribution Areas table give a proportion value for the area between the mean and the raw score greater than the mean, converted to a positive z-score. + z z As the distribution is smmetric, it is possible to calculate proportions for raw scores less than the mean. If the calculated z-score is negative, the corresponding positive value from the table is used. So values for z are the same distance from the mean whether the are a number of standard deviations more or a number of standard deviations less than the mean, and will result in the same proportion. A B z +z proportion A proportion B In a normal distribution approimatel: 68% of values lie within ± 1 s.d. of the mean 68% % of values lie within ± s.d. of the mean 95% 99% of values lie within ± 3 s.d. of the mean 99% 3 3 STSD 11

13 When using the Normal Distribution Areas table, the z-score is structured from the row and column headings of the table and the required proportion is found in the middle of the table at the intersection of the corresponding rows and columns. Eamples STSD-F4 Use the Normal Distribution Areas table to determine the proportions that correspond to the following z-scores. (i) z row,.00 column proportion % (ii) z row,.00 column proportion % (iii) z row,.0 column proportion % (iv) z. use z row,.00 column proportion % (v) z.1 use z row,.01 column proportion % z It is also possible to find a z-score for a required proportion from the Normal Distribution Areas table. It is necessar to find the proportion in the middle of the table and read back to the row and column headings to determine the z-score. Eamples STSD-F5 Use the Normal Distribution Areas table to determine the z-scores that correspond to the following proportions. (i) 48.1% row,.00 column The z-score would be either z.1 or z.1. (ii) 48.68% row,.0 column The z-score would be either z. or z.. z STSD 1

14 Values from the Normal Distribution Areas table enable the determination of proportions of the data set that lie above a raw value, below a raw value or even between two raw values within the data set. Drawing a quick sketch of the distribution curve with the position of the mean and the raw scores(s) and shading the required area can assist with the understanding of the required calculations. Eamples STSD-F6 1. The weights of chips in packets have a mean of 50g and standard deviation of 3g. (a) Find the proportion of the packets of chips with a weight between 50g and 5g z (from the table) % of the chip packets have a weight between 50g and 5g. (b) Find the proportion of the packets of chips with a weight between 47g and 50g z (from the table) % of the chip packets have a weight between 47g and 50g. (c) Find the proportion of the packets of chips with a weight greater than 55g z Area > % of the chip packets have a weight greater than 55g. (d) Find the proportion of the packets of chips with a weight less than 45g z Area < % of the chip packets have a weight less than 45g. (e) Find the proportion of the packets of chips with a weight between 45g and 5g z z Area between 45 & % of the chip packets have a weight between 45g and 5g. STSD 13

15 Eamples STSD-F6 continued 1. (f) Find the proportion of the packets of chips with a weight between 5g and 55g z z Area between 5 & % of the chip packets have a weight between 5g and 55g.. (a) If the compan selling the chips in the previous question rejects the 5% of the chip packets that are too light, at what weight should the chip packets be rejected? 5% 50g s 3g proportion below mean s z z between 1.64 and Packets should be rejected if the weigh 45g or less. (b) Between which weights do 80% of the chip packets fall? 45% ( ) 80% 40% + 40% 50g s 3g ± s z 50 ± and ( ) proportion 0.4 above and below mean z ± % of the packets weigh between 46.16g and 53.84g. 3. If there are 80 packets in a vending machine, how man packets would be epected to weight more than 55g? z Area > Number > Approimatel 4 chip packets would be epected to have a weight of greater than 55g. Eercise STSD-F 1. If scores are normall distributed with a mean of 30 and a standard deviation of 5, what percentage of the scores is (i) greater than 30? (ii) between 8 and 30? (iii) greater than 37? (iv) between 8 and 34? (v) between 6 and 8? STSD 14

16 Eercise STSD-F continued. IQ scores have a mean of 100 and a standard deviation of 16. What proportion of the population would have an IQ of: (i) greater than 13? (ii) less than 91? (iii) between 80 and 10? (iv) between 80 and 91? (v) If Shane is smarter than 75% of the population, what is his IQ score? 3. The heights of bos in a school are approimatel normall distributed with a mean of 140cm and a standard deviation of 0cm. (a) Find the probabilit that a bo selected at random would have a height of less than 175cm. (b) If there are 400 bos in the school how man would be epected to be taller than 175cm? (c) If 15% of the bos have a height that is less than the girls mean height, what is the girls mean height? 4. Charlie is a sprinter who runs 00m in an average time of.4 seconds with a standard deviation of 0.6s. Charlie s times are approimatel normall distributed. (a) To win a given race a time of 1.9s is required. What is the probabilit that Charlie can win the race? (b) The race club that the sprinter is a member of has the record time for the 00m race at 0.7s. What is the likelihood that Charlie will be able to break the record? (c) A sponsor of athletics carnivals offers $100 ever time a sprinter breaks.5s. If Charlie competes in 80 races over a ear how much sponsorship mone can he epect? STSD-G Chebshev s Theorem A Russian mathematician, P.L. Chebshev, developed a theorem that approimated the proportion of data values ling within a given number of standard deviations of the mean regardless of whether the data is normall distributed or not. Chebshev s theorem states: For an data set, at least 1 k 1 of the values lie within k standard deviations either side of the mean. ( k > 1) So for an set of data at least: % % 3 9 of the values lie within standard deviations. of the values lie with in 3 standard deviations. This theorem allows the determination of the least percentage of values that must lie between certain bounds identified b standard deviations. STSD 15

17 Eamples STSD-G1 1. The heights of adult dogs in a town have a mean of 67.3cm and a standard deviation of 3.4cm. (a) What can be concluded from Chebshev s theorem about the percentage of dogs in the town that have heights between 58.8cm and 75.8cm? z z At least 84% of the adult dogs would have heights between cm and 75.8cm. 84%. (b) What would be the range of heights that would include at least 75% of the dogs? 1 75% 1 k k k k 1 k k k 4 Chebshev s theorem suggests that 75% of the heights are within ± standard deviations of the mean. μ ± zσ 67.3 ± and % of the dog s heights would range between 60.5cm and 74.1cm. Eercise STSD-G1 1. The weights of cattle have a mean of 434kg and standard deviation of 69kg. What percentage of cattle will weigh between 330.5kg and 537.5kg?. The age of pensioners residing in a retirement village has a mean of 74 ears and standard deviation of 4.5 ears. What is the age range of pensioners that contains at least 89% of the residents? 3. It was found that for a batch of softdrink bottles, the mean content was 994ml. If 75% of the bottles contained between 898ml and 1090ml, what was the standard deviation for the softdrink batch? 4. On a test the mean is 50 marks and standard deviation 11. At most, what percentage of the results will be less than 17 and greater than 83 marks? STSD-H Correlation and Scatterplots When two different data variables, quantities, are collected from the one source it is possible to determine if a relationship eists between the variables. A simple method of determining the relationship between two variables, if it eists, is b constructing a scatterplot. A scatterplot (scatter graph or scatter diagram) is a graph that is created b plotting one variable, quantit, on the horizontal ais and the other on the vertical ais. If one variable is likel to be dependent on the other, the dependent variable should be plotted on the vertical ais and the independent variable on the -ais. The scales on the vertical and horizontal aes do not need to be the same or even use the same units. Also the aes do not need to commence at zero. STSD 16

18 Eamples STSD-H1 1. The heights and weights of 10 students are recorded below. Construct a scatterplot for this data. Student Height (cm) Weight (kg) Adam Brent Charlie David Edd Fred Gar Harr Ian John Scatterplot of Weight against Height Weight (kg) Adam ( 167,5) John ( 181, 61) Height (cm). For one week the midda temperature and the number of hot drinks sold were recorded. Construct a scatterplot for this data. Sun Mon Tues Wed Thur Fri Sat Temp ( C) Number of Drinks Scatterplot of Hot Drinks Sales Against Temperature 70 Number of Drinks Sold Midda Temperature (oc) STSD 17

19 Eercise STSD-H1 Construct scatterplots for the following data sets. 1. Student A B C D E F G H I J Test 1 ( /50) Final Eam % Da Mon Tue Wed Thur Fri Sat Sun Temperature ( C) Softdrink Sales (cans) Name Ann Lee Ma Jan Tom Wes Height (cm) Shoe Size Speed (km/hr) Fuel Econom (km/l) The points in a scatterplot often tend towards approimating a line. It is possible to summarise the points of a scatterplot b drawing a line through the plot as a whole, not necessaril through the individual points. This line is called the line of best fit. The line of best fit line need not pass through an of the original data points, but is used to represent the entire scatterplot. A line of best fit can be thought of as an average for the scatterplot, in a wa similar to a mean is the average of a list of values. To sketch a line of best fit for a scatterplot: 1. calculate the mean of the independent variable values,, and the mean of the dependent variable values, ;. plot this mean point, (, ) to the scatterplot; 3. sketch a through the mean point, that has a slope the follows the general trend of the points of scatterplot. Wherever possible the number of points below the line should equal the number of points above. Outling points need not strongl influence the line of best fit, and are often not included. The line of best fit can be used to predict values for data associated with the scatterplot. STSD 18

20 Eamples STSD-H 1. The heights and weights of 10 students are recorded below. Construct a scatterplot for this data and draw a line of best fit. Student Height (cm) Weight (kg) Adam Brent Charlie David Edd Fred Gar Harr Ian John Means Σ 1697 Σ n 10 n Weight Against Height Weight (kg) ( 169.7,56.3) Height (cm). For one week the midda temperature and the number of hot drinks sold were recorded. Construct a scatterplot for this data. Sun Mon Tues Wed Thur Fri Sat means Temp ( C) Number of Drinks Drink Sales Against Temperature Drink Sales ( 9.43, 46.86) Temperature Eercise STSD-H Draw lines of best fit on scatterplots drawn in Eercise STSD-H1. STSD 19

21 Correlation is a measure of the relationship between two measures, variables, on sets of data. Correlation can be positive or negative. A positive correlation means that as one variable increases the other variable increases, eg. height of a child and age of the child. Negative correlation implies as one variable increases the other variable decrease, eg. value of a car and age of the car. If variables have no correlation there is no relationship between the variables, i.e. one measure does not affect the other. Scatterplots enable the visual determination of whether correlation eists between variables and the tpe of correlation. no correlation perfect positive correlation positive correlation perfect negative correlation negative correlation Eercise STSD-H3 1. For each of the following scatterplots determine if the correlation is perfect positive, positive, no correlation, negative or perfect negative. A B C D E F G H. Select the scatterplot from those above that would best describe the relationship between the following variables. (i) Height at 4 ears, height at 16 ears. (ii) (iii) (iv) (v) Age of a used car, price of a used car. Temperature at 6am, temperature at 3pm. Shoe size of mother, number of children in famil. Average eam result, class size. 3. For each of the scatterplots drawn in eercise STSD- H1 state the tpe of correlation between the variables. STSD 0

22 STSD-I Correlation Coefficient and Regression Equation It is possible to quantif the correlation between variables. This is done b calculating a correlation coefficient. A correlation coefficient measures the strength of the linear relationship between variables. Correlation coefficients can range from 1 to +1. A value of 1 represents a perfect negative correlation and a value of +1 represents a perfect positive correlation. If a data set has a correlation coefficient of zero there is no correlation between the variables. perfect positive correlation r 1 positive correlation r 0.7 no correlation r 0 perfect negative correlation r 1 negative correlation r 0.7 The most widel used tpe of correlation coefficient is Pearson s, r, simple linear correlation. The value of r is determined with the formula below r Σ( X X)( Y Y) ( X X) ( Y Y) Σ Σ This formula uses the sums of deviations from the means in both the X values and Y values. However for ease of calculation the following calculation formula is often used. r nσxy ΣXΣY ( nσx ( ΣX) ) nσy ( ΣY) ( ) where, n number of data points Σ X sum of the X values Σ Y sum of the Y values Σ XY sum of the product of each set of X and Y values Σ X sum of X Σ Y sum of Y ( Σ X ) the square of the sum of the X values ( Σ Y ) the square of the sum of they values STSD 1

23 To assist in calculations data can be set up in a table and the following headings used: X Y X Y XY Eample STSD-I1 Calculate the correlation coefficient for the height weight data below. Height, X Weight, Y X Y XY Σ X 1697 Σ Y 563 Σ X 8867 Σ Y 3661 Σ XY 9638 r nσxy ΣXΣY nσx ΣX nσy ΣY ( ( ) ) ( ) ( ) ( )( ) There is a positive correlation between the height and weight of students. NOTE: r, sometimes referred to as the co-efficient of determination, represents the proportion (percentage) of the relationship between the variables that can be eplained b a linear relationship. The greater the r value, the greater the linear relationship between the variables. Eample STSD-I For the previous height/weight eample r r approimatel 78% of the relationship can be eplained b a linear correlation. This is a moderatel strong correlation. Eercise STSD-I1 Determine the correlation co-efficient for the following data sets. 1. Temperature ( C) Hot drink sales (cups) Speed (km/hr) Fuel econom (km/l) Data collected of students results for sitting Test 1 and the final eam (eercise STSD H1 question 1) Determine the correlation co-efficient for this data. X : Test 1 results (out of 15 marks) Y : Final eam result (out of 50 marks) Σ X 99, Σ Y 649, Σ X 9849, Σ Y 46387, Σ XY 137, n 10 STSD

24 The relationship between two sets of data can be represented b a linear equation called a regression equation. The regression equation gives the variation of the dependent variable for a given change in the independent variable. It is etremel important to correctl determine which variable is dependent. The regression equation can be used to construct the regression line (line of best fit) on the associated scatterplot. Because the equation is for a line, the regression equation takes on the general linear equation format, m+ c. Usuall, however, for a regression equation this is written as α + β, where α is the -intercept and β the slope of the line. ΣXY nx Y α + β where β ΣX nx α Y βx The slope of the line depends on whether the correlation is positive or negative. A regression equation can be used to predict dependent variables from independent inputs within the range of the scatterplot values. It should not be used: to predict given to predict outside the bounds of given values. The stronger the correlation between the two variables the better the prediction made b the regression equation. Again setting up a table of values, with the following headings, can assist in calculations. X Y X XY Eamples STSD-I3 1. Calculate the correlation coefficient for the height/weight data below. The weight of a student should depend on the height rather than vice versa, so height is the independent,, variable and weight the dependent,, variable. Height, X Weight, Y X XY Σ X 1697 Σ Y 563 Σ X 8867 Σ XY 9638 ΣX 1697 X ΣXY nx Y n 10 β ΣY 563 ΣX nx Y n α Y βx The regression equation is OR w 1.08h 16.7 STSD 3

25 Eamples STSD-I3 continued. Use the regression equation to predict the following: (i) The weight of a student who is 160cm tall. h 160 w 1.08h kg The student should weigh approimatel 46.1kg. (ii) The weight of a student who is 175cm tall. h 175 w 1.08h kg The student should weigh approimatel 6.3kg. (iii) The weight of a student who is 185cm tall. Can not be predicted as input is outside range of recorded heights. (iv) The height of a student who weighs 65kg. Regression equation can not be used to predict height from weight. Eercise STSD-I 1. Determine the regression equation for each the following data sets. (Use sums calculated in the previous eercise and the equations to predict the requested values.) (a) Temperature ( C) Hot drink sales (cups) (i) (ii) Predict the drink sales when the temperature is 10 C. Predict the drink sales if the temperature is 5 C. (b) Speed (km/hr) Fuel econom (km/l) (i) (ii) Predict the fuel econom for a speed of 75km/hr. Predict what speed a car would be travelling if it was getting 17.5km/l.. Data collected of students results for sitting Test 1 and the final eam (eercise STSD H1 question 1) X : Test 1 results (out of 15 marks) Y : Final eam result (out of 50 marks) Σ X 99, Σ Y 649, Σ X 9849, Σ Y 46387, Σ XY 137, n 10 Find the regression equation for this data. STSD 4

26 STSD-J Summar The mean is a measure of central tendenc. Standard deviation measures spread or dispersion of a data set. The coefficient of variation, CV, gives the standard deviation as a percentage of the mean of the data set. The z-score indicates how far, the number of standard deviations, a raw score deviates from the mean of the data set. The following formulae can be used to calculate the the given statistical measures. Statistical Measure Population Formula Sample Formula Mean Mean from frequenc table μ N n f f μ f f Standard Deviation ( Σ) Σ N σ N s Σ ( Σ) n 1 n Standard Deviation from frequenc table σ Σf ( Σ ) f Σf Σf s Σf ( Σf) Σf Σf 1 Coefficient of Variation CV σ 100% CV s 100% μ z-score μ z σ z s Raw Score from z-score μ + σ z + s z The standard normal distribution is a normal distribution with a mean of zero and a standard deviation of one. The distance between the mean and a given z-score corresponds to a proportion of the total area under the curve, and hence can be related to a proportion of a population. The total area under a normal distribution curve is taken as equal to 1 or 100%. The values in the Normal Distribution Areas table give a proportion value for the area between the mean and the raw score greater than the mean, converted to a positive z-score. + z In a normal distribution approimatel: 68% of values lie within ± 1 s.d. of the mean; 95% of values lie within ± s.d. of the mean; and 99% of values lie within ± 3 s.d. of the mean. 1 Chebshev s theorem states: for an data set, at least ( 1 k ) deviations either side of the mean. ( k > 1). of the values lie within k standard STSD 5

27 A scatterplot is a graph that is created b plotting one variable, quantit, on the horizontal ais and the other on the vertical ais. To sketch a line of best fit for a scatterplot: 1. calculate the mean of the independent variable values,, and the mean of the dependent variable values, ;. plot this mean point, (, ) to the scatterplot; 3. sketch a line through the mean point, that has a slope that follows the general trend of the points of scatterplot. Correlation is a measure of the relationship between two measures, variables, on sets of data. Correlation can be positive or negative. no correlation perfect positive correlation positive correlation perfect negative correlation negative correlation A correlation coefficient measures the strength of the linear relationship between variables. The most widel used tpe of correlation coefficient is Pearson s, r, simple linear correlation. The value of r is determined with the calculation formula r nσxy ΣXΣY ( nσx ( ΣX) ) nσy ( ΣY) ( ) r, sometimes referred to as the co-efficient of determination, represents the proportion (percentage) of the relationship between the variables that can be eplained b a linear relationship. The relationship between two sets of data can be represented b a linear equation called a regression equation. ΣXY nx Y α + β where β ΣX nx α Y βx A regression equation can be used to predict dependent variables from independent inputs within the range of the scatterplot values. It should not be used: to predict, the independent variable, given, the dependent variable. to predict outside the bounds of given values. The stronger the correlation between the two variables the better the prediction made b the regression equation. STSD 6

28 STSD-K Review Eercise 1. For each of the following data sets calculate (i) the mean (ii) the standard deviation (iii) the coefficient of variation. (a) (b) (c) Store Sales for a week $55 $698 $547 $70 $645 $451 Student Mark in a 5 Mark Test Mark Frequenc Dail Rainfall in millimetres Rainfall (mm) Frequenc (das) A soft-drink filling machine uses cans with a maimum capacit of 340ml. The machine is set to output softdrink with a mean capacit of 330ml. It has been found that due to machine error the amount outputted varies with a standard deviation of 8ml and the amount outputted is normall distributed. (a) What proportion of cans will have between 330 ml and 340ml of softdrink? (b) What percentage of cans will have between 35ml and 340ml? (c) What percentage of cans will overflow? (d) If the smallest 5% of drinks must be rejected, what is the smallest amount which will be accepted? 3 (a) If a set of data has a mean of 76 and a standard deviation of 8.8, what is the interval that should contain at least 75% of the data? (b) A data set has a mean of 87 and a standard deviation of 98. At least what percentage of values should lie been 58 and 107? (c) A set of data has a mean of 468. If 89% of the data values lie between 336 and 600, what is the standard deviation for the data set? STSD 7

29 Eercise STSD-K continued 4. (a) Match each of the correlation coefficients with a scatterplot below. (i) r 0.6 (iv) r 0.9 (ii) r 0 (v) r 1 (iii) r 0.9 (b) Which scatterplot best approimates the correlation between each of the two variables below? (a scatterplot ma be used more than once) (i) das on a good diet, weight (ii) temperature outside, temperature in a non-air conditioned car (iii) hand span, height (iv) rainfall, level of water in river (v) length of finger nails, intelligence A B C D E F G 5. The average test results for a standard eamination and corresponding class size were recorded for five schools. The results are summarised in the table below. School Class Size Test Result A 8 8% B 33 50% C 5 80% D 14 98% E 0 90% Use the class size/test result data to: (a) construct a scatterplot (b) draw a line of best fit (c) determine the correlation coefficient (d) comment on the correlation (e) determine the regression equation (f) use the regression equation to predict (i) the epected result if a school had a class size of 30 students (ii) the epected result for a class of 10 students (iii) how man student there would be in a class if the test result was 75% STSD 8

30 STSD-L Appendi Normal Distribution Areas Table 0 z z STSD 9

31 STSD-Y Inde Topic Page Chebshev's Theorem... STSD 15 Coefficient of Determination... STSD Co-efficient of Variation... STSD 7 Correlation... STSD 0 Correlation Coefficient... STSD 1 Deviation... STSD 3 Line of Best Fit... STSD 18 Mean... STSD Mean population... STSD Mean sample... STSD Measure of Central Tendenc... STSD Negative Correlation... STSD 0 No Correlation... STSD 0 Normal Distribution... STSD 11 Normal Distribution table... STSD 1, 9 Population mean... STSD Population standard deviation... STSD 3 Positive Correlation... STSD 0 Raw Score... STSD 10 Regression Equation... STSD 3 Regression Line... STSD 3 Sample mean... STSD Sample standard deviation... STSD 3 Scatterplot... STSD 16 Standard Deviation... STSD 3 Standard Deviation calculation... STSD 4 Standard Deviation population... STSD 3 Standard Deviation sample... STSD 3 Standard Normal Curve... STSD 10 Standard Normal Distribution... STSD 10 Variance... STSD 5 bar... STSD z-scores... STSD 8 STSD 30

32 STSD 31

33 SOLUTIONS Statistics and Standard Deviation STSD-Z Solutions STSD -B Calculating Means... STSD 33 STSD -D Calculating Standard Deviations... STSD 34 STSD -E Coefficient of Variation... STSD 35 STSD -F Normal Distribution and z-scores STSD F1 z-scores... STSD 36 STSD F Normal Distributions... STSD 37 STSD -G Chebshev s Theorem... STSD 40 STSD -H Correlation and Scatterplots STSD H1/H Scatterplots/Lines of Best Fit... STSD 41 STSD H3 Correlation... STSD 43 STSD -I Correlation Coefficient and Regression Equation STSD I1 Correlation Coefficient... STSD 44 STSD I Regression Equation... STSD 45 STSD -K Review Eercise... STSD 46 STSD 3

34 SOLUTIONS Statistics and Standard Deviation STSD B1 Calculating Means (a) mean n The mean number of hocke goals scored is approimatel.09 goals. (b) Points Scored () Frequenc (f) f Total Σ f 10 Σ f 18 f mean f (c) The mean number of points scored is 1.8 points. Tping errors () Frequenc (f) f Total Σ f 0 Σ f 1 The mean number of tping errors is 1.05 errors. f mean f (d) Bab Weight (kg) Mid-value () Frequenc (f) f Total Σ f 15 Σ f 5.1 f mean f The mean bab weight is approimatel 3.47kg (e) Withdrawals ($) Mid-value () Frequenc (f) f Total Σ f 30 Σ f 3350 f mean f 3350 The mean withdrawal was approimatel $ STSD 33

35 SOLUTIONS Statistics and Standard Deviation STSD-D1 Calculating Standard Deviations (a) Σ 3 Σ 73 ( Σ) Σ n s n The standard deviation of the number of goals scored is approimatel 1.58 goals. (b) Points () f f f Σ f 10 Σ f 18 Σ f 1658 ( Σf) Σf Σf 10 s 1.48 Σf The standard deviation of number of basketball points is approimatel 1.48 points. (c) Errors () f f f Σ f 0 Σ f 1 Σ f 37 s Σf ( Σf) Σf Σf The standard deviation of the number of tping errors is approimatel 0.89 errors. (d) Weights f f f Σ f 15 Σ f 5.1 Σ f ( Σf) Σf Σf 15 s 0.33 Σf The standard deviation of the bab weights is approimatel 0.33kg. STSD 34

36 SOLUTIONS Statistics and Standard Deviation STSD-D1 continued (e) Withdrawals f f f Σ f 30 Σ f 3350 Σ f ( Σf) Σf Σf 30 s Σf The standard deviation of the ATM withdrawals is approimatel $ STSD-E1 Coefficient of Variation 1. (a) n ( Σ) Σ n s n s CV 100% % % (b) n ( Σ) Σ n s n s CV 100%.9 100% %. (a) Data Set A: n ( Σ) Σ n s n s CV 100% % % Data Set B: n ( Σ) Σ n s n There is greater relative variation in Data set B s CV 100% % % STSD 35

37 SOLUTIONS Statistics and Standard Deviation STSD-E1 continued. (b) Bos Heights: 141.6cm s 15.1cm Girls Heights: 143.7cm s 8.4cm There is greater relative variation in bos heights. s CV 100% % % s CV 100% % % STSD-F1 1. z-scores n 5 ( ) s n n Σ 375 Σ 8933 raw score z-score meaning z standard deviations below the mean z standard deviations above the mean z standard deviations below the mean z standard deviations below the mean z standard deviations above the mean. (i) s.d. above the mean z (ii) s.d. above the mean z z Maths 1.47 z Geograph Peter s Maths result is 1.47 standard deviations above the class mean, while his geograph was 1.30 standard deviations above the class mean. So Peter did slightl better with his Maths result compared to the rest of the class. 4. (i) z + s z is two standard deviations below the mean. (ii) z s z is 1.5 standard deviations above the mean. STSD 36

38 SOLUTIONS Statistics and Standard Deviation STSD-F 1. (i) Normal Distributions 30 50% 50% of the scores are greater than 30, the mean. (ii) z % of the scores are between 8 and 30. (iii) z % of the scores are greater than 37. Area > (iv) z8 0.4 z % of the score are between 8 and Area between 8 & (v) z6 0.8 z % of the score are between 6 and 8. Area between 6 & (i) Area > z % of the population have IQs greater than 13. STSD 37

39 SOLUTIONS Statistics and Standard Deviation STSD-F. (ii) continued % z Area < % of the population have IQs less than 91. (iii) z z Area between 80 & % of the population have IQs between 80 and 10. (iv) z z Area between 80 & % of the population have IQs between 80 and 91. (v) 75% 50% + 5% ( ) proportion 0.5 above mean + s z z between 0.67 and Shane would have an IQ of over 110. STSD 38

40 SOLUTIONS Statistics and Standard Deviation STSD-F 3. (a) continued % z Area < % of the bos have a height of less than 175cm. (b) Area > Bos > z Approimatel 16 bos have a height of greater than 175cm. (c) 15% 35% 140 proportion 0.35 ( below mean) s z z between 1.03 and The girls mean height is approimatel 119.3cm. 4. (a) (b) z Area < Charlie has approimatel 0% probabilit of winning the race (c) z Area < Charlie has approimatel 0.3% probabilit of breaking the record % Area < z Charlie epect $4540 in prize mone Prize $100 $4540 STSD 39

41 SOLUTIONS Statistics and Standard Deviation STSD-G1 1.. z Chebshev s Theorem z % 1 89% 1 k k k 1 k k ± 3 At least 56% of the cattle would have weights between kg and kg. z s z z 3 s z At least 89% of the pensioners would have ages between 60.5 ears and 87.5 ears % 1 k k k 1 k k ± z + s z z s z s OR s s s s 48 s 48 The standard deviation is 48ml z17 3 z At least 89% of the test result would be between 17 and 83, therefore 100% 89% 11% of the results will be less than and greater than 83 marks 89% STSD 40

42 SOLUTIONS Statistics and Standard Deviation STSD-H1/H 1. Scatterplots/Lines of Best Fit Student A B C D E F G H I J Test 1 ( /50) Final Eam % mean test1 9.9 mean test Scatterplot of Final Eam against Test 1 Final Eam Result (%) ( 9.9, 64.9) Test 1 Results (/50). Da Mon Tue Wed Thur Fri Sat Sun Temperature ( C) Softdrink Sales (cans) mean temp mean sales Scatterplot of Drink Sales against Temperature Softdrink Sales (cans) ( 30.43,7.14) Temperature (oc) STSD 41

43 SOLUTIONS Statistics and Standard Deviation STSD-H1/H continued 3. Name Ann Lee Ma Jan Tom Wes Height (cm) Shoe Size mean height mean shoesize Shoe Size against Height Shoe Size 9 8 ( ,9.5) Height (cm) 4. Speed (km/hr) Fuel Econom (km/l) mean speed mean econom Fuel Econom against Speed 0 18 Fuel Econom (km/l) ( 83.75,16.09) Speed (km/hr) STSD 4

44 SOLUTIONS Statistics and Standard Deviation STSD-H3 Correlation 1. A perfect positive B- negative C - positive D - positive E - negative F perfect negative G no correlation H perfect positive. (i) Height at 4ears, height at 16 ears. C (ii) Age of a used car, price of a used car. E (iii) Temperature at 6am, temperature at 3pm. C (iv) Shoe size of mother, number of children in famil. G (v) Average eam result, class size. E positive correlation the better a student did in Test 1 the better the did for the final eamination 3. positive correlation the taller a person the larger their shoe size. Shoe Size against Height Scatterplot of Final Eam against Test Final Eam (%) Shoe Size Test 1 Results (/50) Height (cm). positive correlation the hotter the temperature the more softdrinks sold. Scatterplot of Drink Sales against Temperature 4. negative correlation the faster a car the less economical it is. Fuel Econom against Speed Softdrink Sales (cans) Fuel Econom (km/l) Temperature (oc) Speed (km/hr) STSD 43

45 SOLUTIONS Statistics and Standard Deviation STSD-I1 Correlation Coefficient 1. Temperature, X Hot Drink Sales Y X Y XY Σ X 66 Σ Y 38 Σ X 69 Σ Y 1666 Σ XY 845 r ( )( ) There is a strong negative correlation, , between the sales of hot drinks and the temperature. r Speed, X Econom, Y X Y XY Σ X 670 Σ Y 18.7 Σ X 6350 Σ Y Σ XY r ( )( ) There is a strong negative correlation, 0.99, between the speed and the fuel econom. r r ( )( ) There is a positive correlation, 0.93, between the result in test 1 and the result in final eam. r STSD 44

46 SOLUTIONS Statistics and Standard Deviation STSD-I Regression Equation 1. (a) Temperature, X Hot Drink Sales, Y X XY Σ X 66 Σ Y 38 Σ X 69 Σ XY 845 X Y ΣX n ΣY n ΣXY nx Y α Y βx β ΣX nx ( 3.57) The regression equation is (i) It would be epected that 45 hot drinks would be sold at 10 C. (ii) Can not be predicted as input is outside range of recorded temperatures. (b) Speed, X Econom, Y X XY Σ X 670 Σ Y 18.7 Σ X 6350 Σ XY X Y ΣX n ΣY n ΣXY nx Y β ΣX nx The regression equation is α Y βx ( 0.093) (i) It would be epected that at 75km/hr the econom would be appro. 16.9km/l. (ii) Regression equation can not be used to predict speed from econom.. X ΣX n Y ΣY n ΣXY nx Y β ΣX nx ( ) α Y βx The regression equation is STSD 45

47 SOLUTIONS Statistics and Standard Deviation STSD-K Review Eercise 1. (a) Store Sales for a week (b) Sales () Σ 3613 Σ 8943 CV n $60.17 s 100% % % ( Σ) Σ n s n (i) the mean sales is approimatel $60.17 (ii) the standard deviation of the sales is approimatel $103.6 (iii) the coefficient of variation is approimatel 17.15% Student Mark in a 5 Mark Test Mark ( ) Frequenc, ( ) f f Σ f 30 Σ f 97 Σ f 361 f (c) f f s Σf ( Σf) Σf Σf CV s 100% % % (i) the mean mark is approimatel 3.3 (ii) the standard deviation of the marks is approimatel 1.8 (iii) the coefficient of variation is approimatel 39.6% Dail Rainfall in millimetres Rainfall f f Σ f 1 Σ f 47 Σ f 3779 f f ( Σf) Σf s f Σf s CV 100% 47 Σf % % 6.61 (i) the mean rainfall was approimatel 11.76mm (ii) the standard deviation of the rainfall was approimatel 6.61mm (iii) the coefficient of variation is approimatel 56.% STSD 46

48 SOLUTIONS Statistics and Standard Deviation STSD-K. (a) continued z A proportion of of the cans between 330 ml and 340ml of softdrink. (b) (c) z ( between 0.6 & 0.63) z % of the cans between 35 ml and 340ml of softdrink z Area > % of the cans will overflow. Area between 35 ml & 340ml (d) 5% % ( ) proportion 0.45 below mean z s z between 1.64 and The smallest amount of softdrink that would be accepted would be ml. 3. (a) (b) 1 75% 1 k k k 1 k 4 k ± 0.5 z z z + s z z s z At least 75% of values would lie between 18.4 and % At least 84% of the values should lie been 58 and 107. STSD 47

49 SOLUTIONS Statistics and Standard Deviation STSD-K continued 3. (c) 1 z + 3 z 3 89% 1 k s z s z s 3 OR s k s 3s s 44 s 44 k k 9 k ± The standard deviation is (a) (i) r 0.6 C (iv) r 0.9 D (ii) r 0 G (v) r 1 F (iii) r 0.9 E (b) (i) Das on a good diet, weight B (ii) temperature outside, temperature in a non-airconditioned car D (iii) hand span, height C (iv) rainfall, level of water in river C (v) length of finger nails, intelligence G 5. School Class Size, X Test Result, Y X Y XY A B C D E Σ X 10 Σ Y 400 Σ X 3094 Σ Y 3338 Σ XY 9118 (a)/(b) X Y 80 5 Scatterplot of Test Result against Class size Test result (%) ( 4,80) Class Size STSD 48

50 SOLUTIONS Statistics and Standard Deviation STSD-K 5. (c) (d) (e) r continued ( )( ) There is a negative correlation between the class size and test result. The smaller the class the better the test result. r ΣXY nx Y α Y βx β ΣX nx (.5) The regression equation is (f) (i) (ii) (iii) It would be epected that the test result would be 66.5% when the class size was 30 students. Can not be predicted as input is outside range of recorded class sizes. Regression equation can not be used to predict class size from test result. STSD 49

Chapter 13 Introduction to Linear Regression and Correlation Analysis

Chapter 13 Introduction to Linear Regression and Correlation Analysis Chapter 3 Student Lecture Notes 3- Chapter 3 Introduction to Linear Regression and Correlation Analsis Fall 2006 Fundamentals of Business Statistics Chapter Goals To understand the methods for displaing

More information

MBA 611 STATISTICS AND QUANTITATIVE METHODS

MBA 611 STATISTICS AND QUANTITATIVE METHODS MBA 611 STATISTICS AND QUANTITATIVE METHODS Part I. Review of Basic Statistics (Chapters 1-11) A. Introduction (Chapter 1) Uncertainty: Decisions are often based on incomplete information from uncertain

More information

Solving Quadratic Equations by Graphing. Consider an equation of the form. y ax 2 bx c a 0. In an equation of the form

Solving Quadratic Equations by Graphing. Consider an equation of the form. y ax 2 bx c a 0. In an equation of the form SECTION 11.3 Solving Quadratic Equations b Graphing 11.3 OBJECTIVES 1. Find an ais of smmetr 2. Find a verte 3. Graph a parabola 4. Solve quadratic equations b graphing 5. Solve an application involving

More information

Density Curve. A density curve is the graph of a continuous probability distribution. It must satisfy the following properties:

Density Curve. A density curve is the graph of a continuous probability distribution. It must satisfy the following properties: Density Curve A density curve is the graph of a continuous probability distribution. It must satisfy the following properties: 1. The total area under the curve must equal 1. 2. Every point on the curve

More information

Example: Boats and Manatees

Example: Boats and Manatees Figure 9-6 Example: Boats and Manatees Slide 1 Given the sample data in Table 9-1, find the value of the linear correlation coefficient r, then refer to Table A-6 to determine whether there is a significant

More information

Pearson s Correlation Coefficient

Pearson s Correlation Coefficient Pearson s Correlation Coefficient In this lesson, we will find a quantitative measure to describe the strength of a linear relationship (instead of using the terms strong or weak). A quantitative measure

More information

So, using the new notation, P X,Y (0,1) =.08 This is the value which the joint probability function for X and Y takes when X=0 and Y=1.

So, using the new notation, P X,Y (0,1) =.08 This is the value which the joint probability function for X and Y takes when X=0 and Y=1. Joint probabilit is the probabilit that the RVs & Y take values &. like the PDF of the two events, and. We will denote a joint probabilit function as P,Y (,) = P(= Y=) Marginal probabilit of is the probabilit

More information

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number 1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number A. 3(x - x) B. x 3 x C. 3x - x D. x - 3x 2) Write the following as an algebraic expression

More information

INVESTIGATIONS AND FUNCTIONS 1.1.1 1.1.4. Example 1

INVESTIGATIONS AND FUNCTIONS 1.1.1 1.1.4. Example 1 Chapter 1 INVESTIGATIONS AND FUNCTIONS 1.1.1 1.1.4 This opening section introduces the students to man of the big ideas of Algebra 2, as well as different was of thinking and various problem solving strategies.

More information

2. Simple Linear Regression

2. Simple Linear Regression Research methods - II 3 2. Simple Linear Regression Simple linear regression is a technique in parametric statistics that is commonly used for analyzing mean response of a variable Y which changes according

More information

FINAL EXAM REVIEW MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

FINAL EXAM REVIEW MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. FINAL EXAM REVIEW MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Determine whether or not the relationship shown in the table is a function. 1) -

More information

Chapter 9 Descriptive Statistics for Bivariate Data

Chapter 9 Descriptive Statistics for Bivariate Data 9.1 Introduction 215 Chapter 9 Descriptive Statistics for Bivariate Data 9.1 Introduction We discussed univariate data description (methods used to eplore the distribution of the values of a single variable)

More information

Univariate Regression

Univariate Regression Univariate Regression Correlation and Regression The regression line summarizes the linear relationship between 2 variables Correlation coefficient, r, measures strength of relationship: the closer r is

More information

DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.

DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses. DESCRIPTIVE STATISTICS The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses. DESCRIPTIVE VS. INFERENTIAL STATISTICS Descriptive To organize,

More information

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Ch. Correlation and Regression. Correlation Interpret Scatter Plots and Correlations MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Provide an appropriate

More information

1. a. standard form of a parabola with. 2 b 1 2 horizontal axis of symmetry 2. x 2 y 2 r 2 o. standard form of an ellipse centered

1. a. standard form of a parabola with. 2 b 1 2 horizontal axis of symmetry 2. x 2 y 2 r 2 o. standard form of an ellipse centered Conic Sections. Distance Formula and Circles. More on the Parabola. The Ellipse and Hperbola. Nonlinear Sstems of Equations in Two Variables. Nonlinear Inequalities and Sstems of Inequalities In Chapter,

More information

Correlation key concepts:

Correlation key concepts: CORRELATION Correlation key concepts: Types of correlation Methods of studying correlation a) Scatter diagram b) Karl pearson s coefficient of correlation c) Spearman s Rank correlation coefficient d)

More information

North Carolina Community College System Diagnostic and Placement Test Sample Questions

North Carolina Community College System Diagnostic and Placement Test Sample Questions North Carolina Communit College Sstem Diagnostic and Placement Test Sample Questions 0 The College Board. College Board, ACCUPLACER, WritePlacer and the acorn logo are registered trademarks of the College

More information

C3: Functions. Learning objectives

C3: Functions. Learning objectives CHAPTER C3: Functions Learning objectives After studing this chapter ou should: be familiar with the terms one-one and man-one mappings understand the terms domain and range for a mapping understand the

More information

4. Continuous Random Variables, the Pareto and Normal Distributions

4. Continuous Random Variables, the Pareto and Normal Distributions 4. Continuous Random Variables, the Pareto and Normal Distributions A continuous random variable X can take any value in a given range (e.g. height, weight, age). The distribution of a continuous random

More information

1.6. Piecewise Functions. LEARN ABOUT the Math. Representing the problem using a graphical model

1.6. Piecewise Functions. LEARN ABOUT the Math. Representing the problem using a graphical model . Piecewise Functions YOU WILL NEED graph paper graphing calculator GOAL Understand, interpret, and graph situations that are described b piecewise functions. LEARN ABOUT the Math A cit parking lot uses

More information

DATA INTERPRETATION AND STATISTICS

DATA INTERPRETATION AND STATISTICS PholC60 September 001 DATA INTERPRETATION AND STATISTICS Books A easy and systematic introductory text is Essentials of Medical Statistics by Betty Kirkwood, published by Blackwell at about 14. DESCRIPTIVE

More information

MEASURES OF VARIATION

MEASURES OF VARIATION NORMAL DISTRIBTIONS MEASURES OF VARIATION In statistics, it is important to measure the spread of data. A simple way to measure spread is to find the range. But statisticians want to know if the data are

More information

M122 College Algebra Review for Final Exam

M122 College Algebra Review for Final Exam M122 College Algebra Review for Final Eam Revised Fall 2007 for College Algebra in Contet All answers should include our work (this could be a written eplanation of the result, a graph with the relevant

More information

AP Statistics Solutions to Packet 2

AP Statistics Solutions to Packet 2 AP Statistics Solutions to Packet 2 The Normal Distributions Density Curves and the Normal Distribution Standard Normal Calculations HW #9 1, 2, 4, 6-8 2.1 DENSITY CURVES (a) Sketch a density curve that

More information

The Slope-Intercept Form

The Slope-Intercept Form 7.1 The Slope-Intercept Form 7.1 OBJECTIVES 1. Find the slope and intercept from the equation of a line. Given the slope and intercept, write the equation of a line. Use the slope and intercept to graph

More information

Exercise 1.12 (Pg. 22-23)

Exercise 1.12 (Pg. 22-23) Individuals: The objects that are described by a set of data. They may be people, animals, things, etc. (Also referred to as Cases or Records) Variables: The characteristics recorded about each individual.

More information

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Final Exam Review MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 1) A researcher for an airline interviews all of the passengers on five randomly

More information

Simple linear regression

Simple linear regression Simple linear regression Introduction Simple linear regression is a statistical method for obtaining a formula to predict values of one variable from another where there is a causal relationship between

More information

Zeros of Polynomial Functions. The Fundamental Theorem of Algebra. The Fundamental Theorem of Algebra. zero in the complex number system.

Zeros of Polynomial Functions. The Fundamental Theorem of Algebra. The Fundamental Theorem of Algebra. zero in the complex number system. _.qd /7/ 9:6 AM Page 69 Section. Zeros of Polnomial Functions 69. Zeros of Polnomial Functions What ou should learn Use the Fundamental Theorem of Algebra to determine the number of zeros of polnomial

More information

The Big Picture. Correlation. Scatter Plots. Data

The Big Picture. Correlation. Scatter Plots. Data The Big Picture Correlation Bret Hanlon and Bret Larget Department of Statistics Universit of Wisconsin Madison December 6, We have just completed a length series of lectures on ANOVA where we considered

More information

6.4 Normal Distribution

6.4 Normal Distribution Contents 6.4 Normal Distribution....................... 381 6.4.1 Characteristics of the Normal Distribution....... 381 6.4.2 The Standardized Normal Distribution......... 385 6.4.3 Meaning of Areas under

More information

MATH REVIEW SHEETS BEGINNING ALGEBRA MATH 60

MATH REVIEW SHEETS BEGINNING ALGEBRA MATH 60 MATH REVIEW SHEETS BEGINNING ALGEBRA MATH 60 A Summar of Concepts Needed to be Successful in Mathematics The following sheets list the ke concepts which are taught in the specified math course. The sheets

More information

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96 1 Final Review 2 Review 2.1 CI 1-propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years

More information

Florida Algebra I EOC Online Practice Test

Florida Algebra I EOC Online Practice Test Florida Algebra I EOC Online Practice Test Directions: This practice test contains 65 multiple-choice questions. Choose the best answer for each question. Detailed answer eplanations appear at the end

More information

LESSON EIII.E EXPONENTS AND LOGARITHMS

LESSON EIII.E EXPONENTS AND LOGARITHMS LESSON EIII.E EXPONENTS AND LOGARITHMS LESSON EIII.E EXPONENTS AND LOGARITHMS OVERVIEW Here s what ou ll learn in this lesson: Eponential Functions a. Graphing eponential functions b. Applications of eponential

More information

Chapter 10. Key Ideas Correlation, Correlation Coefficient (r),

Chapter 10. Key Ideas Correlation, Correlation Coefficient (r), Chapter 0 Key Ideas Correlation, Correlation Coefficient (r), Section 0-: Overview We have already explored the basics of describing single variable data sets. However, when two quantitative variables

More information

2.7 Applications of Derivatives to Business

2.7 Applications of Derivatives to Business 80 CHAPTER 2 Applications of the Derivative 2.7 Applications of Derivatives to Business and Economics Cost = C() In recent ears, economic decision making has become more and more mathematicall oriented.

More information

POLYNOMIAL FUNCTIONS

POLYNOMIAL FUNCTIONS POLYNOMIAL FUNCTIONS Polynomial Division.. 314 The Rational Zero Test.....317 Descarte s Rule of Signs... 319 The Remainder Theorem.....31 Finding all Zeros of a Polynomial Function.......33 Writing a

More information

Solving Absolute Value Equations and Inequalities Graphically

Solving Absolute Value Equations and Inequalities Graphically 4.5 Solving Absolute Value Equations and Inequalities Graphicall 4.5 OBJECTIVES 1. Draw the graph of an absolute value function 2. Solve an absolute value equation graphicall 3. Solve an absolute value

More information

" Y. Notation and Equations for Regression Lecture 11/4. Notation:

 Y. Notation and Equations for Regression Lecture 11/4. Notation: Notation: Notation and Equations for Regression Lecture 11/4 m: The number of predictor variables in a regression Xi: One of multiple predictor variables. The subscript i represents any number from 1 through

More information

THE POWER RULES. Raising an Exponential Expression to a Power

THE POWER RULES. Raising an Exponential Expression to a Power 8 (5-) Chapter 5 Eponents and Polnomials 5. THE POWER RULES In this section Raising an Eponential Epression to a Power Raising a Product to a Power Raising a Quotient to a Power Variable Eponents Summar

More information

The correlation coefficient

The correlation coefficient The correlation coefficient Clinical Biostatistics The correlation coefficient Martin Bland Correlation coefficients are used to measure the of the relationship or association between two quantitative

More information

Algebra I Vocabulary Cards

Algebra I Vocabulary Cards Algebra I Vocabulary Cards Table of Contents Expressions and Operations Natural Numbers Whole Numbers Integers Rational Numbers Irrational Numbers Real Numbers Absolute Value Order of Operations Expression

More information

Module 3: Correlation and Covariance

Module 3: Correlation and Covariance Using Statistical Data to Make Decisions Module 3: Correlation and Covariance Tom Ilvento Dr. Mugdim Pašiƒ University of Delaware Sarajevo Graduate School of Business O ften our interest in data analysis

More information

Session 7 Bivariate Data and Analysis

Session 7 Bivariate Data and Analysis Session 7 Bivariate Data and Analysis Key Terms for This Session Previously Introduced mean standard deviation New in This Session association bivariate analysis contingency table co-variation least squares

More information

5.1. A Formula for Slope. Investigation: Points and Slope CONDENSED

5.1. A Formula for Slope. Investigation: Points and Slope CONDENSED CONDENSED L E S S O N 5.1 A Formula for Slope In this lesson ou will learn how to calculate the slope of a line given two points on the line determine whether a point lies on the same line as two given

More information

INTRODUCTION TO ERRORS AND ERROR ANALYSIS

INTRODUCTION TO ERRORS AND ERROR ANALYSIS INTRODUCTION TO ERRORS AND ERROR ANALYSIS To many students and to the public in general, an error is something they have done wrong. However, in science, the word error means the uncertainty which accompanies

More information

Zero and Negative Exponents and Scientific Notation. a a n a m n. Now, suppose that we allow m to equal n. We then have. a am m a 0 (1) a m

Zero and Negative Exponents and Scientific Notation. a a n a m n. Now, suppose that we allow m to equal n. We then have. a am m a 0 (1) a m 0. E a m p l e 666SECTION 0. OBJECTIVES. Define the zero eponent. Simplif epressions with negative eponents. Write a number in scientific notation. Solve an application of scientific notation We must have

More information

Exponential and Logarithmic Functions

Exponential and Logarithmic Functions Chapter 6 Eponential and Logarithmic Functions Section summaries Section 6.1 Composite Functions Some functions are constructed in several steps, where each of the individual steps is a function. For eample,

More information

Statistics. Measurement. Scales of Measurement 7/18/2012

Statistics. Measurement. Scales of Measurement 7/18/2012 Statistics Measurement Measurement is defined as a set of rules for assigning numbers to represent objects, traits, attributes, or behaviors A variableis something that varies (eye color), a constant does

More information

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4) Summary of Formulas and Concepts Descriptive Statistics (Ch. 1-4) Definitions Population: The complete set of numerical information on a particular quantity in which an investigator is interested. We assume

More information

CALCULATIONS & STATISTICS

CALCULATIONS & STATISTICS CALCULATIONS & STATISTICS CALCULATION OF SCORES Conversion of 1-5 scale to 0-100 scores When you look at your report, you will notice that the scores are reported on a 0-100 scale, even though respondents

More information

Descriptive Statistics

Descriptive Statistics Y520 Robert S Michael Goal: Learn to calculate indicators and construct graphs that summarize and describe a large quantity of values. Using the textbook readings and other resources listed on the web

More information

15.1. Exact Differential Equations. Exact First-Order Equations. Exact Differential Equations Integrating Factors

15.1. Exact Differential Equations. Exact First-Order Equations. Exact Differential Equations Integrating Factors SECTION 5. Eact First-Order Equations 09 SECTION 5. Eact First-Order Equations Eact Differential Equations Integrating Factors Eact Differential Equations In Section 5.6, ou studied applications of differential

More information

Summer Math Exercises. For students who are entering. Pre-Calculus

Summer Math Exercises. For students who are entering. Pre-Calculus Summer Math Eercises For students who are entering Pre-Calculus It has been discovered that idle students lose learning over the summer months. To help you succeed net fall and perhaps to help you learn

More information

Paper No 19. FINALTERM EXAMINATION Fall 2009 MTH302- Business Mathematics & Statistics (Session - 2) Ref No: Time: 120 min Marks: 80

Paper No 19. FINALTERM EXAMINATION Fall 2009 MTH302- Business Mathematics & Statistics (Session - 2) Ref No: Time: 120 min Marks: 80 Paper No 19 FINALTERM EXAMINATION Fall 2009 MTH302- Business Mathematics & Statistics (Session - 2) Ref No: Time: 120 min Marks: 80 Question No: 1 ( Marks: 1 ) - Please choose one Scatterplots are used

More information

Key Concept. Density Curve

Key Concept. Density Curve MAT 155 Statistical Analysis Dr. Claude Moore Cape Fear Community College Chapter 6 Normal Probability Distributions 6 1 Review and Preview 6 2 The Standard Normal Distribution 6 3 Applications of Normal

More information

Common Tools for Displaying and Communicating Data for Process Improvement

Common Tools for Displaying and Communicating Data for Process Improvement Common Tools for Displaying and Communicating Data for Process Improvement Packet includes: Tool Use Page # Box and Whisker Plot Check Sheet Control Chart Histogram Pareto Diagram Run Chart Scatter Plot

More information

Introduction to Basic Reliability Statistics. James Wheeler, CMRP

Introduction to Basic Reliability Statistics. James Wheeler, CMRP James Wheeler, CMRP Objectives Introduction to Basic Reliability Statistics Arithmetic Mean Standard Deviation Correlation Coefficient Estimating MTBF Type I Censoring Type II Censoring Eponential Distribution

More information

CHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression

CHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression Opening Example CHAPTER 13 SIMPLE LINEAR REGREION SIMPLE LINEAR REGREION! Simple Regression! Linear Regression Simple Regression Definition A regression model is a mathematical equation that descries the

More information

SAMPLE. Polynomial functions

SAMPLE. Polynomial functions Objectives C H A P T E R 4 Polnomial functions To be able to use the technique of equating coefficients. To introduce the functions of the form f () = a( + h) n + k and to sketch graphs of this form through

More information

Section 3-3 Approximating Real Zeros of Polynomials

Section 3-3 Approximating Real Zeros of Polynomials - Approimating Real Zeros of Polynomials 9 Section - Approimating Real Zeros of Polynomials Locating Real Zeros The Bisection Method Approimating Multiple Zeros Application The methods for finding zeros

More information

Direct Variation. 1. Write an equation for a direct variation relationship 2. Graph the equation of a direct variation relationship

Direct Variation. 1. Write an equation for a direct variation relationship 2. Graph the equation of a direct variation relationship 6.5 Direct Variation 6.5 OBJECTIVES 1. Write an equation for a direct variation relationship 2. Graph the equation of a direct variation relationship Pedro makes $25 an hour as an electrician. If he works

More information

MATH 185 CHAPTER 2 REVIEW

MATH 185 CHAPTER 2 REVIEW NAME MATH 18 CHAPTER REVIEW Use the slope and -intercept to graph the linear function. 1. F() = 4 - - Objective: (.1) Graph a Linear Function Determine whether the given function is linear or nonlinear..

More information

2 3 Histograms, Frequency Polygons, and Ogives

2 3 Histograms, Frequency Polygons, and Ogives 48 Chapter 2 Frequenc Distributions and Graphs 4. In Data Analsis, select Histogram and click the [OK] button. 5. In the Histogram dialog bo, tpe A1:A5 as the Input Range. 6. Select New Worksheet Pl, and

More information

CHAPTER TEN. Key Concepts

CHAPTER TEN. Key Concepts CHAPTER TEN Ke Concepts linear regression: slope intercept residual error sum of squares or residual sum of squares sum of squares due to regression mean squares error mean squares regression (coefficient)

More information

7.7 Solving Rational Equations

7.7 Solving Rational Equations Section 7.7 Solving Rational Equations 7 7.7 Solving Rational Equations When simplifying comple fractions in the previous section, we saw that multiplying both numerator and denominator by the appropriate

More information

DISTANCE, CIRCLES, AND QUADRATIC EQUATIONS

DISTANCE, CIRCLES, AND QUADRATIC EQUATIONS a p p e n d i g DISTANCE, CIRCLES, AND QUADRATIC EQUATIONS DISTANCE BETWEEN TWO POINTS IN THE PLANE Suppose that we are interested in finding the distance d between two points P (, ) and P (, ) in the

More information

Answer: C. The strength of a correlation does not change if units change by a linear transformation such as: Fahrenheit = 32 + (5/9) * Centigrade

Answer: C. The strength of a correlation does not change if units change by a linear transformation such as: Fahrenheit = 32 + (5/9) * Centigrade Statistics Quiz Correlation and Regression -- ANSWERS 1. Temperature and air pollution are known to be correlated. We collect data from two laboratories, in Boston and Montreal. Boston makes their measurements

More information

Section 3-7. Marginal Analysis in Business and Economics. Marginal Cost, Revenue, and Profit. 202 Chapter 3 The Derivative

Section 3-7. Marginal Analysis in Business and Economics. Marginal Cost, Revenue, and Profit. 202 Chapter 3 The Derivative 202 Chapter 3 The Derivative Section 3-7 Marginal Analysis in Business and Economics Marginal Cost, Revenue, and Profit Application Marginal Average Cost, Revenue, and Profit Marginal Cost, Revenue, and

More information

EQUATIONS OF LINES IN SLOPE- INTERCEPT AND STANDARD FORM

EQUATIONS OF LINES IN SLOPE- INTERCEPT AND STANDARD FORM . Equations of Lines in Slope-Intercept and Standard Form ( ) 8 In this Slope-Intercept Form Standard Form section Using Slope-Intercept Form for Graphing Writing the Equation for a Line Applications (0,

More information

Core Maths C2. Revision Notes

Core Maths C2. Revision Notes Core Maths C Revision Notes November 0 Core Maths C Algebra... Polnomials: +,,,.... Factorising... Long division... Remainder theorem... Factor theorem... 4 Choosing a suitable factor... 5 Cubic equations...

More information

CURVE FITTING LEAST SQUARES APPROXIMATION

CURVE FITTING LEAST SQUARES APPROXIMATION CURVE FITTING LEAST SQUARES APPROXIMATION Data analysis and curve fitting: Imagine that we are studying a physical system involving two quantities: x and y Also suppose that we expect a linear relationship

More information

Functions and Graphs CHAPTER INTRODUCTION. The function concept is one of the most important ideas in mathematics. The study

Functions and Graphs CHAPTER INTRODUCTION. The function concept is one of the most important ideas in mathematics. The study Functions and Graphs CHAPTER 2 INTRODUCTION The function concept is one of the most important ideas in mathematics. The stud 2-1 Functions 2-2 Elementar Functions: Graphs and Transformations 2-3 Quadratic

More information

NAME DATE PERIOD. 11. Is the relation (year, percent of women) a function? Explain. Yes; each year is

NAME DATE PERIOD. 11. Is the relation (year, percent of women) a function? Explain. Yes; each year is - NAME DATE PERID Functions Determine whether each relation is a function. Eplain.. {(, ), (0, 9), (, 0), (7, 0)} Yes; each value is paired with onl one value.. {(, ), (, ), (, ), (, ), (, )}. No; in the

More information

Simple Regression Theory II 2010 Samuel L. Baker

Simple Regression Theory II 2010 Samuel L. Baker SIMPLE REGRESSION THEORY II 1 Simple Regression Theory II 2010 Samuel L. Baker Assessing how good the regression equation is likely to be Assignment 1A gets into drawing inferences about how close the

More information

3.2 Measures of Spread

3.2 Measures of Spread 3.2 Measures of Spread In some data sets the observations are close together, while in others they are more spread out. In addition to measures of the center, it's often important to measure the spread

More information

Higher. Polynomials and Quadratics 64

Higher. Polynomials and Quadratics 64 hsn.uk.net Higher Mathematics UNIT OUTCOME 1 Polnomials and Quadratics Contents Polnomials and Quadratics 64 1 Quadratics 64 The Discriminant 66 3 Completing the Square 67 4 Sketching Parabolas 70 5 Determining

More information

2.3 Quadratic Functions

2.3 Quadratic Functions 88 Linear and Quadratic Functions. Quadratic Functions You ma recall studing quadratic equations in Intermediate Algebra. In this section, we review those equations in the contet of our net famil of functions:

More information

X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1)

X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1) CORRELATION AND REGRESSION / 47 CHAPTER EIGHT CORRELATION AND REGRESSION Correlation and regression are statistical methods that are commonly used in the medical literature to compare two or more variables.

More information

SECTION 2-2 Straight Lines

SECTION 2-2 Straight Lines - Straight Lines 11 94. Engineering. The cross section of a rivet has a top that is an arc of a circle (see the figure). If the ends of the arc are 1 millimeters apart and the top is 4 millimeters above

More information

Name Class Date. Additional Vocabulary Support

Name Class Date. Additional Vocabulary Support - Additional Vocabular Support Rate of Change and Slope Concept List negative slope positive slope rate of change rise run slope slope formula slope of horizontal line slope of vertical line Choose the

More information

1. The parameters to be estimated in the simple linear regression model Y=α+βx+ε ε~n(0,σ) are: a) α, β, σ b) α, β, ε c) a, b, s d) ε, 0, σ

1. The parameters to be estimated in the simple linear regression model Y=α+βx+ε ε~n(0,σ) are: a) α, β, σ b) α, β, ε c) a, b, s d) ε, 0, σ STA 3024 Practice Problems Exam 2 NOTE: These are just Practice Problems. This is NOT meant to look just like the test, and it is NOT the only thing that you should study. Make sure you know all the material

More information

Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY

Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY 1. Introduction Besides arriving at an appropriate expression of an average or consensus value for observations of a population, it is important to

More information

Section 14 Simple Linear Regression: Introduction to Least Squares Regression

Section 14 Simple Linear Regression: Introduction to Least Squares Regression Slide 1 Section 14 Simple Linear Regression: Introduction to Least Squares Regression There are several different measures of statistical association used for understanding the quantitative relationship

More information

Mark Scheme 4767 June 2005 GENERAL INSTRUCTIONS Marks in the mark scheme are explicitly designated as M, A, B, E or G. M marks ("method") are for an attempt to use a correct method (not merely for stating

More information

SECTION 5-1 Exponential Functions

SECTION 5-1 Exponential Functions 354 5 Eponential and Logarithmic Functions Most of the functions we have considered so far have been polnomial and rational functions, with a few others involving roots or powers of polnomial or rational

More information

Week 3&4: Z tables and the Sampling Distribution of X

Week 3&4: Z tables and the Sampling Distribution of X Week 3&4: Z tables and the Sampling Distribution of X 2 / 36 The Standard Normal Distribution, or Z Distribution, is the distribution of a random variable, Z N(0, 1 2 ). The distribution of any other normal

More information

STT315 Chapter 4 Random Variables & Probability Distributions KM. Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables

STT315 Chapter 4 Random Variables & Probability Distributions KM. Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables Discrete vs. continuous random variables Examples of continuous distributions o Uniform o Exponential o Normal Recall: A random

More information

II. DISTRIBUTIONS distribution normal distribution. standard scores

II. DISTRIBUTIONS distribution normal distribution. standard scores Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,

More information

Slope-Intercept Form and Point-Slope Form

Slope-Intercept Form and Point-Slope Form Slope-Intercept Form and Point-Slope Form In this section we will be discussing Slope-Intercept Form and the Point-Slope Form of a line. We will also discuss how to graph using the Slope-Intercept Form.

More information

3.4 Statistical inference for 2 populations based on two samples

3.4 Statistical inference for 2 populations based on two samples 3.4 Statistical inference for 2 populations based on two samples Tests for a difference between two population means The first sample will be denoted as X 1, X 2,..., X m. The second sample will be denoted

More information

Chapter 23. Inferences for Regression

Chapter 23. Inferences for Regression Chapter 23. Inferences for Regression Topics covered in this chapter: Simple Linear Regression Simple Linear Regression Example 23.1: Crying and IQ The Problem: Infants who cry easily may be more easily

More information

17. SIMPLE LINEAR REGRESSION II

17. SIMPLE LINEAR REGRESSION II 17. SIMPLE LINEAR REGRESSION II The Model In linear regression analysis, we assume that the relationship between X and Y is linear. This does not mean, however, that Y can be perfectly predicted from X.

More information

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Objectives: To perform a hypothesis test concerning the slope of a least squares line To recognize that testing for a

More information

6. The given function is only drawn for x > 0. Complete the function for x < 0 with the following conditions:

6. The given function is only drawn for x > 0. Complete the function for x < 0 with the following conditions: Precalculus Worksheet 1. Da 1 1. The relation described b the set of points {(-, 5 ),( 0, 5 ),(,8 ),(, 9) } is NOT a function. Eplain wh. For questions - 4, use the graph at the right.. Eplain wh the graph

More information

Def: The standard normal distribution is a normal probability distribution that has a mean of 0 and a standard deviation of 1.

Def: The standard normal distribution is a normal probability distribution that has a mean of 0 and a standard deviation of 1. Lecture 6: Chapter 6: Normal Probability Distributions A normal distribution is a continuous probability distribution for a random variable x. The graph of a normal distribution is called the normal curve.

More information

Common Core Unit Summary Grades 6 to 8

Common Core Unit Summary Grades 6 to 8 Common Core Unit Summary Grades 6 to 8 Grade 8: Unit 1: Congruence and Similarity- 8G1-8G5 rotations reflections and translations,( RRT=congruence) understand congruence of 2 d figures after RRT Dilations

More information

Elementary Statistics

Elementary Statistics Elementary Statistics Chapter 1 Dr. Ghamsary Page 1 Elementary Statistics M. Ghamsary, Ph.D. Chap 01 1 Elementary Statistics Chapter 1 Dr. Ghamsary Page 2 Statistics: Statistics is the science of collecting,

More information