Engineering Mathematics 4

Size: px
Start display at page:

Download "Engineering Mathematics 4"

Transcription

1 Engineering Mathematics 4 David Ramsey Room: B david.ramsey@ul.ie website: January 20, / 94

2 Course Outline 1. Data Collection and Descriptive Statistics. 2. Probability Theory 3. Statistical Inference 2 / 94

3 Course Texts 1. Engineering Mathematics 4 - Ref. No includes exercise lists, statistical tables and past papers, will be available from the print room). 2. Montgomery - Applied Statistics and Probability for Engineers. 3. Stuart - An introduction to statistical analysis for business and industry: a problem solving approach. 4. Montgomery, Runger, Fari - Engineering Statistics. Similar books are available in section 519 of the library. 3 / 94

4 1 Data Collection and Descriptive Statistics Populations of objects and individuals show variation with respect to various traits (e.g. height, political preferences, the working life of a light bulb). It is impractical to observe all the members of the population. In order to describe the distribution of a trait in the population, we select a sample. On the basis of the sample we gain information on the population as a whole. 4 / 94

5 1.1 Types of Variables 1. Qualitive variables: These are normally categorical (non-numerical) variables. We distinguish between two types of qualitative variables: a) nominal: these variables are not naturally ordered in any way (e.g. i. department - mechanical engineering, mathematics, economics ii. industrial sector). b) ordinal: there is a natural order for such categorisations e.g. with respect to smoking, people may be categorised as 1: non-smokers, 2: light smokers and 3: heavy smokers. It can be seen that the higher the category number, the more an individual smokes. Exam grades are ordinal variables. 5 / 94

6 2. Quantitative Variables These are variables which naturally take numerical values (e.g. age, height, number of children). Such variables can be measured or counted. As before we distinguish between two types of quantitative variables. a) Discrete variables: these are variables that take values from a set that can be listed (most commonly integer values, i.e. they are variables that can be counted). For example, number of children, the results of die rolls. 6 / 94

7 b) Continuous variables These are variables that can take values in a given range to any number of decimal places (such variables are measured, normally according to some unit e.g. height, age, weight). It should be noted that such variables are only measured to a given accuracy (i.e. height is measured to the nearest centimetre, age is normally given to the nearest year). If a discrete random variable takes many values (i.e. the population of a town), then for practical purposes it is treated as a continuous variable. 7 / 94

8 1.2 Collection of data Since it is impractical to survey all the individuals in a population, we need to base our analysis on a sample. A population is the entire collection of individuals or objects we wish to describe. A sample is the subset of the population chosen for data collection. A sampling frame is the list used to chose the sample. A unit is a member of the population. A variable is any trait that varies from unit to unit. 8 / 94

9 Collection of data For example, suppose we wish to investigate the political preferences of Irish voters (our variable of interest). The population is made up of people eligible to vote in Irish elections. A unit is any eligible voter. The lists of eligible voters for each constituency may be used as the sampling frame. 9 / 94

10 Collection of data A list of addresses in various constituencies may be used as the sampling frame. However, there is no one to one correspondence between addresses and eligible voters. The sample is the set of people asked for their political preferences. The variable of interest is the party an individual wishes to vote for (other socio-demographic variables may be collected, such as age, occupation, education). The sample size is the number of individuals in a sample and is denoted by n. 10 / 94

11 1.2.1 Parameters and Statistics A parameter is an unkown number describing a population. For example, it may be that 9% of the population of eligible voters wish to vote for the Green Party (we do not, however, observe this population proportion). A statistic is a number describing a sample. For example, 8% of a sample may wish to vote for the Green Party. This is the sample proportion. Statistics may be used to describe a population, but they only estimate the real parameters of the population. 11 / 94

12 Parameters and Statistics - Precision of Statistics Naturally, the statistics from a sample will show some variation around the appropriate parameters e.g. 9% of the population wish to vote for the Green Party, but only 8% in the sample. The greater the sample size, the more precise the results (suppose we take a large number of samples of size n, the larger n the less variable the sample proportion from the various samples, i.e. the smaller the sample variance). 12 / 94

13 Parameters and Statistics - Bias However, there may be intrinsic bias from two possible sources: a) Sampling bias - when a sample is chosen in a way such that some members of the population are more likely to be chosen than others. e.g. Suppose that the Labour party is most popular in Dublin. If we used samples of voters from Dublin to estimate support in the whole of Ireland, we would systematically overestimate the support of the Labour party. b) Non-Sampling Bias This results from mistakes in data entry and/or how interviewees react to being questioned. For example, Fianna Fail supporters may be more likely to hide their preference than other individuals. In this case, we would systematically underestimate the support of Fianna Fail. 13 / 94

14 Non-Sampling Bias Other sources of bias may be: 1. Lack of anonymity. 2. The wording of a question. 3. The desire to give an answer that would please the interviewer (e.g. surveys may systematically overestimate the willingness of individuals to pay extra for environmentally friendly goods). 14 / 94

15 Precision and Bias It should be noted that bias is a characteristic of the way in which data are collected not a single sample. Increasing the sample size will improve the precision of an estimate, but will not affect the bias. 15 / 94

16 1.2.2 Random Sampling Sampling is said to be random if each individual has an equal probability of being chosen to be in the sample and this probability is not affected by who else is chosen to be in the sample. An estimate of a parameter is unbiased if there is no systematic tendency to under- or over-estimate the parameter. Random sampling does not ensure that the estimates of parameters are unbiased. However, the bias does not result from the sampling procedure (see above). 16 / 94

17 Another Example of Sampling Bias - Estimation of the Population Mean e.g. Suppose the population of interest is the Irish population as a whole and the variable of interest height. Suppose I base my estimate of the mean height of the population on the mean height of a sample of students. Since students tend to be on average taller than the population as a whole, I will systematically overestimate the mean height in the population. That is to say, if I consider many samples of students of say size 100, a large majority of such samples would give me an overestimate of the mean height of the population as a whole. 17 / 94

18 1.3 Descriptive Statistics Qualitative (Categorical Data) We may describe qualitative data using a) Frequency tables. b) Bar charts. c) Pie charts. n denotes the total number of observations (the sample size). 18 / 94

19 Frequency tables Frequency tables display how many observations fall into each category (the frequency column), as well as the relative frequency of each category (the proportion of observations falling into each category). Let n i denote the number of observations in category i. The relative frequency of category i is f i, where f i = n i n Multiplying by 100, we obtain the relative frequency as a percentage. If there are missing data we may also give the relative frequencies in terms of the actual number of observations, n i.e. f i = n i n 19 / 94

20 Frequency tables For example 200 students were asked which of the following bands they preferred: Franz Ferdinand, Radiohead or Coldplay. The answers may be presented in the following frequency table Band Frequency Relative Frequency (% ) Coldplay /200 = 31 Franz F /200 = 33 Radiohead /200 = / 94

21 Bar chart In a bar chart the height of a bar represents the relative frequency of a given category (or the number of observations in that category). 21 / 94

22 Pie chart The size of a slice in a pie chart represents the relative frequency of a category. Hence, the angle made by the slice representing category i is given (in degrees) by α i, where α i = 360f i = 360n i n (i.e. we multiply the relative frequency by the number of degrees in a full revolution). If the relative frequency of observations in group i is given in percentage terms, denoted p i. 1% of the observations in the sample correspond to an angle of 3.6 degrees. Thus, α i = 3.6p i. 22 / 94

23 Pie chart 23 / 94

24 Example 1.1 Suppose 1000 randomly chosen voters are asked which party they are going to vote for in the forthcoming election. The results are as given below: Fianna Fail: 360 Fine Gael: 270 Green Party: 100 Labour: 90 Progressive Democrats: 70 No Answer: / 94

25 Example 1.1 The frequency table is as follows: Party Frequency Rel. Freq. (% ) Of non-missing data Fianna Fail = Fine Gael = Green Party = Labour = Prog. Dems = 7.87 No answer / 94

26 Example 1.1 The bar chart illustrating just the non-missing data is given by 26 / 94

27 Example 1.1 The pie chart for the sample as a whole (i.e. including those who gave no answer) is derived as follows: The angle made by the slice representing the support of Fianna Fail is given by = The angle made by the slice representing the support of Fianna Fail is given by = 97.20, etc. 27 / 94

28 Example 1.1 The pie chart is as follows: 28 / 94

29 1.3.2 Graphical Presentation of Quantitative Data Discrete data can be presented in the form of frequency tables and/or bar charts (as above). The distribution of continuous data can be presented using a) A histogram. b) Its empirical distribution function (also referred to as an OGIVE). 29 / 94

30 Histograms for continuous variables In order to draw a histogram for a continuous variable, we need to categorise the data into intervals of equal length. The end points of these intervals should be round numbers. The number of categories used should be approximately n (normally between 5 and 20 categories are used). For example, if we have 30 observations then we should use about categories. Hence, 5 and 6 are sensible choices for the number of categories. Let k be the number of categories. 30 / 94

31 Histograms In order to choose the length of each interval, L, we use L x max x min = r k k, where x max is smallest round number larger than all the observations and x min is the largest round number smaller than all the observations. The difference between these numbers is an estimate of the range of the data, denoted r. If necessary L is rounded upwards, so that the intervals are of nice length and the whole range of the data is covered. 31 / 94

32 Histograms The intervals used are [x min, x min + L], (x min + L, x min + 2L],..., (x max L, x max ]. In general the lower end-point of an interval is assumed not to belong to that interval (to avoid a number belonging to two classes). 32 / 94

33 Histograms A histogram is very similar to a bar chart. The height of the block corresponding to an interval is the relative frequency of observations in that block. Thus, the height of a block is the number of observations in that interval divided by the total number of observations. 33 / 94

34 The Ogive Suppose we have a reasonably large amount of data. In order to draw the empirical distribution function (ogive), we first split the data into categories as when drawing a histogram. Let x 0, x 1, x 2,..., x k be the endpoints of the intervals formed (k denotes the number of intervals). That is to say the i-th interval is (x i 1, x i ] 34 / 94

35 The Ogive The ogive is a graph of the cumulative relative frequency. The cumulative relative frequency at an endpoint x i is the proportion of observations less than x i. Note that no observations are smaller than the lower endpoint of the first interval (x 0 ), i.e. at x 0 the cumulative relative frequency is 0. The cumulative relative frequency at the upper endpoint of an interval can be calculated by adding the relative frequency of observations in that interval to the cumulative relative frequency at the lower endpoint of that interval (the upper endpoint of the previous interval). 35 / 94

36 The Ogive Hence, we can calculate the cumulative relative frequency at the endpoints of each interval. We then draw a scatter plot for the k + 1 values of the cumulative relative frequency at the endpoints of each interval. The X-coordinate of each of these k + 1 points is an endpoint of one of the intervals and the Y-coordinate is the cumulative relative frequency at that endpoint. To draw the OGIVE we connect each of these points to the next using a straight line. Note that the height of the OGIVE at the final endpoint is by definition / 94

37 Example 1.2 We observe the height of 20 individuals (in cm). The data are given below 172, 165, 188, 162, 178, 183, 171, 158, 174, 184, 167, 175, 192, 170, 179, 187, 163, 156, 178, 182. Draw a histogram and OGIVE representing these data. 37 / 94

38 Example 1.2 We first consider the histogram. First we choose the number of classes and the corresponding intervals , thus we should choose 4 or 5 intervals. 38 / 94

39 Example 1.2 The tallest individual is 192cm tall and the shortest 156cm. 200cm is the smallest round number larger than all the observations and 150cm is the largest round number smaller than all the observations. Thus, we take the range to be 50. To calculate the length of the intervals L = r k. Taking k to be 4, L = Taking k = 5, L = 10 (a nicer length). Hence, it seems reasonable to use 5 intervals of length 10, starting at / 94

40 Example 1.2 If we assume that the upper endpoint of an interval belongs to that interval, then we have the intervals [150,160], (160, 170], (170,180], (180,190], (190,200]. Now we count how many observations fall into each interval and hence the relative frequency of observations in each interval. 40 / 94

41 Example 1.2 Height (x) No. of Observations Rel. Frequency 150 x /20 = < x /20 = < x /20 = < x /20 = < x /20 = / 94

42 Example 1.2 The histogram is given below: 42 / 94

43 Interpretation of the histogram of a continuous variable A histogram is an estimator of the density function of a variable (see the chapter on the distribution of random variables in Section 2). The distribution of height seems to be reasonably symmetrical around 175cm. 43 / 94

44 Example The Ogive From the frequency table, the cumulative relative frequencies at the endpoints of the intervals are given by: Endpoint Cum. Rel. Freq = = = = 1 The Ogive is given on the next slide 44 / 94

45 The Ogive / 94

46 The Ogive The graph of the ogive can be used to estimate percentiles of a distribution. By definition α% of observations are less than the α-percentile. For example, the 60-percentile of height may be estimated by drawing a horizontal line from 0.6 on the y-axis until it hits the cumulative distribution function. The 60-percentile is the value on the x-axis directly below this point of intersection. This is illustrated on the next slide. 46 / 94

47 Estimation of percentiles / 94

48 The Ogive The 60-percentile of height in this example is approximately 178cm. The value of the cumulative relative frequency at the upper endpoint of the final interval is by definition / 94

49 1.3.3 Symmetry and Skewness of Distributions From a histogram we may infer whether the distribution of a random variable is symmetric or not. The histogram of height shows that the distribution is reasonably symmetric (even if the distribution of height in the population were symmetric, we would normally observe some small deviation from symmetry in the histogram as we observe only a sample). 49 / 94

50 Right-Skewed distributions A distribution is said to be right-skewed if there are observations a long way to the right of the centre of the distribution, but not a long way to the left. For example, the distribution of the age of students is right-skewed. Must students will be around 20 years of age. None will be much younger, but there are some mature students. The distribution of wages will also be right-skewed. 50 / 94

51 A right-skewed distribution 51 / 94

52 Left-skewed distributions A distribution is said to be left-skewed if there are observations a long way to the left of the centre of the distribution, but not a long way to the right. For example, the distribution of weight of participants in the Oxford-Cambridge boat race will have a left-skewed distribution. This is due to the fact that the majority of participants will be heavy rowers, while a minority will be very light coxes. 52 / 94

53 A Leftskewed Distribution 53 / 94

54 1.4 Numerical Methods of Describing Quantitative Data We consider two types of measure: 1. Measures of centrality - give information regarding the location of the centre of the distribution (the mean, median). 2. Measures of variability (dispersion) - give information regarding the level of variation (the range, variance, standard deviation, interquartile range). 54 / 94

55 1.4.1 Measures of centrality 1. The Sample Mean, x. Suppose we have a sample of n observations, the mean is given by the sum of the observations divided by the number of observations. x = 1 n n x i, i=1 where x i is the value of the i-th observation. 55 / 94

56 The Population Mean µ denotes the population mean. If there are N units in the population, then N i=1 µ = x i, N where x i is the value of the trait for individual i in the population. µ is normally unknown. The sample mean x (a statistic) is an estimator of the population mean µ (a parameter). 56 / 94

57 2. The sample median Q 2 In order to calculate the sample median, we first order the observations from the smallest to the largest. The order statistic x (i) is the i-th smallest observation in a sample (i.e. x (1) is the smallest observation and x (n) is the largest observation). The notation for the median comes from the fact that the median is the second quartile (see quartiles in the section on measures of dispersion). 57 / 94

58 The sample median Q 2 If n is odd, then the median is the observation which appears in the centre of the ordered list of observations. Hence, Q 2 = x (0.5[n+1]). If n is even, then the median is the average of the two observations which appear in the centre of the ordered list of observations. Hence, Q 2 = 0.5[x (0.5n) + x (0.5n+1) ] One half of the observations are smaller than the median and one half are greater. 58 / 94

59 The sample median One advantage of the median as a measure of centrality is that it is less sensitive to extreme observations (which may be errors) than the mean. When the distribution is skewed, it is preferable to use the median as a measure of centrality. e.g. the median wage rather than the average wage should be used as a measure of what the average man on the street earns. The distribution of wages is right-skewed and the small proportion of people who earn very high wages will have a significant effect on the mean. The mean is greater than the median. For left-skewed distributions the mean is less than the median. 59 / 94

60 The sample median When we have a large number of observations the 50% -percentile can be used to approximate the median. This can be read from the OGIVE. Using the OGIVE of height considered earlier, the median height is approximately 175cm. A more accurate method of approximating the median in such cases is considered in the section on grouped data. 60 / 94

61 1.4.2 Measures of Dispersion - 1. The Range The range is defined to be the largest observation minus the smallest observation. Since the range is only based on 2 observations it conveys little information and is sensitive to extreme values (errors). 61 / 94

62 2. The sample variance s 2 The sample variance is a measure of the average square distance from the mean. The formula for the sample variance is given by s 2 = 1 n 1 n (x i x) 2. s 2 0 and s 2 = 0 if and only if all the observations are equal to each other. i=1 62 / 94

63 3. The sample standard deviation s The sample standard deviation is given by the square root of the variance. It (and hence the sample variance) can be calculated on a scientific calculator by using the σ n 1 or s n 1 function as appropriate. In simple terms, the standard deviation is a measure of the average distance of an observation from the mean. It cannot be greater than the maximum deviation from the mean. 63 / 94

64 4. The interquartile range The i-th quartile, Q i, is taken to be the value such that i quarters of the observations are less than Q i. Thus, Q 2 is the sample median. If n+1 4 is an integer, then the lower quartile Q 1 is given by Q 1 = x ( n+1 4 ) Otherwise, if a is the integer part of n+1 4 [this is obtained by simply removing everything after the decimal point], then Q 1 = 0.5[x (a) + x (a+1) ] 64 / 94

65 The interquartile range If 3n+3 4 is an integer, then the upper quartile Q 3 is given by Q 3 = x ( 3n+3 4 ) Otherwise, if b is the integer part of 3n+3 4, then Q 3 = 0.5[x (b) + x (b+1) ] 65 / 94

66 The interquartile range When there is a large amount of data, the quartiles Q 1, Q 2, Q 3 can be calculated from the ogive as the 25-th, 50-th and 75-th percentiles, respectively. From the OGIVE, the lower and upper quartiles for the height data given previously are approximately 166cm and 182cm, respectively. The interquartile range (IQR) is the difference between the upper and lower quartiles IQR = Q 3 Q 1 i.e. for the height data approximately 16cm. 66 / 94

67 The interquartile range The quartiles can be used to display the distribution of a trait in the form of a box plot. The central line represents the median, the ends of the box represent the lower and upper quartiles. Points outside the whiskers represent outliers. Box plots can be used to investigate whether a distribution is skewed (see following diagrams). 67 / 94

68 A symmetric distribution The ends of the boxes and the whiskers are symmetrically distributed about the median. 68 / 94

69 A right skewed distribution There are several outliers much greater than the median, but none much smaller than the median. The upper endpoint of the whisker is much further from the median than the lower endpoint. 69 / 94

70 A left skewed distribution There are several outliers much smaller than the median, but none much greater than the median. The upper endpoint of the whisker is much closer to the median than the lower endpoint. 70 / 94

71 Choice of the measure of dispersion The units of all the measures used so far (except for the variance) are the same units as those used for the measurement of observations. The units of variance are the square of the units of measurement. For example, if we observe velocity in metres per second, the variance is measured in metres squared per second squared. For this reason the standard deviation is generally preferred to the variance as a measure of dispersion. If a distribution is skewed then the interquartile range is a more reliable measure of the dispersion of a random variable than the standard deviation. 71 / 94

72 Comparison of the dispersion of two variables Sometimes we wish to compare the dispersion of two variables. In cases where different units are used to measure the two variables or the means of two variables are very different, it may be useful to use a measure of dispersion which does not depend on the units in which it is measured. The coefficient of variation C.V. does not depend on the units of measurement. It is the standard deviation divided by the sample mean C.V. = s x. 72 / 94

73 Example The sample mean Calculate the measures of centrality and dispersion defined above for the following data. There are 6 items of data hence, 6, 9, 12, 9, 8, 10 x = 6 i=1 x i 6 = = 9 73 / 94

74 Example The sample median In order to calculate the median, we first order the data. If an observation occurs k times, then it must appear k times in the list of ordered data. The ordered list of data is 6, 8, 9, 9, 10, 12. Since there is an even number of data (n = 6), the median is the average of the two observations in the middle of this ordered list. Hence, Q 2 = 0.5[x (n/2) + x (1+ n 2 ) ] = 0.5[x (3) + x (4) ] = / 94

75 Example The range The range is the difference between the largest and the smallest observations Range = 12 6 = / 94

76 Example The variance and standard deviation The variance is given by s 2 = 1 n 1 n (x i x) 2 i=1 = (6 9)2 + (9 9) 2 + (12 9) 2 + (9 9) 2 + (8 9) 2 + (10 9) 2 5 =4 The standard deviation is given by s = s 2 = / 94

77 Example The interquartile range In order to calculate the interquartile range, we first calculate the lower and upper quartiles. n = 6, hence n+1 4 = The integer part of this number is 1. Hence, the lower quartile is Q 1 = 0.5[x (1) + x (2) ] = 0.5(6 + 8) = 7 Similarly, 3n+3 4 = The integer part of this number is 5. Hence, the upper quartile is Q 3 = 0.5[x (5) + x (6) ] = 0.5( ) = 11. Hence, IQR = 11 7 = / 94

78 Example The coefficient of variation C.V. = s x = 2 9. Suppose a variable is by definition positive, e.g. height, weight. A coefficient of variation above 1 is accepted to be very large (such variation may occur in the case of wages when wage inequality is high). With regard to the physical traits of people, values for the coefficient of variation of around 0.1 to 0.3 are common (in humans the coefficient of variation of height is around 0.1, the coefficient of variation for weight is somewhat bigger). 78 / 94

79 1.5 Measures of Location and Dispersion for Grouped Data - a) Discrete Random Variables A die was rolled 100 times and the following data were obtained Result No. of observations / 94

80 Grouped discrete data Suppose the possible results are {x 1, x 2,..., x k } and the result x i occurs f i times. The total number of observations is n = k f i. i=1 The sum of the observations is given by k x i f i. i=1 80 / 94

81 Grouped discrete data It follows that the sample mean is given by k i=1 x = f ix i n The variance of the observations is given by s 2 = 1 n 1 k f i (x i x) 2 i=1 81 / 94

82 Grouped discrete data The following table is useful in calculating the sample mean x i f i f i x i Hence, the sample mean is x = = / 94

83 Grouped discrete data Once the mean has been calculated, we can add two columns for (x i x) 2 and f i (x i x) 2 : x i f i f i x i (x i x) 2 f i (x i x) = = = = = = / 94

84 Grouped discrete data The sample variance is given by 1 n 1 k i=1 f i (x i x) 2 = = / 94

85 Calculation of the sample median for grouped discrete data In this case we know the exact values of the observations and hence we can order the data. In this way we can calculate the median. Since there are 100 observations, the median is Q 2 = 0.5[x (50) + x (51) ] 85 / 94

86 Calculation of the sample median for grouped discrete data The 15 smallest observations are equal to 1 i.e. x (1) = x (2) =... = x (15) = 1. The next 18 smallest observations are equal to 2 i.e. x (16) = x (17) =... = x (33) = 2. The next 20 smallest observations are all equal to 3 i.e. x (34) = x (35) =... = x (53) = / 94

87 Calculation of the sample median for grouped discrete data It follows that Hence, x (50) = x (51) = 3. Q 2 = 0.5[x (50) + x (51) ] = / 94

88 1.5 Measures of Location and Dispersion for Grouped Data - a) Continuous Random Variables In such cases we have data grouped into intervals. Let x i be the centre of the i-th interval and f i the number of observations in the i-th interval. The approach to calculating the sample mean and variance is the same as in the case of discrete data. In order to carry out the calculations, we assume that each observation is in the middle of the appropriate interval. 88 / 94

89 Example 1.4 Consider the grouped data from Example 1.2 Height (x) x i f i f i x i 150 x < x < x < x < x Thus, the sample mean is x = = / 94

90 Example 1.4 Now we can add the remaining 2 columns of the table. x i f i f i x i (x i x) 2 f i (x i x) = = = = The variance is = The standard deviation is = / 94

91 Estimating the median for grouped continuous data Now we consider a more accurate method of estimating the median than the graphical method presented earlier using the OGIVE. The first step is to find the interval in which the median lies. This is the interval in which the cumulative relative frequency (crf) equals 0.5. Since crf(170)=0.35<0.5 and crf(180)=0.7>0.5, the median lies in the interval (170,180). 91 / 94

92 Estimating the median for grouped continuous data The median can now be estimated using geometry. Consider the graph of the OGIVE on the interval (170, 180). We know the height of the OGIVE at 170 and 180 and know that at the median the height must be 0.5. We can construct the following pair of similar triangles. 0.7 c.r.f. h 2 = h 1 = Q 2 = y Height / 94

93 Estimating the median for grouped continuous data The height of the large triangle is h 2 = 0.35 (the proportion of observations in that interval). The length of the large triangle is the length of the interval d = 10 The cumulative relative frequency (c.r.f.) at the median is 0.5. The height of the small triangle is 0.5 minus the c.r.f. at the lower endpoint. Thus, h 1 = The median is equal to y + 170, where y is the length of the small triangle (170 is the lower endpoint of the interval). 93 / 94

94 Estimating the median for grouped continuous data Since the triangles are similar the ratio of the length to height is constant, i.e. h 2 d = h 1 y. Hence, y = h 1d = 4.3. h It follows that the median is approximately Q 2 = = / 94

Descriptive statistics Statistical inference statistical inference, statistical induction and inferential statistics

Descriptive statistics Statistical inference statistical inference, statistical induction and inferential statistics Descriptive statistics is the discipline of quantitatively describing the main features of a collection of data. Descriptive statistics are distinguished from inferential statistics (or inductive statistics),

More information

STATS8: Introduction to Biostatistics. Data Exploration. Babak Shahbaba Department of Statistics, UCI

STATS8: Introduction to Biostatistics. Data Exploration. Babak Shahbaba Department of Statistics, UCI STATS8: Introduction to Biostatistics Data Exploration Babak Shahbaba Department of Statistics, UCI Introduction After clearly defining the scientific problem, selecting a set of representative members

More information

Exploratory data analysis (Chapter 2) Fall 2011

Exploratory data analysis (Chapter 2) Fall 2011 Exploratory data analysis (Chapter 2) Fall 2011 Data Examples Example 1: Survey Data 1 Data collected from a Stat 371 class in Fall 2005 2 They answered questions about their: gender, major, year in school,

More information

4. Continuous Random Variables, the Pareto and Normal Distributions

4. Continuous Random Variables, the Pareto and Normal Distributions 4. Continuous Random Variables, the Pareto and Normal Distributions A continuous random variable X can take any value in a given range (e.g. height, weight, age). The distribution of a continuous random

More information

Introduction to Statistics for Psychology. Quantitative Methods for Human Sciences

Introduction to Statistics for Psychology. Quantitative Methods for Human Sciences Introduction to Statistics for Psychology and Quantitative Methods for Human Sciences Jonathan Marchini Course Information There is website devoted to the course at http://www.stats.ox.ac.uk/ marchini/phs.html

More information

HISTOGRAMS, CUMULATIVE FREQUENCY AND BOX PLOTS

HISTOGRAMS, CUMULATIVE FREQUENCY AND BOX PLOTS Mathematics Revision Guides Histograms, Cumulative Frequency and Box Plots Page 1 of 25 M.K. HOME TUITION Mathematics Revision Guides Level: GCSE Higher Tier HISTOGRAMS, CUMULATIVE FREQUENCY AND BOX PLOTS

More information

Variables. Exploratory Data Analysis

Variables. Exploratory Data Analysis Exploratory Data Analysis Exploratory Data Analysis involves both graphical displays of data and numerical summaries of data. A common situation is for a data set to be represented as a matrix. There is

More information

6.4 Normal Distribution

6.4 Normal Distribution Contents 6.4 Normal Distribution....................... 381 6.4.1 Characteristics of the Normal Distribution....... 381 6.4.2 The Standardized Normal Distribution......... 385 6.4.3 Meaning of Areas under

More information

Expression. Variable Equation Polynomial Monomial Add. Area. Volume Surface Space Length Width. Probability. Chance Random Likely Possibility Odds

Expression. Variable Equation Polynomial Monomial Add. Area. Volume Surface Space Length Width. Probability. Chance Random Likely Possibility Odds Isosceles Triangle Congruent Leg Side Expression Equation Polynomial Monomial Radical Square Root Check Times Itself Function Relation One Domain Range Area Volume Surface Space Length Width Quantitative

More information

Probability and Statistics Vocabulary List (Definitions for Middle School Teachers)

Probability and Statistics Vocabulary List (Definitions for Middle School Teachers) Probability and Statistics Vocabulary List (Definitions for Middle School Teachers) B Bar graph a diagram representing the frequency distribution for nominal or discrete data. It consists of a sequence

More information

Descriptive Statistics

Descriptive Statistics Y520 Robert S Michael Goal: Learn to calculate indicators and construct graphs that summarize and describe a large quantity of values. Using the textbook readings and other resources listed on the web

More information

MBA 611 STATISTICS AND QUANTITATIVE METHODS

MBA 611 STATISTICS AND QUANTITATIVE METHODS MBA 611 STATISTICS AND QUANTITATIVE METHODS Part I. Review of Basic Statistics (Chapters 1-11) A. Introduction (Chapter 1) Uncertainty: Decisions are often based on incomplete information from uncertain

More information

Exercise 1.12 (Pg. 22-23)

Exercise 1.12 (Pg. 22-23) Individuals: The objects that are described by a set of data. They may be people, animals, things, etc. (Also referred to as Cases or Records) Variables: The characteristics recorded about each individual.

More information

Northumberland Knowledge

Northumberland Knowledge Northumberland Knowledge Know Guide How to Analyse Data - November 2012 - This page has been left blank 2 About this guide The Know Guides are a suite of documents that provide useful information about

More information

Diagrams and Graphs of Statistical Data

Diagrams and Graphs of Statistical Data Diagrams and Graphs of Statistical Data One of the most effective and interesting alternative way in which a statistical data may be presented is through diagrams and graphs. There are several ways in

More information

Center: Finding the Median. Median. Spread: Home on the Range. Center: Finding the Median (cont.)

Center: Finding the Median. Median. Spread: Home on the Range. Center: Finding the Median (cont.) Center: Finding the Median When we think of a typical value, we usually look for the center of the distribution. For a unimodal, symmetric distribution, it s easy to find the center it s just the center

More information

The right edge of the box is the third quartile, Q 3, which is the median of the data values above the median. Maximum Median

The right edge of the box is the third quartile, Q 3, which is the median of the data values above the median. Maximum Median CONDENSED LESSON 2.1 Box Plots In this lesson you will create and interpret box plots for sets of data use the interquartile range (IQR) to identify potential outliers and graph them on a modified box

More information

BNG 202 Biomechanics Lab. Descriptive statistics and probability distributions I

BNG 202 Biomechanics Lab. Descriptive statistics and probability distributions I BNG 202 Biomechanics Lab Descriptive statistics and probability distributions I Overview The overall goal of this short course in statistics is to provide an introduction to descriptive and inferential

More information

Pie Charts. proportion of ice-cream flavors sold annually by a given brand. AMS-5: Statistics. Cherry. Cherry. Blueberry. Blueberry. Apple.

Pie Charts. proportion of ice-cream flavors sold annually by a given brand. AMS-5: Statistics. Cherry. Cherry. Blueberry. Blueberry. Apple. Graphical Representations of Data, Mean, Median and Standard Deviation In this class we will consider graphical representations of the distribution of a set of data. The goal is to identify the range of

More information

Descriptive Statistics. Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion

Descriptive Statistics. Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion Descriptive Statistics Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion Statistics as a Tool for LIS Research Importance of statistics in research

More information

Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization. Learning Goals. GENOME 560, Spring 2012

Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization. Learning Goals. GENOME 560, Spring 2012 Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization GENOME 560, Spring 2012 Data are interesting because they help us understand the world Genomics: Massive Amounts

More information

3: Summary Statistics

3: Summary Statistics 3: Summary Statistics Notation Let s start by introducing some notation. Consider the following small data set: 4 5 30 50 8 7 4 5 The symbol n represents the sample size (n = 0). The capital letter X denotes

More information

Describing, Exploring, and Comparing Data

Describing, Exploring, and Comparing Data 24 Chapter 2. Describing, Exploring, and Comparing Data Chapter 2. Describing, Exploring, and Comparing Data There are many tools used in Statistics to visualize, summarize, and describe data. This chapter

More information

DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.

DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses. DESCRIPTIVE STATISTICS The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses. DESCRIPTIVE VS. INFERENTIAL STATISTICS Descriptive To organize,

More information

Lecture 2: Descriptive Statistics and Exploratory Data Analysis

Lecture 2: Descriptive Statistics and Exploratory Data Analysis Lecture 2: Descriptive Statistics and Exploratory Data Analysis Further Thoughts on Experimental Design 16 Individuals (8 each from two populations) with replicates Pop 1 Pop 2 Randomly sample 4 individuals

More information

Lecture 1: Review and Exploratory Data Analysis (EDA)

Lecture 1: Review and Exploratory Data Analysis (EDA) Lecture 1: Review and Exploratory Data Analysis (EDA) Sandy Eckel seckel@jhsph.edu Department of Biostatistics, The Johns Hopkins University, Baltimore USA 21 April 2008 1 / 40 Course Information I Course

More information

DESCRIPTIVE STATISTICS - CHAPTERS 1 & 2 1

DESCRIPTIVE STATISTICS - CHAPTERS 1 & 2 1 DESCRIPTIVE STATISTICS - CHAPTERS 1 & 2 1 OVERVIEW STATISTICS PANIK...THE THEORY AND METHODS OF COLLECTING, ORGANIZING, PRESENTING, ANALYZING, AND INTERPRETING DATA SETS SO AS TO DETERMINE THEIR ESSENTIAL

More information

Week 1. Exploratory Data Analysis

Week 1. Exploratory Data Analysis Week 1 Exploratory Data Analysis Practicalities This course ST903 has students from both the MSc in Financial Mathematics and the MSc in Statistics. Two lectures and one seminar/tutorial per week. Exam

More information

Algebra 1 2008. Academic Content Standards Grade Eight and Grade Nine Ohio. Grade Eight. Number, Number Sense and Operations Standard

Algebra 1 2008. Academic Content Standards Grade Eight and Grade Nine Ohio. Grade Eight. Number, Number Sense and Operations Standard Academic Content Standards Grade Eight and Grade Nine Ohio Algebra 1 2008 Grade Eight STANDARDS Number, Number Sense and Operations Standard Number and Number Systems 1. Use scientific notation to express

More information

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4) Summary of Formulas and Concepts Descriptive Statistics (Ch. 1-4) Definitions Population: The complete set of numerical information on a particular quantity in which an investigator is interested. We assume

More information

Chapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs

Chapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs Types of Variables Chapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs Quantitative (numerical)variables: take numerical values for which arithmetic operations make sense (addition/averaging)

More information

Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY

Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY 1. Introduction Besides arriving at an appropriate expression of an average or consensus value for observations of a population, it is important to

More information

Exploratory Data Analysis

Exploratory Data Analysis Exploratory Data Analysis Johannes Schauer johannes.schauer@tugraz.at Institute of Statistics Graz University of Technology Steyrergasse 17/IV, 8010 Graz www.statistics.tugraz.at February 12, 2008 Introduction

More information

Glencoe. correlated to SOUTH CAROLINA MATH CURRICULUM STANDARDS GRADE 6 3-3, 5-8 8-4, 8-7 1-6, 4-9

Glencoe. correlated to SOUTH CAROLINA MATH CURRICULUM STANDARDS GRADE 6 3-3, 5-8 8-4, 8-7 1-6, 4-9 Glencoe correlated to SOUTH CAROLINA MATH CURRICULUM STANDARDS GRADE 6 STANDARDS 6-8 Number and Operations (NO) Standard I. Understand numbers, ways of representing numbers, relationships among numbers,

More information

2 Describing, Exploring, and

2 Describing, Exploring, and 2 Describing, Exploring, and Comparing Data This chapter introduces the graphical plotting and summary statistics capabilities of the TI- 83 Plus. First row keys like \ R (67$73/276 are used to obtain

More information

AP STATISTICS REVIEW (YMS Chapters 1-8)

AP STATISTICS REVIEW (YMS Chapters 1-8) AP STATISTICS REVIEW (YMS Chapters 1-8) Exploring Data (Chapter 1) Categorical Data nominal scale, names e.g. male/female or eye color or breeds of dogs Quantitative Data rational scale (can +,,, with

More information

The correlation coefficient

The correlation coefficient The correlation coefficient Clinical Biostatistics The correlation coefficient Martin Bland Correlation coefficients are used to measure the of the relationship or association between two quantitative

More information

In mathematics, there are four attainment targets: using and applying mathematics; number and algebra; shape, space and measures, and handling data.

In mathematics, there are four attainment targets: using and applying mathematics; number and algebra; shape, space and measures, and handling data. MATHEMATICS: THE LEVEL DESCRIPTIONS In mathematics, there are four attainment targets: using and applying mathematics; number and algebra; shape, space and measures, and handling data. Attainment target

More information

Using SPSS, Chapter 2: Descriptive Statistics

Using SPSS, Chapter 2: Descriptive Statistics 1 Using SPSS, Chapter 2: Descriptive Statistics Chapters 2.1 & 2.2 Descriptive Statistics 2 Mean, Standard Deviation, Variance, Range, Minimum, Maximum 2 Mean, Median, Mode, Standard Deviation, Variance,

More information

Session 7 Bivariate Data and Analysis

Session 7 Bivariate Data and Analysis Session 7 Bivariate Data and Analysis Key Terms for This Session Previously Introduced mean standard deviation New in This Session association bivariate analysis contingency table co-variation least squares

More information

Pennsylvania System of School Assessment

Pennsylvania System of School Assessment Pennsylvania System of School Assessment The Assessment Anchors, as defined by the Eligible Content, are organized into cohesive blueprints, each structured with a common labeling system that can be read

More information

The Big Picture. Describing Data: Categorical and Quantitative Variables Population. Descriptive Statistics. Community Coalitions (n = 175)

The Big Picture. Describing Data: Categorical and Quantitative Variables Population. Descriptive Statistics. Community Coalitions (n = 175) Describing Data: Categorical and Quantitative Variables Population The Big Picture Sampling Statistical Inference Sample Exploratory Data Analysis Descriptive Statistics In order to make sense of data,

More information

THE BINOMIAL DISTRIBUTION & PROBABILITY

THE BINOMIAL DISTRIBUTION & PROBABILITY REVISION SHEET STATISTICS 1 (MEI) THE BINOMIAL DISTRIBUTION & PROBABILITY The main ideas in this chapter are Probabilities based on selecting or arranging objects Probabilities based on the binomial distribution

More information

MATHS LEVEL DESCRIPTORS

MATHS LEVEL DESCRIPTORS MATHS LEVEL DESCRIPTORS Number Level 3 Understand the place value of numbers up to thousands. Order numbers up to 9999. Round numbers to the nearest 10 or 100. Understand the number line below zero, and

More information

Common Core Unit Summary Grades 6 to 8

Common Core Unit Summary Grades 6 to 8 Common Core Unit Summary Grades 6 to 8 Grade 8: Unit 1: Congruence and Similarity- 8G1-8G5 rotations reflections and translations,( RRT=congruence) understand congruence of 2 d figures after RRT Dilations

More information

Data Exploration Data Visualization

Data Exploration Data Visualization Data Exploration Data Visualization What is data exploration? A preliminary exploration of the data to better understand its characteristics. Key motivations of data exploration include Helping to select

More information

CALCULATIONS & STATISTICS

CALCULATIONS & STATISTICS CALCULATIONS & STATISTICS CALCULATION OF SCORES Conversion of 1-5 scale to 0-100 scores When you look at your report, you will notice that the scores are reported on a 0-100 scale, even though respondents

More information

CA200 Quantitative Analysis for Business Decisions. File name: CA200_Section_04A_StatisticsIntroduction

CA200 Quantitative Analysis for Business Decisions. File name: CA200_Section_04A_StatisticsIntroduction CA200 Quantitative Analysis for Business Decisions File name: CA200_Section_04A_StatisticsIntroduction Table of Contents 4. Introduction to Statistics... 1 4.1 Overview... 3 4.2 Discrete or continuous

More information

Means, standard deviations and. and standard errors

Means, standard deviations and. and standard errors CHAPTER 4 Means, standard deviations and standard errors 4.1 Introduction Change of units 4.2 Mean, median and mode Coefficient of variation 4.3 Measures of variation 4.4 Calculating the mean and standard

More information

with functions, expressions and equations which follow in units 3 and 4.

with functions, expressions and equations which follow in units 3 and 4. Grade 8 Overview View unit yearlong overview here The unit design was created in line with the areas of focus for grade 8 Mathematics as identified by the Common Core State Standards and the PARCC Model

More information

Pre-Algebra 2008. Academic Content Standards Grade Eight Ohio. Number, Number Sense and Operations Standard. Number and Number Systems

Pre-Algebra 2008. Academic Content Standards Grade Eight Ohio. Number, Number Sense and Operations Standard. Number and Number Systems Academic Content Standards Grade Eight Ohio Pre-Algebra 2008 STANDARDS Number, Number Sense and Operations Standard Number and Number Systems 1. Use scientific notation to express large numbers and small

More information

Probability Distributions

Probability Distributions CHAPTER 5 Probability Distributions CHAPTER OUTLINE 5.1 Probability Distribution of a Discrete Random Variable 5.2 Mean and Standard Deviation of a Probability Distribution 5.3 The Binomial Distribution

More information

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013 Statistics I for QBIC Text Book: Biostatistics, 10 th edition, by Daniel & Cross Contents and Objectives Chapters 1 7 Revised: August 2013 Chapter 1: Nature of Statistics (sections 1.1-1.6) Objectives

More information

Foundation of Quantitative Data Analysis

Foundation of Quantitative Data Analysis Foundation of Quantitative Data Analysis Part 1: Data manipulation and descriptive statistics with SPSS/Excel HSRS #10 - October 17, 2013 Reference : A. Aczel, Complete Business Statistics. Chapters 1

More information

Module 4: Data Exploration

Module 4: Data Exploration Module 4: Data Exploration Now that you have your data downloaded from the Streams Project database, the detective work can begin! Before computing any advanced statistics, we will first use descriptive

More information

1.3 Measuring Center & Spread, The Five Number Summary & Boxplots. Describing Quantitative Data with Numbers

1.3 Measuring Center & Spread, The Five Number Summary & Boxplots. Describing Quantitative Data with Numbers 1.3 Measuring Center & Spread, The Five Number Summary & Boxplots Describing Quantitative Data with Numbers 1.3 I can n Calculate and interpret measures of center (mean, median) in context. n Calculate

More information

How To Write A Data Analysis

How To Write A Data Analysis Mathematics Probability and Statistics Curriculum Guide Revised 2010 This page is intentionally left blank. Introduction The Mathematics Curriculum Guide serves as a guide for teachers when planning instruction

More information

Def: The standard normal distribution is a normal probability distribution that has a mean of 0 and a standard deviation of 1.

Def: The standard normal distribution is a normal probability distribution that has a mean of 0 and a standard deviation of 1. Lecture 6: Chapter 6: Normal Probability Distributions A normal distribution is a continuous probability distribution for a random variable x. The graph of a normal distribution is called the normal curve.

More information

STA-201-TE. 5. Measures of relationship: correlation (5%) Correlation coefficient; Pearson r; correlation and causation; proportion of common variance

STA-201-TE. 5. Measures of relationship: correlation (5%) Correlation coefficient; Pearson r; correlation and causation; proportion of common variance Principles of Statistics STA-201-TE This TECEP is an introduction to descriptive and inferential statistics. Topics include: measures of central tendency, variability, correlation, regression, hypothesis

More information

Descriptive Statistics

Descriptive Statistics Descriptive Statistics Suppose following data have been collected (heights of 99 five-year-old boys) 117.9 11.2 112.9 115.9 18. 14.6 17.1 117.9 111.8 16.3 111. 1.4 112.1 19.2 11. 15.4 99.4 11.1 13.3 16.9

More information

Good luck! BUSINESS STATISTICS FINAL EXAM INSTRUCTIONS. Name:

Good luck! BUSINESS STATISTICS FINAL EXAM INSTRUCTIONS. Name: Glo bal Leadership M BA BUSINESS STATISTICS FINAL EXAM Name: INSTRUCTIONS 1. Do not open this exam until instructed to do so. 2. Be sure to fill in your name before starting the exam. 3. You have two hours

More information

CAMI Education linked to CAPS: Mathematics

CAMI Education linked to CAPS: Mathematics - 1 - TOPIC 1.1 Whole numbers _CAPS curriculum TERM 1 CONTENT Mental calculations Revise: Multiplication of whole numbers to at least 12 12 Ordering and comparing whole numbers Revise prime numbers to

More information

STAT355 - Probability & Statistics

STAT355 - Probability & Statistics STAT355 - Probability & Statistics Instructor: Kofi Placid Adragni Fall 2011 Chap 1 - Overview and Descriptive Statistics 1.1 Populations, Samples, and Processes 1.2 Pictorial and Tabular Methods in Descriptive

More information

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number 1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number A. 3(x - x) B. x 3 x C. 3x - x D. x - 3x 2) Write the following as an algebraic expression

More information

Chapter 4. Probability and Probability Distributions

Chapter 4. Probability and Probability Distributions Chapter 4. robability and robability Distributions Importance of Knowing robability To know whether a sample is not identical to the population from which it was selected, it is necessary to assess the

More information

Basics of Statistics

Basics of Statistics Basics of Statistics Jarkko Isotalo 30 20 10 Std. Dev = 486.32 Mean = 3553.8 0 N = 120.00 2400.0 2800.0 3200.0 3600.0 4000.0 4400.0 4800.0 2600.0 3000.0 3400.0 3800.0 4200.0 4600.0 5000.0 Birthweights

More information

Introduction to Quantitative Methods

Introduction to Quantitative Methods Introduction to Quantitative Methods October 15, 2009 Contents 1 Definition of Key Terms 2 2 Descriptive Statistics 3 2.1 Frequency Tables......................... 4 2.2 Measures of Central Tendencies.................

More information

MATH BOOK OF PROBLEMS SERIES. New from Pearson Custom Publishing!

MATH BOOK OF PROBLEMS SERIES. New from Pearson Custom Publishing! MATH BOOK OF PROBLEMS SERIES New from Pearson Custom Publishing! The Math Book of Problems Series is a database of math problems for the following courses: Pre-algebra Algebra Pre-calculus Calculus Statistics

More information

EXAM #1 (Example) Instructor: Ela Jackiewicz. Relax and good luck!

EXAM #1 (Example) Instructor: Ela Jackiewicz. Relax and good luck! STP 231 EXAM #1 (Example) Instructor: Ela Jackiewicz Honor Statement: I have neither given nor received information regarding this exam, and I will not do so until all exams have been graded and returned.

More information

Math 1. Month Essential Questions Concepts/Skills/Standards Content Assessment Areas of Interaction

Math 1. Month Essential Questions Concepts/Skills/Standards Content Assessment Areas of Interaction Binghamton High School Rev.9/21/05 Math 1 September What is the unknown? Model relationships by using Fundamental skills of 2005 variables as a shorthand way Algebra Why do we use variables? What is a

More information

Intro to Statistics 8 Curriculum

Intro to Statistics 8 Curriculum Intro to Statistics 8 Curriculum Unit 1 Bar, Line and Circle Graphs Estimated time frame for unit Big Ideas 8 Days... Essential Question Concepts Competencies Lesson Plans and Suggested Resources Bar graphs

More information

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics. Business Course Text Bowerman, Bruce L., Richard T. O'Connell, J. B. Orris, and Dawn C. Porter. Essentials of Business, 2nd edition, McGraw-Hill/Irwin, 2008, ISBN: 978-0-07-331988-9. Required Computing

More information

Descriptive Statistics and Measurement Scales

Descriptive Statistics and Measurement Scales Descriptive Statistics 1 Descriptive Statistics and Measurement Scales Descriptive statistics are used to describe the basic features of the data in a study. They provide simple summaries about the sample

More information

business statistics using Excel OXFORD UNIVERSITY PRESS Glyn Davis & Branko Pecar

business statistics using Excel OXFORD UNIVERSITY PRESS Glyn Davis & Branko Pecar business statistics using Excel Glyn Davis & Branko Pecar OXFORD UNIVERSITY PRESS Detailed contents Introduction to Microsoft Excel 2003 Overview Learning Objectives 1.1 Introduction to Microsoft Excel

More information

Big Ideas in Mathematics

Big Ideas in Mathematics Big Ideas in Mathematics which are important to all mathematics learning. (Adapted from the NCTM Curriculum Focal Points, 2006) The Mathematics Big Ideas are organized using the PA Mathematics Standards

More information

Characteristics of Binomial Distributions

Characteristics of Binomial Distributions Lesson2 Characteristics of Binomial Distributions In the last lesson, you constructed several binomial distributions, observed their shapes, and estimated their means and standard deviations. In Investigation

More information

Describing and presenting data

Describing and presenting data Describing and presenting data All epidemiological studies involve the collection of data on the exposures and outcomes of interest. In a well planned study, the raw observations that constitute the data

More information

Mathematical Conventions Large Print (18 point) Edition

Mathematical Conventions Large Print (18 point) Edition GRADUATE RECORD EXAMINATIONS Mathematical Conventions Large Print (18 point) Edition Copyright 2010 by Educational Testing Service. All rights reserved. ETS, the ETS logo, GRADUATE RECORD EXAMINATIONS,

More information

Introduction to Environmental Statistics. The Big Picture. Populations and Samples. Sample Data. Examples of sample data

Introduction to Environmental Statistics. The Big Picture. Populations and Samples. Sample Data. Examples of sample data A Few Sources for Data Examples Used Introduction to Environmental Statistics Professor Jessica Utts University of California, Irvine jutts@uci.edu 1. Statistical Methods in Water Resources by D.R. Helsel

More information

Measurement with Ratios

Measurement with Ratios Grade 6 Mathematics, Quarter 2, Unit 2.1 Measurement with Ratios Overview Number of instructional days: 15 (1 day = 45 minutes) Content to be learned Use ratio reasoning to solve real-world and mathematical

More information

DATA INTERPRETATION AND STATISTICS

DATA INTERPRETATION AND STATISTICS PholC60 September 001 DATA INTERPRETATION AND STATISTICS Books A easy and systematic introductory text is Essentials of Medical Statistics by Betty Kirkwood, published by Blackwell at about 14. DESCRIPTIVE

More information

Statistics Revision Sheet Question 6 of Paper 2

Statistics Revision Sheet Question 6 of Paper 2 Statistics Revision Sheet Question 6 of Paper The Statistics question is concerned mainly with the following terms. The Mean and the Median and are two ways of measuring the average. sumof values no. of

More information

Biggar High School Mathematics Department. National 5 Learning Intentions & Success Criteria: Assessing My Progress

Biggar High School Mathematics Department. National 5 Learning Intentions & Success Criteria: Assessing My Progress Biggar High School Mathematics Department National 5 Learning Intentions & Success Criteria: Assessing My Progress Expressions & Formulae Topic Learning Intention Success Criteria I understand this Approximation

More information

AP Statistics Solutions to Packet 2

AP Statistics Solutions to Packet 2 AP Statistics Solutions to Packet 2 The Normal Distributions Density Curves and the Normal Distribution Standard Normal Calculations HW #9 1, 2, 4, 6-8 2.1 DENSITY CURVES (a) Sketch a density curve that

More information

Grade 6 Mathematics Assessment. Eligible Texas Essential Knowledge and Skills

Grade 6 Mathematics Assessment. Eligible Texas Essential Knowledge and Skills Grade 6 Mathematics Assessment Eligible Texas Essential Knowledge and Skills STAAR Grade 6 Mathematics Assessment Mathematical Process Standards These student expectations will not be listed under a separate

More information

Exploratory Data Analysis. Psychology 3256

Exploratory Data Analysis. Psychology 3256 Exploratory Data Analysis Psychology 3256 1 Introduction If you are going to find out anything about a data set you must first understand the data Basically getting a feel for you numbers Easier to find

More information

BASIC STATISTICAL METHODS FOR GENOMIC DATA ANALYSIS

BASIC STATISTICAL METHODS FOR GENOMIC DATA ANALYSIS BASIC STATISTICAL METHODS FOR GENOMIC DATA ANALYSIS SEEMA JAGGI Indian Agricultural Statistics Research Institute Library Avenue, New Delhi-110 012 seema@iasri.res.in Genomics A genome is an organism s

More information

DesCartes (Combined) Subject: Mathematics Goal: Statistics and Probability

DesCartes (Combined) Subject: Mathematics Goal: Statistics and Probability DesCartes (Combined) Subject: Mathematics Goal: Statistics and Probability RIT Score Range: Below 171 Below 171 Data Analysis and Statistics Solves simple problems based on data from tables* Compares

More information

6. Decide which method of data collection you would use to collect data for the study (observational study, experiment, simulation, or survey):

6. Decide which method of data collection you would use to collect data for the study (observational study, experiment, simulation, or survey): MATH 1040 REVIEW (EXAM I) Chapter 1 1. For the studies described, identify the population, sample, population parameters, and sample statistics: a) The Gallup Organization conducted a poll of 1003 Americans

More information

Statistics 100 Sample Final Questions (Note: These are mostly multiple choice, for extra practice. Your Final Exam will NOT have any multiple choice!

Statistics 100 Sample Final Questions (Note: These are mostly multiple choice, for extra practice. Your Final Exam will NOT have any multiple choice! Statistics 100 Sample Final Questions (Note: These are mostly multiple choice, for extra practice. Your Final Exam will NOT have any multiple choice!) Part A - Multiple Choice Indicate the best choice

More information

CHAPTER THREE. Key Concepts

CHAPTER THREE. Key Concepts CHAPTER THREE Key Concepts interval, ordinal, and nominal scale quantitative, qualitative continuous data, categorical or discrete data table, frequency distribution histogram, bar graph, frequency polygon,

More information

Simple linear regression

Simple linear regression Simple linear regression Introduction Simple linear regression is a statistical method for obtaining a formula to predict values of one variable from another where there is a causal relationship between

More information

Prentice Hall Mathematics Courses 1-3 Common Core Edition 2013

Prentice Hall Mathematics Courses 1-3 Common Core Edition 2013 A Correlation of Prentice Hall Mathematics Courses 1-3 Common Core Edition 2013 to the Topics & Lessons of Pearson A Correlation of Courses 1, 2 and 3, Common Core Introduction This document demonstrates

More information

A Picture Really Is Worth a Thousand Words

A Picture Really Is Worth a Thousand Words 4 A Picture Really Is Worth a Thousand Words Difficulty Scale (pretty easy, but not a cinch) What you ll learn about in this chapter Why a picture is really worth a thousand words How to create a histogram

More information

Course Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics

Course Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics Course Text Business Statistics Lind, Douglas A., Marchal, William A. and Samuel A. Wathen. Basic Statistics for Business and Economics, 7th edition, McGraw-Hill/Irwin, 2010, ISBN: 9780077384470 [This

More information

Chapter 3 RANDOM VARIATE GENERATION

Chapter 3 RANDOM VARIATE GENERATION Chapter 3 RANDOM VARIATE GENERATION In order to do a Monte Carlo simulation either by hand or by computer, techniques must be developed for generating values of random variables having known distributions.

More information

Summarizing and Displaying Categorical Data

Summarizing and Displaying Categorical Data Summarizing and Displaying Categorical Data Categorical data can be summarized in a frequency distribution which counts the number of cases, or frequency, that fall into each category, or a relative frequency

More information

Numeracy and mathematics Experiences and outcomes

Numeracy and mathematics Experiences and outcomes Numeracy and mathematics Experiences and outcomes My learning in mathematics enables me to: develop a secure understanding of the concepts, principles and processes of mathematics and apply these in different

More information

Mathematics (Project Maths Phase 1)

Mathematics (Project Maths Phase 1) 2012. M128 S Coimisiún na Scrúduithe Stáit State Examinations Commission Leaving Certificate Examination, 2012 Sample Paper Mathematics (Project Maths Phase 1) Paper 2 Ordinary Level Time: 2 hours, 30

More information

99.37, 99.38, 99.38, 99.39, 99.39, 99.39, 99.39, 99.40, 99.41, 99.42 cm

99.37, 99.38, 99.38, 99.39, 99.39, 99.39, 99.39, 99.40, 99.41, 99.42 cm Error Analysis and the Gaussian Distribution In experimental science theory lives or dies based on the results of experimental evidence and thus the analysis of this evidence is a critical part of the

More information