Statistics: Descriptive Statistics & Probability

Size: px
Start display at page:

Download "Statistics: Descriptive Statistics & Probability"

Transcription

1 Statistics: Descriptive Statistics & Mediterranean Agronomic Institute of Chania & University of Crete MS.c Program Business Economics and Management 7 October 2013, Vol. 1.1 Statistics: Descriptive Statistics &

2 Description 1 2 Basic Notions of 3 Statistics: Descriptive Statistics &

3 Description 1 2 Basic Notions of 3 Statistics: Descriptive Statistics &

4 Describing the Statistical Problem Definition Statistics is a mathematical branch of science that deals with uncertain (or random) phenomena with the help of sampling. Phenomena 1 Random (or Uncertain): Outcome of tossing a coin, the outcome on betting, maximum car speed of a car, daily rain percipitation, number of daily births, return of a stock index, the level of sales, number of student attendence, etc 2 Non-Random (Certain): The sunrise, the sunset, gravity on earth, etc 3 Chaotic: Extreme financial events, earthquakes, tsunami, etc Statistics: Descriptive Statistics &

5 Describing the Statistical Problem Definition Statistics is a mathematical branch of science that deals with uncertain (or random) phenomena with the help of sampling. Phenomena 1 Random (or Uncertain): Outcome of tossing a coin, the outcome on betting, maximum car speed of a car, daily rain percipitation, number of daily births, return of a stock index, the level of sales, number of student attendence, etc 2 Non-Random (Certain): The sunrise, the sunset, gravity on earth, etc 3 Chaotic: Extreme financial events, earthquakes, tsunami, etc Statistics: Descriptive Statistics &

6 Describing the Statistical Problem Definition Statistics is a mathematical branch of science that deals with uncertain (or random) phenomena with the help of sampling. Phenomena 1 Random (or Uncertain): Outcome of tossing a coin, the outcome on betting, maximum car speed of a car, daily rain percipitation, number of daily births, return of a stock index, the level of sales, number of student attendence, etc 2 Non-Random (Certain): The sunrise, the sunset, gravity on earth, etc 3 Chaotic: Extreme financial events, earthquakes, tsunami, etc Statistics: Descriptive Statistics &

7 Describing the Statistical Problem Definition Statistics is a mathematical branch of science that deals with uncertain (or random) phenomena with the help of sampling. Phenomena 1 Random (or Uncertain): Outcome of tossing a coin, the outcome on betting, maximum car speed of a car, daily rain percipitation, number of daily births, return of a stock index, the level of sales, number of student attendence, etc 2 Non-Random (Certain): The sunrise, the sunset, gravity on earth, etc 3 Chaotic: Extreme financial events, earthquakes, tsunami, etc Statistics: Descriptive Statistics &

8 beamer-tu-logo Quotations on randomness 1 Aristotle: The probable is what usually happens 2 Democritus: Everything existing in the universe is the fruit of chance 3 Plato (to Phaedon): I know too well that these arguments from probabilities are imposters, and unless great caution is observed in the use of them, they are apt to be deceptive 4 Heraclitus: There is nothing permanent except change 5 Descartes (in Discourse on Method): It is a truth very certain that when it is not in our power to determine what is true we ought to follow what is most probable Statistics: Descriptive Statistics &

9 beamer-tu-logo Random Variable (r.v.) Random variable is the result of a random experiment which is characterised by uncertainty 1 Number of heads when tossing a coin ten (10) times 2 The daily consumption of a person 3 The maximum car speed of a car 4 A stock return today 5 Rain precipitation on a day 6 The number of student attendence Sampling The statistical sampling through the collection of a representative number of random variables can talk about the statistical characteristics of this random variable Statistics: Descriptive Statistics &

10 beamer-tu-logo Random Variable (r.v.) Random variable is the result of a random experiment which is characterised by uncertainty 1 Number of heads when tossing a coin ten (10) times 2 The daily consumption of a person 3 The maximum car speed of a car 4 A stock return today 5 Rain precipitation on a day 6 The number of student attendence Sampling The statistical sampling through the collection of a representative number of random variables can talk about the statistical characteristics of this random variable Statistics: Descriptive Statistics &

11 Defining Sample and Population Sample Sample, is a smaller number (a subset) of the people or objects that exist within a population. Population Population is refereed to as the universe, this is the entire set of people or objects of interest. It could be: 1 Infinite tossing a coin experiments 2 All adult citizens in a country 3 All cars driven in a country 4 All stock returns 5 All places where rain happens etc. 6 All students in a country Statistics: Descriptive Statistics &

12 Defining Sample and Population Sample Sample, is a smaller number (a subset) of the people or objects that exist within a population. Population Population is refereed to as the universe, this is the entire set of people or objects of interest. It could be: 1 Infinite tossing a coin experiments 2 All adult citizens in a country 3 All cars driven in a country 4 All stock returns 5 All places where rain happens etc. 6 All students in a country Statistics: Descriptive Statistics &

13 Defining Sample and Population (cont.) Important!! A sample is said to be representative if its members tend to have the same characteristics (e.g., region, shopping behaviour, age, income, educational level) as the population from which they were selected. 1 For example, if 45% of the population consists of female drivers, we would like our sample to also include 45% females. 2 When a sample is so large as to include all members of the population, it is referred to as a complete census. Statistics: Descriptive Statistics &

14 Descriptive Statistics A simple definition In descriptive statistics, we simply summarize and describe the data we have collected. For example: 1 Observing the car speed at a specific location in an avenue you diagnose that the mean speed is 15.4 mph. Also the probability that a car will get speed at least 20 mph is 24%. This is descriptive statistics. You are merely describing the data that you have recorded!! 2 According to the Bureau of the Census, there has been an increase of 200% on the average UK gas consumption after the year Rain precipitation data from different location are characterised by large variation Statistics: Descriptive Statistics &

15 Descriptive Statistics A simple definition In descriptive statistics, we simply summarize and describe the data we have collected. For example: 1 Observing the car speed at a specific location in an avenue you diagnose that the mean speed is 15.4 mph. Also the probability that a car will get speed at least 20 mph is 24%. This is descriptive statistics. You are merely describing the data that you have recorded!! 2 According to the Bureau of the Census, there has been an increase of 200% on the average UK gas consumption after the year Rain precipitation data from different location are characterised by large variation Statistics: Descriptive Statistics &

16 Descriptive Statistics A simple definition In descriptive statistics, we simply summarize and describe the data we have collected. For example: 1 Observing the car speed at a specific location in an avenue you diagnose that the mean speed is 15.4 mph. Also the probability that a car will get speed at least 20 mph is 24%. This is descriptive statistics. You are merely describing the data that you have recorded!! 2 According to the Bureau of the Census, there has been an increase of 200% on the average UK gas consumption after the year Rain precipitation data from different location are characterised by large variation Statistics: Descriptive Statistics &

17 Descriptive Statistics A simple definition In descriptive statistics, we simply summarize and describe the data we have collected. For example: 1 Observing the car speed at a specific location in an avenue you diagnose that the mean speed is 15.4 mph. Also the probability that a car will get speed at least 20 mph is 24%. This is descriptive statistics. You are merely describing the data that you have recorded!! 2 According to the Bureau of the Census, there has been an increase of 200% on the average UK gas consumption after the year Rain precipitation data from different location are characterised by large variation Statistics: Descriptive Statistics &

18 Statistical Inference A simple definition In inferential statistics, sometimes referred to as inductive statistics, we go beyond mere description of the data and arrive at inferences regarding the phenomenon or phenomena for which sample data were obtained. For example: 1 Observing the car speed taken by so many cars, the circulation regulator may impose a more realistic speed limit. 2 Observing the average gas consumption, the ministry of energy may decide to turn its energy production to different directions in order to match future needs. 3 Due to observed large variation in rain percipitation data from different location, the Meteorology office decides to increase the number of locations where data is collected Statistics: Descriptive Statistics &

19 Statistical Inference A simple definition In inferential statistics, sometimes referred to as inductive statistics, we go beyond mere description of the data and arrive at inferences regarding the phenomenon or phenomena for which sample data were obtained. For example: 1 Observing the car speed taken by so many cars, the circulation regulator may impose a more realistic speed limit. 2 Observing the average gas consumption, the ministry of energy may decide to turn its energy production to different directions in order to match future needs. 3 Due to observed large variation in rain percipitation data from different location, the Meteorology office decides to increase the number of locations where data is collected Statistics: Descriptive Statistics &

20 Statistical Inference A simple definition In inferential statistics, sometimes referred to as inductive statistics, we go beyond mere description of the data and arrive at inferences regarding the phenomenon or phenomena for which sample data were obtained. For example: 1 Observing the car speed taken by so many cars, the circulation regulator may impose a more realistic speed limit. 2 Observing the average gas consumption, the ministry of energy may decide to turn its energy production to different directions in order to match future needs. 3 Due to observed large variation in rain percipitation data from different location, the Meteorology office decides to increase the number of locations where data is collected Statistics: Descriptive Statistics &

21 Statistical Inference A simple definition In inferential statistics, sometimes referred to as inductive statistics, we go beyond mere description of the data and arrive at inferences regarding the phenomenon or phenomena for which sample data were obtained. For example: 1 Observing the car speed taken by so many cars, the circulation regulator may impose a more realistic speed limit. 2 Observing the average gas consumption, the ministry of energy may decide to turn its energy production to different directions in order to match future needs. 3 Due to observed large variation in rain percipitation data from different location, the Meteorology office decides to increase the number of locations where data is collected Statistics: Descriptive Statistics &

22 Types Qualitative Qualitative data are words that cannot be defined by numbers. Some of the variables associated with people or objects are qualitative in nature, indicating that the person or object belongs in a category. For example: 1 You are either male or female 2 You are less than 25 years old or not 3 Your have a small or a large household 4 You are located in Crete or not 5 You are located in a mountainous area or not Statistics: Descriptive Statistics &

23 Types (cont.) Quantitative Quantitative data is collective data that can be measured by numbers. There are two types of quantitative variables: discrete and continuous. 1 Discrete quantitative variables can take on only certain values along an interval, with the possible values having gaps between them. Examples of discrete quantitative variables would be the number of employees on the payroll of a manufacturing firm, the number students attending a class, or the number of births are given per each calendar day. Discrete variables in business statistics usually consist of observations that we can count having integer values. 2 Continuous quantitative variables can take on a value at any point along an interval. For example, the stock index return can take at a given moment the value of or This will depend on the accuracy with which the volume can be measured. The possible values that could be taken on would have no gaps between them. Other examples of continuous quantitative variables are the car speed, the rain precipitation etc beamer-tu-logo Statistics: Descriptive Statistics &

24 Types (cont.) Quantitative Quantitative data is collective data that can be measured by numbers. There are two types of quantitative variables: discrete and continuous. 1 Discrete quantitative variables can take on only certain values along an interval, with the possible values having gaps between them. Examples of discrete quantitative variables would be the number of employees on the payroll of a manufacturing firm, the number students attending a class, or the number of births are given per each calendar day. Discrete variables in business statistics usually consist of observations that we can count having integer values. 2 Continuous quantitative variables can take on a value at any point along an interval. For example, the stock index return can take at a given moment the value of or This will depend on the accuracy with which the volume can be measured. The possible values that could be taken on would have no gaps between them. Other examples of continuous quantitative variables are the car speed, the rain precipitation etc beamer-tu-logo Statistics: Descriptive Statistics &

25 Types (cont.) Dummy Researchers sometimes convert Qualitative to Quantitative data using the so-called Dummy data. Examples: 1 For car speed data the Sex qualitative specification can take the answer YES for male motorists and the answer NO for female. This can be quantified in 1 for male and 0 for female. 2 For rain precipitation data the location qualitative specification can take the answer YES for mountainous location and the answer NO for the no-mountenous location. This can be quantified in 1 and 0 respectively. Statistics: Descriptive Statistics &

26 Types (cont.) Types of quantitative data 1 Cross-section data: These data might refer to people, companies, locations, countries given time t x 1,...,x N where N the total amount of Cross-sectional data given time t. (Important!!: Ordering of data does not matter) 2 Time-series data:these data might refer to people, companies, locations, countries collected in an an array of time interval given location l x 1,...,x T where T the total amount of time-series data given location l (Important!!: Ordering of data does matter) Statistics: Descriptive Statistics &

27 Types (cont.) transformation in quantitative data In various time-series data there is a necessity to take raw data from one source and then transform them into a different form for the empirical analysis 1 The percentage change of sales 2 The percentage change of a stock index Suppose, x 1,...,x T represent stock index time-series. Then for t {1,...,(T 1)} (x t+1 x t) x t 100 stands for the percentage change in this index. Statistics: Descriptive Statistics &

28 Values Table: Car Speed x i f i F i Statistics: Descriptive Statistics &

29 Classified Values Table: Car Speed x i f i F i [0 5] 2 2 [6 10] 7 9 [11 15] [16 20] [21 25] 7 50 Statistics: Descriptive Statistics &

30 Car Speed car speed car speed Frequency speed Statistics: Descriptive Statistics &

31 Advertisement Expenditure in 000,000 s Euro advertisement in thous. Euro advertisement in thous. Euro Frequency adv Statistics: Descriptive Statistics &

32 Gas consumption in UK (quarterly data for ) Gas consumption in UK Time Statistics: Descriptive Statistics &

33 Toyota car sales in Greece (monthly data for ) Toyota car sales in Greece (monthly) Time Statistics: Descriptive Statistics &

34 beamer-tu-logo Main Objectives of 1 Describe data using measures of location tendency 2 Describe data using measures of dispersion 3 Describe data using probability measures 4 Compare data of different measure using standardises techniques 5 Express relationship between two (2) different random variables Statistics: Descriptive Statistics &

35 beamer-tu-logo Main Objectives of 1 Describe data using measures of location tendency 2 Describe data using measures of dispersion 3 Describe data using probability measures 4 Compare data of different measure using standardises techniques 5 Express relationship between two (2) different random variables Statistics: Descriptive Statistics &

36 beamer-tu-logo Main Objectives of 1 Describe data using measures of location tendency 2 Describe data using measures of dispersion 3 Describe data using probability measures 4 Compare data of different measure using standardises techniques 5 Express relationship between two (2) different random variables Statistics: Descriptive Statistics &

37 beamer-tu-logo Main Objectives of 1 Describe data using measures of location tendency 2 Describe data using measures of dispersion 3 Describe data using probability measures 4 Compare data of different measure using standardises techniques 5 Express relationship between two (2) different random variables Statistics: Descriptive Statistics &

38 beamer-tu-logo Main Objectives of 1 Describe data using measures of location tendency 2 Describe data using measures of dispersion 3 Describe data using probability measures 4 Compare data of different measure using standardises techniques 5 Express relationship between two (2) different random variables Statistics: Descriptive Statistics &

39 beamer-tu-logo Main Objectives of 1 Describe data using measures of location tendency 2 Describe data using measures of dispersion 3 Describe data using probability measures 4 Compare data of different measure using standardises techniques 5 Express relationship between two (2) different random variables Statistics: Descriptive Statistics &

40 Mean Description Mean (or Mean Average) of expresses the arithmetic mean of a random variable for a given sample of our experiment How to estimate it? 1 For a sample of N observations x = 1 N N i=1 x i 2 For a sample of N = N i=1 f i N i=1 x = f ix i N i=1 f i with f i frequencies for each x i measure 3 For the populationµ x = 1 N N i=1 x i beamer-tu-logo Statistics: Descriptive Statistics &

41 Variance Description Variance (or Volatility) expresses a measure of a random variable dispension from the mean How to estimate it? 1 For a sample of N observations (N 30) S 2 = 1 N N (x i x) 2 i=1 2 For a sample N < 30 S 2 = 1 N 1 N (x i x) 2 i=1 Statistics: Descriptive Statistics &

42 Variance How to estimate it? (cont.) 1 For the population (and for samples with N 30) σ 2 = 1 N N (x i µ) 2 1 N i=1 N i=1 x 2 i µ 2 Standard Deviation 1 For the sample 2 For the population S = S 2 σ = σ 2 Statistics: Descriptive Statistics &

43 Car Speed data Estimate mean and variance when using Discrete Values x = 15.4 S 2 = 26 How to estimate mean and variance when using Classified Values 1 Set median values for x i, like z i. These are: {3, 8, 13, 18, 23} 2 Set new z and S 2 values such as: N i=1 z = f iz i N i=1 f i = 15 S 2 = 1 N N zi 2 f i z 2 = 26 i=1 beamer-tu-logo Statistics: Descriptive Statistics &

44 Median (M) Description Median expresses a measure of location tendency that is assigned to a value of the random variable that has 50% of the probability (or frequency) of the whole sample How to estimate it? M = {x : F(x) N/2} Statistics: Descriptive Statistics &

45 Percentiles Description Percentiles expresses a measure of central tendency that is assined to a value of the random variable that has P% of the probability (or frequency) of the whole sample How to estimate it? 1 1st Percentile Q 1 = {x : F(x) N/4} 2 2nd Percentile (Median) Q 2 = {x : F(x) N/2} 3 3nd Percentile Q 3 = {x : F(x) 3 N/4} Statistics: Descriptive Statistics &

46 Car Speed data Estimate the Median using Classified Values Q 2 = l i + δ i f i [ N 2 F i 1], 1 l i, the lower bound of the class where F(x) N/2 2 δ i the width of a class 3 f i the frequency of the class such that F(x) N/2 4 F i 1 the bounded frequency which represents the i 1 class of the F(x) N/2 condition Statistics: Descriptive Statistics &

47 Mode Description Mode expresses a location measure that inform as about the value of our random variable with the highest frequency (or probability) How to estimate it? m x = {x : f(x) = max(f 1,...,f N )} Problems with estimating m x 1 The Mode (m x) estimation may depend on the classes assigned for the analysed random variable. A possible change in the width of class assigned may change the mode of a random variable. When no classes assigned then mode is an objective measures of central tendency. 2 The same m x may appear for two (2) or even more random variable values or classes. This are the bimodal and the multimodal cases respectively. beamer-tu-logo Statistics: Descriptive Statistics &

48 Car Speed data Estimate the mode using Discrete Values m x = 20 Estimate the mode using Classified Values m x [10, 20] 1 Not precise estimation 2 May assign mode to values with very low frequency, such as 11 and 16 Statistics: Descriptive Statistics &

49 Absolute Measures of Linear Relationship (2 random variables) Covariance Covariance is the measure of how much two random variables move together. If two variables tend to move together in the same direction, then the covariance between the two variables will be positive. If two variables move in the opposite direction, the covariance will be negative. If there is no tendency for two variables to move one way or the other, then the covariance will be zero. How to estimate it? 1 For a sample of N observations 2 For the population S 2 x,y = N i=1 (x i x)(y i ȳ) N 1 σ 2 x,y = N i=1 (x i µ x)(y i µ y) N beamer-tu-logo Statistics: Descriptive Statistics &

50 Relative of Linear Relationship (2 random variables) Correlation Correlation is the relative measure of linear relationship between two (2) random variables How to estimate it? 1 For a sample of N observations ρ x,y = σx,y σ x σ y 2 For the population r x,y = sx,y s x s y Statistics: Descriptive Statistics &

51 beamer-tu-logo Correlation, explained 1 The correlation s sign depends on the Covariance sign (e.g. positive covariance lead to positive correlation) 2 1 ρ x,y 1, 1 r x,y 1 3 When ρ x,y, r x,y 0, we have nearly uncorrelated random variables 4 ρ x,y, r x,y > 0, we have possitivly and when < 0 negatively correlated random variables Statistics: Descriptive Statistics &

52 Sales versus Advertisement (in thousand Euro s) Sales (thous. Euro) Advert. (thous. Euro) Statistics: Descriptive Statistics &

53 Car speed (mph) versus Distance to stop (in ft) Stopping distance (ft) Speed (mph) Statistics: Descriptive Statistics &

54 Car weights versus Miles per gallon Miles Per Gallon Car Weight Statistics: Descriptive Statistics &

55 Correlation using data Correlation in Sales vs Advertisement r x,y = Correlation in Car speed (mph) versus Distance to stop r x,y = Correlation in Car weights versus Miles per gallon r x,y = Statistics: Descriptive Statistics &

56 Coefficient of Variation (CV) Description Coefficient of Variation is a relative dispersion measure for a random variable that expresses the standard deviation as a percentage of the arithmetic mean. How to estimate it? 1 For a sample of N observations CV = 100 S x 2 For the population CV = σ µ 100 Statistics: Descriptive Statistics &

57 beamer-tu-logo Why the CV is used? We use it when we like to compare the variation of two population (or samples) of different measures. Example: When sales data of two (2) different population (with different currencies) are analysed for their dispersion one can use the variance measure. However, variance expresses the an absolute measure of variation on the currency of each population. Here, the CV can demonstrate a relative dispersion measure in order for the dispersions to be comparable. Statistics: Descriptive Statistics &

58 Skewness Coefficient Description Skewness Coefficient is a measure of asymmetry for the frequency (or probability) distribution of a random variable How to estimate it? 1 Pearson s first coefficient of skewness of the distribution of a random variable using: µx mx a 1 = σ x 2 Skewness coefficient with µ 3 x = 1 N a 1 = µ3 x σ 3 x N (x i µ) 3 i=1 beamer-tu-logo Statistics: Descriptive Statistics &

59 beamer-tu-logo Skewness, explained 1 When a 1 > 0 we have positive skewness with µ x > m x (positive asymmetry) 2 When a 1 < 0 we have negative skewness with µ x < m x (negative asymmetry) 3 When a 1 0 we have zero (0) skewness (symmetry) Statistics: Descriptive Statistics &

60 Skewness Paradigm Statistics: Descriptive Statistics &

61 Kurtosis Coefficient Description Kurtosis Coefficient is a measure that assesses how flat or peaked is the frequency (or probability) distribution of a random variable How to estimate it? Pearson s coefficient of kurtosis a 2 = µ4 x σ 4 x with µ 4 x = 1 N N (x i µ) 4 i=1 Statistics: Descriptive Statistics &

62 beamer-tu-logo Kurtosis, explained 1 When a 2 3 > 0 we have more peaked distribution (leptokurtic or fat-tailed distribution) 2 When a 2 3 < 0 we have less peaked distribution (platykurtic or long-tailed distribution ) 3 When a 2 3 = 0 we have mesokurtic distribution Statistics: Descriptive Statistics &

63 Kurtosis Paradigm Statistics: Descriptive Statistics &

64 Skewness and Kurtosis using Advertisement data Estimate Skewness and Kurtosis when using Discrete Values a 1 = , Positive Asymmetry a 2 = , Platykurtic Estimate Skewness and Kurtosis when using Classified Values a 1 = , Positive Asymmetry a 2 = , Platykurtic 1 Similar signs in Asymmetry and Kurtosis 2 Much stronger positive asymmetry in the Classified Values (look at the graph!!!) Statistics: Descriptive Statistics &

65 Description Basic Notions of 1 2 Basic Notions of 3 Statistics: Descriptive Statistics &

66 Basic Notions of Definition is the likelihood for specific outcome of a random experiment to happen Number of possible outcomes in which the event occurs Total number of possible outcomes 1 Example I: The probability of having head when tossing a coin (theoretical probability) 1/2 2 Example III: The probability of having King when selecting a playing card (theoretical probability) 4/52 3 Example IV: The probability of Car speed 4 in the Car speed data sampling experiment (empirical probability) 2/50 beamer-tu-logo Statistics: Descriptive Statistics &

67 Basic Notions of Definition is the likelihood for specific outcome of a random experiment to happen Number of possible outcomes in which the event occurs Total number of possible outcomes 1 Example I: The probability of having head when tossing a coin (theoretical probability) 1/2 2 Example III: The probability of having King when selecting a playing card (theoretical probability) 4/52 3 Example IV: The probability of Car speed 4 in the Car speed data sampling experiment (empirical probability) 2/50 beamer-tu-logo Statistics: Descriptive Statistics &

68 Basic Notions of Definition is the likelihood for specific outcome of a random experiment to happen Number of possible outcomes in which the event occurs Total number of possible outcomes 1 Example I: The probability of having head when tossing a coin (theoretical probability) 1/2 2 Example III: The probability of having King when selecting a playing card (theoretical probability) 4/52 3 Example IV: The probability of Car speed 4 in the Car speed data sampling experiment (empirical probability) 2/50 beamer-tu-logo Statistics: Descriptive Statistics &

69 Basic Notions of Definition is the likelihood for specific outcome of a random experiment to happen Number of possible outcomes in which the event occurs Total number of possible outcomes 1 Example I: The probability of having head when tossing a coin (theoretical probability) 1/2 2 Example III: The probability of having King when selecting a playing card (theoretical probability) 4/52 3 Example IV: The probability of Car speed 4 in the Car speed data sampling experiment (empirical probability) 2/50 beamer-tu-logo Statistics: Descriptive Statistics &

70 (cont.) Basic Notions of Types of 1 Empirical : The probability estimated as an outcome of empirical experiment 2 Theoretical : The probability estimated as an empirical of theoretical experiment Important!! 1 Theoretical probabilities never coincide with the empirical 2 As researcher increase the sample N to increase the accuracy of probability estimation 3 The empirical probabilities converge to the theoretical as N Statistics: Descriptive Statistics &

71 (cont.) Basic Notions of Types of 1 Empirical : The probability estimated as an outcome of empirical experiment 2 Theoretical : The probability estimated as an empirical of theoretical experiment Important!! 1 Theoretical probabilities never coincide with the empirical 2 As researcher increase the sample N to increase the accuracy of probability estimation 3 The empirical probabilities converge to the theoretical as N Statistics: Descriptive Statistics &

72 (cont.) Basic Notions of Types of 1 Empirical : The probability estimated as an outcome of empirical experiment 2 Theoretical : The probability estimated as an empirical of theoretical experiment Important!! 1 Theoretical probabilities never coincide with the empirical 2 As researcher increase the sample N to increase the accuracy of probability estimation 3 The empirical probabilities converge to the theoretical as N Statistics: Descriptive Statistics &

73 (cont.) Basic Notions of Types of 1 Empirical : The probability estimated as an outcome of empirical experiment 2 Theoretical : The probability estimated as an empirical of theoretical experiment Important!! 1 Theoretical probabilities never coincide with the empirical 2 As researcher increase the sample N to increase the accuracy of probability estimation 3 The empirical probabilities converge to the theoretical as N Statistics: Descriptive Statistics &

74 Car Speed using probabilities Basic Notions of car speed car speed Density speed Statistics: Descriptive Statistics &

75 Car Speed using probabilities Basic Notions of Estimating Probabilities 1 P(x = 4) = P(x 7) = = P(x 20) = 1 P(x 19) = 0.24 Statistics: Descriptive Statistics &

76 (cont.) Basic Notions of Sample Space Sample space (Ω) is the collection of all posible outcomes in a random experiment Ω = {E 1,...,E k } Sample Space for Car Speed data Ω = {4, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 22, 23, 24, 25} Redefining For a given sample space Ω, then the probabilities follow the following: 1 0 P(E i ) 1, i = 1,...,k 2 The sum of the probabilities k P(E i ) = 1 i=1 Statistics: Descriptive Statistics & beamer-tu-logo

77 (cont.) Basic Notions of Laws of Symbolises the union of events Symbolises the intersection of events 1 When E + E = Ω the events are condidered mutually exclussive and have P(E 1 E 2 ) = 0 2 For E 1 and E 2 with E 1 E 2, the union of the events P(E 1 E 2 ) = P(E 1 )+P(E 2 ) P(E 1 E 2 ) 3 For E 1 and E 2 with E 1 E 2 =, the union of the events for i = 1,...,k P(E 1 E 2 ) = P(E 1 )+P(E 2 ) 4 For E 1 and E 2 independent events we have P(E 1 E 2 ) = P(E 1 ) P(E 2 ) Eg: E 1 the event of having even outcome when tossing one coin, and E 2 the event of having even outcome when tossing another coin beamer-tu-logo Statistics: Descriptive Statistics &

78 (cont.) Basic Notions of Conditional 1 Marginal : The probability that a given event will occur. No other events are taken into consideration. A typical expression is P(A) for the A event. 2 Joint : The probability that two or more events will all occur. A typical expression is P(A B) for the A and B events. 3 Conditional : The probability that an event will occur, given that another event has already happened. A typical expression is P(A B), with the verbal description, the probability of A, given B. Statistics: Descriptive Statistics &

79 (cont.) Basic Notions of Conditional Supose data from two (2) random variables. E i the quality for a product being high-medium-low for i = 1 = 2 and i = 3 respectively. On the other hand we can divide the our sample space into A and B for the A-market area and the B-market area respectively. We can assign conditional frequencies (probabilities) to the 2-entry table as follows: A B E 1 P(E 1 A) P(E 1 B) P(E 1 ) E 2 P(E 2 A) P(E 2 B) P(E 2 ) E 3 P(E 3 A) P(E 3 B) P(E 3 ) P(A) P(B) P(Ω) Statistics: Descriptive Statistics &

80 (cont.) Basic Notions of Conditional (cont.) Using data we can derive the following 2-entry table: A B E 1 P(E 1 A) = 0.48 P(E 1 B) = 0.12 P(E 1 ) = 0.60 E 2 P(E 2 A) = 0.15 P(E 2 B) = 0.10 P(E 2 ) = 0.25 E 3 P(E 3 A) = P(E 3 B) = P(E 3 ) = 0.15 P(A) = P(B) = P(Ω) = 1 Statistics: Descriptive Statistics &

81 (cont.) Basic Notions of Conditional (cont.) One can derive the probability of having a product of high quality conditional on being in the A market area as: where the marginal probability. and P(E 1 A) = P(E 1 A), P(A) P(A) = P(E 1 A)+P(E 2 A)+P(E 3 A), P(E 1 A) = P(E 1 A) P(A) the joint probabilities. P(E 1 B) = P(E 1 B) P(B) Statistics: Descriptive Statistics &

82 (cont.) Basic Notions of Conditional (cont.) One can derive the probability of having a product of at least medium quality conditional on being in the A market area as: where the marginal probability. and P(E 1 E 2 A) = P((E 1 E 2 ) A), P(A) P(A) = P(E 1 A)+P(E 2 A)+P(E 3 A), P((E 1 E 2 ) A) = P((E 1 A) (E 2 A)) = P(E 1 A)+P(E 2 A) = 0.63 So, the P(E 1 E 2 A) = Question? Is this probability bigger than that of having a product of at least medium quality conditional on being in the B market area? beamer-tu-logo P(E 1 E 2 B) = Statistics: Descriptive Statistics &

83 (cont.) Basic Notions of Conditional (cont.) One can derive the probability of having a product of at least medium quality conditional on being in the A market area as: where the marginal probability. and P(E 1 E 2 A) = P((E 1 E 2 ) A), P(A) P(A) = P(E 1 A)+P(E 2 A)+P(E 3 A), P((E 1 E 2 ) A) = P((E 1 A) (E 2 A)) = P(E 1 A)+P(E 2 A) = 0.63 So, the P(E 1 E 2 A) = Question? Is this probability bigger than that of having a product of at least medium quality conditional on being in the B market area? beamer-tu-logo P(E 1 E 2 B) = Statistics: Descriptive Statistics &

84 Description 1 2 Basic Notions of 3 Statistics: Descriptive Statistics &

85 Anderson D.R., Sweeney, D.J, Statistics for Business and Economics. South-Western College Pub; 11th edition, 2010 Weiers R.M., to Business Statistics. South-Western College Pub; 7th edition, Koop G., Analysis of Economic. Wiley Pub; 2nd edition, Statistics: Descriptive Statistics &

86 Anderson D.R., Sweeney, D.J, Statistics for Business and Economics. South-Western College Pub; 11th edition, 2010 Weiers R.M., to Business Statistics. South-Western College Pub; 7th edition, Koop G., Analysis of Economic. Wiley Pub; 2nd edition, Statistics: Descriptive Statistics &

87 Anderson D.R., Sweeney, D.J, Statistics for Business and Economics. South-Western College Pub; 11th edition, 2010 Weiers R.M., to Business Statistics. South-Western College Pub; 7th edition, Koop G., Analysis of Economic. Wiley Pub; 2nd edition, Statistics: Descriptive Statistics &

MBA 611 STATISTICS AND QUANTITATIVE METHODS

MBA 611 STATISTICS AND QUANTITATIVE METHODS MBA 611 STATISTICS AND QUANTITATIVE METHODS Part I. Review of Basic Statistics (Chapters 1-11) A. Introduction (Chapter 1) Uncertainty: Decisions are often based on incomplete information from uncertain

More information

CA200 Quantitative Analysis for Business Decisions. File name: CA200_Section_04A_StatisticsIntroduction

CA200 Quantitative Analysis for Business Decisions. File name: CA200_Section_04A_StatisticsIntroduction CA200 Quantitative Analysis for Business Decisions File name: CA200_Section_04A_StatisticsIntroduction Table of Contents 4. Introduction to Statistics... 1 4.1 Overview... 3 4.2 Discrete or continuous

More information

Week 1. Exploratory Data Analysis

Week 1. Exploratory Data Analysis Week 1 Exploratory Data Analysis Practicalities This course ST903 has students from both the MSc in Financial Mathematics and the MSc in Statistics. Two lectures and one seminar/tutorial per week. Exam

More information

II. DISTRIBUTIONS distribution normal distribution. standard scores

II. DISTRIBUTIONS distribution normal distribution. standard scores Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,

More information

DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.

DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses. DESCRIPTIVE STATISTICS The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses. DESCRIPTIVE VS. INFERENTIAL STATISTICS Descriptive To organize,

More information

Descriptive Statistics. Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion

Descriptive Statistics. Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion Descriptive Statistics Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion Statistics as a Tool for LIS Research Importance of statistics in research

More information

E3: PROBABILITY AND STATISTICS lecture notes

E3: PROBABILITY AND STATISTICS lecture notes E3: PROBABILITY AND STATISTICS lecture notes 2 Contents 1 PROBABILITY THEORY 7 1.1 Experiments and random events............................ 7 1.2 Certain event. Impossible event............................

More information

4. Continuous Random Variables, the Pareto and Normal Distributions

4. Continuous Random Variables, the Pareto and Normal Distributions 4. Continuous Random Variables, the Pareto and Normal Distributions A continuous random variable X can take any value in a given range (e.g. height, weight, age). The distribution of a continuous random

More information

Statistics. Measurement. Scales of Measurement 7/18/2012

Statistics. Measurement. Scales of Measurement 7/18/2012 Statistics Measurement Measurement is defined as a set of rules for assigning numbers to represent objects, traits, attributes, or behaviors A variableis something that varies (eye color), a constant does

More information

Chapter 4. Probability and Probability Distributions

Chapter 4. Probability and Probability Distributions Chapter 4. robability and robability Distributions Importance of Knowing robability To know whether a sample is not identical to the population from which it was selected, it is necessary to assess the

More information

Descriptive Statistics and Measurement Scales

Descriptive Statistics and Measurement Scales Descriptive Statistics 1 Descriptive Statistics and Measurement Scales Descriptive statistics are used to describe the basic features of the data in a study. They provide simple summaries about the sample

More information

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4) Summary of Formulas and Concepts Descriptive Statistics (Ch. 1-4) Definitions Population: The complete set of numerical information on a particular quantity in which an investigator is interested. We assume

More information

Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm

Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm Mgt 540 Research Methods Data Analysis 1 Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm http://web.utk.edu/~dap/random/order/start.htm

More information

Probability and Statistics Vocabulary List (Definitions for Middle School Teachers)

Probability and Statistics Vocabulary List (Definitions for Middle School Teachers) Probability and Statistics Vocabulary List (Definitions for Middle School Teachers) B Bar graph a diagram representing the frequency distribution for nominal or discrete data. It consists of a sequence

More information

Exploratory Data Analysis

Exploratory Data Analysis Exploratory Data Analysis Johannes Schauer johannes.schauer@tugraz.at Institute of Statistics Graz University of Technology Steyrergasse 17/IV, 8010 Graz www.statistics.tugraz.at February 12, 2008 Introduction

More information

Foundation of Quantitative Data Analysis

Foundation of Quantitative Data Analysis Foundation of Quantitative Data Analysis Part 1: Data manipulation and descriptive statistics with SPSS/Excel HSRS #10 - October 17, 2013 Reference : A. Aczel, Complete Business Statistics. Chapters 1

More information

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Final Exam Review MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 1) A researcher for an airline interviews all of the passengers on five randomly

More information

Module 3: Correlation and Covariance

Module 3: Correlation and Covariance Using Statistical Data to Make Decisions Module 3: Correlation and Covariance Tom Ilvento Dr. Mugdim Pašiƒ University of Delaware Sarajevo Graduate School of Business O ften our interest in data analysis

More information

Fairfield Public Schools

Fairfield Public Schools Mathematics Fairfield Public Schools AP Statistics AP Statistics BOE Approved 04/08/2014 1 AP STATISTICS Critical Areas of Focus AP Statistics is a rigorous course that offers advanced students an opportunity

More information

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number 1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number A. 3(x - x) B. x 3 x C. 3x - x D. x - 3x 2) Write the following as an algebraic expression

More information

Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur

Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur Module No. #01 Lecture No. #15 Special Distributions-VI Today, I am going to introduce

More information

SKEWNESS. Measure of Dispersion tells us about the variation of the data set. Skewness tells us about the direction of variation of the data set.

SKEWNESS. Measure of Dispersion tells us about the variation of the data set. Skewness tells us about the direction of variation of the data set. SKEWNESS All about Skewness: Aim Definition Types of Skewness Measure of Skewness Example A fundamental task in many statistical analyses is to characterize the location and variability of a data set.

More information

Data Modeling & Analysis Techniques. Probability & Statistics. Manfred Huber 2011 1

Data Modeling & Analysis Techniques. Probability & Statistics. Manfred Huber 2011 1 Data Modeling & Analysis Techniques Probability & Statistics Manfred Huber 2011 1 Probability and Statistics Probability and statistics are often used interchangeably but are different, related fields

More information

Joint Exam 1/P Sample Exam 1

Joint Exam 1/P Sample Exam 1 Joint Exam 1/P Sample Exam 1 Take this practice exam under strict exam conditions: Set a timer for 3 hours; Do not stop the timer for restroom breaks; Do not look at your notes. If you believe a question

More information

There are three kinds of people in the world those who are good at math and those who are not. PSY 511: Advanced Statistics for Psychological and Behavioral Research 1 Positive Views The record of a month

More information

Descriptive Statistics

Descriptive Statistics Y520 Robert S Michael Goal: Learn to calculate indicators and construct graphs that summarize and describe a large quantity of values. Using the textbook readings and other resources listed on the web

More information

Descriptive statistics parameters: Measures of centrality

Descriptive statistics parameters: Measures of centrality Descriptive statistics parameters: Measures of centrality Contents Definitions... 3 Classification of descriptive statistics parameters... 4 More about central tendency estimators... 5 Relationship between

More information

8. THE NORMAL DISTRIBUTION

8. THE NORMAL DISTRIBUTION 8. THE NORMAL DISTRIBUTION The normal distribution with mean μ and variance σ 2 has the following density function: The normal distribution is sometimes called a Gaussian Distribution, after its inventor,

More information

AP Statistics Solutions to Packet 2

AP Statistics Solutions to Packet 2 AP Statistics Solutions to Packet 2 The Normal Distributions Density Curves and the Normal Distribution Standard Normal Calculations HW #9 1, 2, 4, 6-8 2.1 DENSITY CURVES (a) Sketch a density curve that

More information

Normal distribution. ) 2 /2σ. 2π σ

Normal distribution. ) 2 /2σ. 2π σ Normal distribution The normal distribution is the most widely known and used of all distributions. Because the normal distribution approximates many natural phenomena so well, it has developed into a

More information

Measures of Central Tendency and Variability: Summarizing your Data for Others

Measures of Central Tendency and Variability: Summarizing your Data for Others Measures of Central Tendency and Variability: Summarizing your Data for Others 1 I. Measures of Central Tendency: -Allow us to summarize an entire data set with a single value (the midpoint). 1. Mode :

More information

Descriptive Statistics

Descriptive Statistics Descriptive Statistics Primer Descriptive statistics Central tendency Variation Relative position Relationships Calculating descriptive statistics Descriptive Statistics Purpose to describe or summarize

More information

Statistics Review PSY379

Statistics Review PSY379 Statistics Review PSY379 Basic concepts Measurement scales Populations vs. samples Continuous vs. discrete variable Independent vs. dependent variable Descriptive vs. inferential stats Common analyses

More information

Elementary Statistics

Elementary Statistics Elementary Statistics Chapter 1 Dr. Ghamsary Page 1 Elementary Statistics M. Ghamsary, Ph.D. Chap 01 1 Elementary Statistics Chapter 1 Dr. Ghamsary Page 2 Statistics: Statistics is the science of collecting,

More information

Algebra 1 2008. Academic Content Standards Grade Eight and Grade Nine Ohio. Grade Eight. Number, Number Sense and Operations Standard

Algebra 1 2008. Academic Content Standards Grade Eight and Grade Nine Ohio. Grade Eight. Number, Number Sense and Operations Standard Academic Content Standards Grade Eight and Grade Nine Ohio Algebra 1 2008 Grade Eight STANDARDS Number, Number Sense and Operations Standard Number and Number Systems 1. Use scientific notation to express

More information

Continuous Random Variables

Continuous Random Variables Chapter 5 Continuous Random Variables 5.1 Continuous Random Variables 1 5.1.1 Student Learning Objectives By the end of this chapter, the student should be able to: Recognize and understand continuous

More information

Curriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools 2009-2010

Curriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools 2009-2010 Curriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools 2009-2010 Week 1 Week 2 14.0 Students organize and describe distributions of data by using a number of different

More information

Using Excel for inferential statistics

Using Excel for inferential statistics FACT SHEET Using Excel for inferential statistics Introduction When you collect data, you expect a certain amount of variation, just caused by chance. A wide variety of statistical tests can be applied

More information

MATH 10: Elementary Statistics and Probability Chapter 5: Continuous Random Variables

MATH 10: Elementary Statistics and Probability Chapter 5: Continuous Random Variables MATH 10: Elementary Statistics and Probability Chapter 5: Continuous Random Variables Tony Pourmohamad Department of Mathematics De Anza College Spring 2015 Objectives By the end of this set of slides,

More information

ST 371 (IV): Discrete Random Variables

ST 371 (IV): Discrete Random Variables ST 371 (IV): Discrete Random Variables 1 Random Variables A random variable (rv) is a function that is defined on the sample space of the experiment and that assigns a numerical variable to each possible

More information

Introduction to Quantitative Methods

Introduction to Quantitative Methods Introduction to Quantitative Methods October 15, 2009 Contents 1 Definition of Key Terms 2 2 Descriptive Statistics 3 2.1 Frequency Tables......................... 4 2.2 Measures of Central Tendencies.................

More information

Bayes Theorem. Bayes Theorem- Example. Evaluation of Medical Screening Procedure. Evaluation of Medical Screening Procedure

Bayes Theorem. Bayes Theorem- Example. Evaluation of Medical Screening Procedure. Evaluation of Medical Screening Procedure Bayes Theorem P(C A) P(A) P(A C) = P(C A) P(A) + P(C B) P(B) P(E B) P(B) P(B E) = P(E B) P(B) + P(E A) P(A) P(D A) P(A) P(A D) = P(D A) P(A) + P(D B) P(B) Cost of procedure is $1,000,000 Data regarding

More information

3. Data Analysis, Statistics, and Probability

3. Data Analysis, Statistics, and Probability 3. Data Analysis, Statistics, and Probability Data and probability sense provides students with tools to understand information and uncertainty. Students ask questions and gather and use data to answer

More information

Chapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs

Chapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs Types of Variables Chapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs Quantitative (numerical)variables: take numerical values for which arithmetic operations make sense (addition/averaging)

More information

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013 Statistics I for QBIC Text Book: Biostatistics, 10 th edition, by Daniel & Cross Contents and Objectives Chapters 1 7 Revised: August 2013 Chapter 1: Nature of Statistics (sections 1.1-1.6) Objectives

More information

Algebra I Vocabulary Cards

Algebra I Vocabulary Cards Algebra I Vocabulary Cards Table of Contents Expressions and Operations Natural Numbers Whole Numbers Integers Rational Numbers Irrational Numbers Real Numbers Absolute Value Order of Operations Expression

More information

An Introduction to Basic Statistics and Probability

An Introduction to Basic Statistics and Probability An Introduction to Basic Statistics and Probability Shenek Heyward NCSU An Introduction to Basic Statistics and Probability p. 1/4 Outline Basic probability concepts Conditional probability Discrete Random

More information

COMMON CORE STATE STANDARDS FOR

COMMON CORE STATE STANDARDS FOR COMMON CORE STATE STANDARDS FOR Mathematics (CCSSM) High School Statistics and Probability Mathematics High School Statistics and Probability Decisions or predictions are often based on data numbers in

More information

c. Construct a boxplot for the data. Write a one sentence interpretation of your graph.

c. Construct a boxplot for the data. Write a one sentence interpretation of your graph. MBA/MIB 5315 Sample Test Problems Page 1 of 1 1. An English survey of 3000 medical records showed that smokers are more inclined to get depressed than non-smokers. Does this imply that smoking causes depression?

More information

Pie Charts. proportion of ice-cream flavors sold annually by a given brand. AMS-5: Statistics. Cherry. Cherry. Blueberry. Blueberry. Apple.

Pie Charts. proportion of ice-cream flavors sold annually by a given brand. AMS-5: Statistics. Cherry. Cherry. Blueberry. Blueberry. Apple. Graphical Representations of Data, Mean, Median and Standard Deviation In this class we will consider graphical representations of the distribution of a set of data. The goal is to identify the range of

More information

Lesson 4 Measures of Central Tendency

Lesson 4 Measures of Central Tendency Outline Measures of a distribution s shape -modality and skewness -the normal distribution Measures of central tendency -mean, median, and mode Skewness and Central Tendency Lesson 4 Measures of Central

More information

NEW YORK CITY COLLEGE OF TECHNOLOGY The City University of New York

NEW YORK CITY COLLEGE OF TECHNOLOGY The City University of New York NEW YORK CITY COLLEGE OF TECHNOLOGY The City University of New York DEPARTMENT: Mathematics COURSE: MAT 1272/ MA 272 TITLE: DESCRIPTION: TEXT: Statistics An introduction to statistical methods and statistical

More information

STA-201-TE. 5. Measures of relationship: correlation (5%) Correlation coefficient; Pearson r; correlation and causation; proportion of common variance

STA-201-TE. 5. Measures of relationship: correlation (5%) Correlation coefficient; Pearson r; correlation and causation; proportion of common variance Principles of Statistics STA-201-TE This TECEP is an introduction to descriptive and inferential statistics. Topics include: measures of central tendency, variability, correlation, regression, hypothesis

More information

Exploratory data analysis (Chapter 2) Fall 2011

Exploratory data analysis (Chapter 2) Fall 2011 Exploratory data analysis (Chapter 2) Fall 2011 Data Examples Example 1: Survey Data 1 Data collected from a Stat 371 class in Fall 2005 2 They answered questions about their: gender, major, year in school,

More information

Introduction to Statistics and Quantitative Research Methods

Introduction to Statistics and Quantitative Research Methods Introduction to Statistics and Quantitative Research Methods Purpose of Presentation To aid in the understanding of basic statistics, including terminology, common terms, and common statistical methods.

More information

Northumberland Knowledge

Northumberland Knowledge Northumberland Knowledge Know Guide How to Analyse Data - November 2012 - This page has been left blank 2 About this guide The Know Guides are a suite of documents that provide useful information about

More information

Descriptive statistics Statistical inference statistical inference, statistical induction and inferential statistics

Descriptive statistics Statistical inference statistical inference, statistical induction and inferential statistics Descriptive statistics is the discipline of quantitatively describing the main features of a collection of data. Descriptive statistics are distinguished from inferential statistics (or inductive statistics),

More information

Introduction to Risk, Return and the Historical Record

Introduction to Risk, Return and the Historical Record Introduction to Risk, Return and the Historical Record Rates of return Investors pay attention to the rate at which their fund have grown during the period The holding period returns (HDR) measure the

More information

1. A survey of a group s viewing habits over the last year revealed the following

1. A survey of a group s viewing habits over the last year revealed the following 1. A survey of a group s viewing habits over the last year revealed the following information: (i) 8% watched gymnastics (ii) 9% watched baseball (iii) 19% watched soccer (iv) 14% watched gymnastics and

More information

PROBABILITY AND SAMPLING DISTRIBUTIONS

PROBABILITY AND SAMPLING DISTRIBUTIONS PROBABILITY AND SAMPLING DISTRIBUTIONS SEEMA JAGGI AND P.K. BATRA Indian Agricultural Statistics Research Institute Library Avenue, New Delhi - 0 0 seema@iasri.res.in. Introduction The concept of probability

More information

Characteristics of Binomial Distributions

Characteristics of Binomial Distributions Lesson2 Characteristics of Binomial Distributions In the last lesson, you constructed several binomial distributions, observed their shapes, and estimated their means and standard deviations. In Investigation

More information

Dongfeng Li. Autumn 2010

Dongfeng Li. Autumn 2010 Autumn 2010 Chapter Contents Some statistics background; ; Comparing means and proportions; variance. Students should master the basic concepts, descriptive statistics measures and graphs, basic hypothesis

More information

Probability and statistics; Rehearsal for pattern recognition

Probability and statistics; Rehearsal for pattern recognition Probability and statistics; Rehearsal for pattern recognition Václav Hlaváč Czech Technical University in Prague Faculty of Electrical Engineering, Department of Cybernetics Center for Machine Perception

More information

Guided Reading 9 th Edition. informed consent, protection from harm, deception, confidentiality, and anonymity.

Guided Reading 9 th Edition. informed consent, protection from harm, deception, confidentiality, and anonymity. Guided Reading Educational Research: Competencies for Analysis and Applications 9th Edition EDFS 635: Educational Research Chapter 1: Introduction to Educational Research 1. List and briefly describe the

More information

Probability Distributions

Probability Distributions Learning Objectives Probability Distributions Section 1: How Can We Summarize Possible Outcomes and Their Probabilities? 1. Random variable 2. Probability distributions for discrete random variables 3.

More information

Lecture Note 1 Set and Probability Theory. MIT 14.30 Spring 2006 Herman Bennett

Lecture Note 1 Set and Probability Theory. MIT 14.30 Spring 2006 Herman Bennett Lecture Note 1 Set and Probability Theory MIT 14.30 Spring 2006 Herman Bennett 1 Set Theory 1.1 Definitions and Theorems 1. Experiment: any action or process whose outcome is subject to uncertainty. 2.

More information

Financial Time Series Analysis (FTSA) Lecture 1: Introduction

Financial Time Series Analysis (FTSA) Lecture 1: Introduction Financial Time Series Analysis (FTSA) Lecture 1: Introduction Brief History of Time Series Analysis Statistical analysis of time series data (Yule, 1927) v/s forecasting (even longer). Forecasting is often

More information

Normal Distribution Lecture Notes

Normal Distribution Lecture Notes Normal Distribution Lecture Notes Professor Richard Blecksmith richard@math.niu.edu Dept. of Mathematical Sciences Northern Illinois University Math 101 Website: http://math.niu.edu/ richard/math101 Section

More information

Geostatistics Exploratory Analysis

Geostatistics Exploratory Analysis Instituto Superior de Estatística e Gestão de Informação Universidade Nova de Lisboa Master of Science in Geospatial Technologies Geostatistics Exploratory Analysis Carlos Alberto Felgueiras cfelgueiras@isegi.unl.pt

More information

2. Filling Data Gaps, Data validation & Descriptive Statistics

2. Filling Data Gaps, Data validation & Descriptive Statistics 2. Filling Data Gaps, Data validation & Descriptive Statistics Dr. Prasad Modak Background Data collected from field may suffer from these problems Data may contain gaps ( = no readings during this period)

More information

Probability Distributions

Probability Distributions CHAPTER 5 Probability Distributions CHAPTER OUTLINE 5.1 Probability Distribution of a Discrete Random Variable 5.2 Mean and Standard Deviation of a Probability Distribution 5.3 The Binomial Distribution

More information

Quantitative Methods for Finance

Quantitative Methods for Finance Quantitative Methods for Finance Module 1: The Time Value of Money 1 Learning how to interpret interest rates as required rates of return, discount rates, or opportunity costs. 2 Learning how to explain

More information

Bayesian Analysis for the Social Sciences

Bayesian Analysis for the Social Sciences Bayesian Analysis for the Social Sciences Simon Jackman Stanford University http://jackman.stanford.edu/bass November 9, 2012 Simon Jackman (Stanford) Bayesian Analysis for the Social Sciences November

More information

Statistics in Geophysics: Introduction and Probability Theory

Statistics in Geophysics: Introduction and Probability Theory Statistics in Geophysics: Introduction and Steffen Unkel Department of Statistics Ludwig-Maximilians-University Munich, Germany Winter Term 2013/14 1/32 What is Statistics? Introduction Statistics is the

More information

Risk and return (1) Class 9 Financial Management, 15.414

Risk and return (1) Class 9 Financial Management, 15.414 Risk and return (1) Class 9 Financial Management, 15.414 Today Risk and return Statistics review Introduction to stock price behavior Reading Brealey and Myers, Chapter 7, p. 153 165 Road map Part 1. Valuation

More information

How To Write A Data Analysis

How To Write A Data Analysis Mathematics Probability and Statistics Curriculum Guide Revised 2010 This page is intentionally left blank. Introduction The Mathematics Curriculum Guide serves as a guide for teachers when planning instruction

More information

Notes on Continuous Random Variables

Notes on Continuous Random Variables Notes on Continuous Random Variables Continuous random variables are random quantities that are measured on a continuous scale. They can usually take on any value over some interval, which distinguishes

More information

STATS8: Introduction to Biostatistics. Data Exploration. Babak Shahbaba Department of Statistics, UCI

STATS8: Introduction to Biostatistics. Data Exploration. Babak Shahbaba Department of Statistics, UCI STATS8: Introduction to Biostatistics Data Exploration Babak Shahbaba Department of Statistics, UCI Introduction After clearly defining the scientific problem, selecting a set of representative members

More information

Investment Statistics: Definitions & Formulas

Investment Statistics: Definitions & Formulas Investment Statistics: Definitions & Formulas The following are brief descriptions and formulas for the various statistics and calculations available within the ease Analytics system. Unless stated otherwise,

More information

A review of the portions of probability useful for understanding experimental design and analysis.

A review of the portions of probability useful for understanding experimental design and analysis. Chapter 3 Review of Probability A review of the portions of probability useful for understanding experimental design and analysis. The material in this section is intended as a review of the topic of probability

More information

Midterm Review Problems

Midterm Review Problems Midterm Review Problems October 19, 2013 1. Consider the following research title: Cooperation among nursery school children under two types of instruction. In this study, what is the independent variable?

More information

For a partition B 1,..., B n, where B i B j = for i. A = (A B 1 ) (A B 2 ),..., (A B n ) and thus. P (A) = P (A B i ) = P (A B i )P (B i )

For a partition B 1,..., B n, where B i B j = for i. A = (A B 1 ) (A B 2 ),..., (A B n ) and thus. P (A) = P (A B i ) = P (A B i )P (B i ) Probability Review 15.075 Cynthia Rudin A probability space, defined by Kolmogorov (1903-1987) consists of: A set of outcomes S, e.g., for the roll of a die, S = {1, 2, 3, 4, 5, 6}, 1 1 2 1 6 for the roll

More information

Big Ideas in Mathematics

Big Ideas in Mathematics Big Ideas in Mathematics which are important to all mathematics learning. (Adapted from the NCTM Curriculum Focal Points, 2006) The Mathematics Big Ideas are organized using the PA Mathematics Standards

More information

Prentice Hall Algebra 2 2011 Correlated to: Colorado P-12 Academic Standards for High School Mathematics, Adopted 12/2009

Prentice Hall Algebra 2 2011 Correlated to: Colorado P-12 Academic Standards for High School Mathematics, Adopted 12/2009 Content Area: Mathematics Grade Level Expectations: High School Standard: Number Sense, Properties, and Operations Understand the structure and properties of our number system. At their most basic level

More information

Covariance and Correlation

Covariance and Correlation Covariance and Correlation ( c Robert J. Serfling Not for reproduction or distribution) We have seen how to summarize a data-based relative frequency distribution by measures of location and spread, such

More information

Interpreting Data in Normal Distributions

Interpreting Data in Normal Distributions Interpreting Data in Normal Distributions This curve is kind of a big deal. It shows the distribution of a set of test scores, the results of rolling a die a million times, the heights of people on Earth,

More information

CALCULATIONS & STATISTICS

CALCULATIONS & STATISTICS CALCULATIONS & STATISTICS CALCULATION OF SCORES Conversion of 1-5 scale to 0-100 scores When you look at your report, you will notice that the scores are reported on a 0-100 scale, even though respondents

More information

Probability. Distribution. Outline

Probability. Distribution. Outline 7 The Normal Probability Distribution Outline 7.1 Properties of the Normal Distribution 7.2 The Standard Normal Distribution 7.3 Applications of the Normal Distribution 7.4 Assessing Normality 7.5 The

More information

STANDARDISATION OF DATA SET UNDER DIFFERENT MEASUREMENT SCALES. 1 The measurement scales of variables

STANDARDISATION OF DATA SET UNDER DIFFERENT MEASUREMENT SCALES. 1 The measurement scales of variables STANDARDISATION OF DATA SET UNDER DIFFERENT MEASUREMENT SCALES Krzysztof Jajuga 1, Marek Walesiak 1 1 Wroc law University of Economics, Komandorska 118/120, 53-345 Wroc law, Poland Abstract: Standardisation

More information

Correlation key concepts:

Correlation key concepts: CORRELATION Correlation key concepts: Types of correlation Methods of studying correlation a) Scatter diagram b) Karl pearson s coefficient of correlation c) Spearman s Rank correlation coefficient d)

More information

Lecture 9: Introduction to Pattern Analysis

Lecture 9: Introduction to Pattern Analysis Lecture 9: Introduction to Pattern Analysis g Features, patterns and classifiers g Components of a PR system g An example g Probability definitions g Bayes Theorem g Gaussian densities Features, patterns

More information

Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics

Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics For 2015 Examinations Aim The aim of the Probability and Mathematical Statistics subject is to provide a grounding in

More information

PROPERTIES OF THE SAMPLE CORRELATION OF THE BIVARIATE LOGNORMAL DISTRIBUTION

PROPERTIES OF THE SAMPLE CORRELATION OF THE BIVARIATE LOGNORMAL DISTRIBUTION PROPERTIES OF THE SAMPLE CORRELATION OF THE BIVARIATE LOGNORMAL DISTRIBUTION Chin-Diew Lai, Department of Statistics, Massey University, New Zealand John C W Rayner, School of Mathematics and Applied Statistics,

More information

Algebra 1 Course Information

Algebra 1 Course Information Course Information Course Description: Students will study patterns, relations, and functions, and focus on the use of mathematical models to understand and analyze quantitative relationships. Through

More information

Exploratory Data Analysis. Psychology 3256

Exploratory Data Analysis. Psychology 3256 Exploratory Data Analysis Psychology 3256 1 Introduction If you are going to find out anything about a data set you must first understand the data Basically getting a feel for you numbers Easier to find

More information

Department of Mathematics, Indian Institute of Technology, Kharagpur Assignment 2-3, Probability and Statistics, March 2015. Due:-March 25, 2015.

Department of Mathematics, Indian Institute of Technology, Kharagpur Assignment 2-3, Probability and Statistics, March 2015. Due:-March 25, 2015. Department of Mathematics, Indian Institute of Technology, Kharagpur Assignment -3, Probability and Statistics, March 05. Due:-March 5, 05.. Show that the function 0 for x < x+ F (x) = 4 for x < for x

More information

4.1 4.2 Probability Distribution for Discrete Random Variables

4.1 4.2 Probability Distribution for Discrete Random Variables 4.1 4.2 Probability Distribution for Discrete Random Variables Key concepts: discrete random variable, probability distribution, expected value, variance, and standard deviation of a discrete random variable.

More information

Course Syllabus MATH 110 Introduction to Statistics 3 credits

Course Syllabus MATH 110 Introduction to Statistics 3 credits Course Syllabus MATH 110 Introduction to Statistics 3 credits Prerequisites: Algebra proficiency is required, as demonstrated by successful completion of high school algebra, by completion of a college

More information

The Dangers of Using Correlation to Measure Dependence

The Dangers of Using Correlation to Measure Dependence ALTERNATIVE INVESTMENT RESEARCH CENTRE WORKING PAPER SERIES Working Paper # 0010 The Dangers of Using Correlation to Measure Dependence Harry M. Kat Professor of Risk Management, Cass Business School,

More information

Random variables, probability distributions, binomial random variable

Random variables, probability distributions, binomial random variable Week 4 lecture notes. WEEK 4 page 1 Random variables, probability distributions, binomial random variable Eample 1 : Consider the eperiment of flipping a fair coin three times. The number of tails that

More information