Elementary Statistics




Elementary Statistics, Chapter 1
M. Ghamsary, Ph.D.

Statistics: the science of collecting, organizing, summarizing, and analyzing data, and of drawing conclusions from them.

Objective: The primary objective of statistics is inference.

The applications of statistics can be divided into two broad areas:
1. Descriptive statistics
2. Inferential statistics

Variable: a characteristic of an individual population unit. Data are the values (measurements or observations) that the variables can assume. Variables whose values are determined by chance are called random variables.

For example, a data set might be: 12, 13, 69, 98, 78, 87, 36, 54, 68, 36, 63, 85, 79, 75, 32, 16, 57, 58, 34, 91, 74, 83, 92. Each value in the data set is called a data value or a datum.

1. Descriptive statistics consists of numerical and graphical techniques used to summarize and present the information in a data set.
2. Inferential statistics consists of estimation, prediction, and generalization from samples to populations.

Qualitative variables are variables that can be placed into distinct categories according to some characteristic or attribute.
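As a small illustration of the descriptive side of statistics, the data set quoted above can be summarized numerically. This is only a sketch using Python's standard `statistics` module; the choice of summaries (mean, median, sample standard deviation) is an assumption made for illustration and is not prescribed by the notes.

```python
import statistics

# Data set quoted in the notes.
data = [12, 13, 69, 98, 78, 87, 36, 54, 68, 36, 63, 85,
        79, 75, 32, 16, 57, 58, 34, 91, 74, 83, 92]

n = len(data)                      # number of data values
mean = statistics.mean(data)       # arithmetic mean
median = statistics.median(data)   # middle value of the sorted data
stdev = statistics.stdev(data)     # sample standard deviation (divides by n - 1)

print(f"n = {n}, min = {min(data)}, max = {max(data)}")
print(f"mean = {mean:.2f}, median = {median}, sample sd = {stdev:.2f}")
```

These few numbers condense 23 raw values into a summary; inferential statistics would go further and use such summaries to say something about the population the values came from.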

Examples of qualitative variables:
  Gender (male or female)
  Race (White, Black, Hispanic, etc.)
  Religion

Quantitative variables are numerical in nature and can be ordered or ranked. For example:
  Age (numerical, and the values can be ranked)
  Height
  Scores on a test in a statistics class

Discrete variables assume values that can be counted; the number of possible values is finite or countably infinite. For example:
  Number of telephone calls made at our school's switchboard every day: {0, 1, 2, 3, 4, ...}
  Number of accidents on FWY 5
  Number of babies delivered at LLU hospital

Continuous variables can assume infinitely many values between any two specific values, so that there are no gaps. For example:
  Height of boys born at UCLA hospital on July 4th
  Amount of rain that fell in California in the year 2000

By contrast, the following are counts and are therefore discrete, not continuous:
  Number of car accidents on FWY 10 from 5 to 7 PM daily
  Number of babies delivered at LLU hospital daily

Levels of Measurement

When we observe and record a variable, it has characteristics that influence the type of statistical analysis we can perform on it. These characteristics are referred to as the level of measurement of the variable. The first step in any statistical analysis is to determine the level of measurement; it tells us which statistical tests can and cannot be performed.

There are four levels of measurement:
1. Nominal
2. Ordinal
3. Interval
4. Ratio

1. The nominal level of measurement refers to data that consist of names and/or categories, so that the data cannot be arranged in any specific ordering scheme. The nominal level of measurement occurs when the observations do not have a meaningful numeric value. For example:
  Sex (male, female)
  Race (White, Black, Hispanic, Asian, Persian, etc.)
  Colors of cars in the street
  Area code
  Zip code

The values of nominal variables cannot be meaningfully:
  compared to see if one is larger than another
  added or subtracted
  multiplied or divided
  averaged (the mean, which most people call the average, cannot be calculated)
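Since nominal values can only be checked for equality, the natural summaries are category counts and the most frequent category (the mode). The sketch below uses made-up observations; the car colors and area codes are hypothetical values chosen only for illustration.

```python
from collections import Counter

# Hypothetical observations of a nominal variable (colors of cars in the street).
colors = ["red", "white", "black", "white", "blue", "white", "red"]

counts = Counter(colors)                     # frequency of each category
mode, mode_count = counts.most_common(1)[0]  # most frequent category
print(counts)
print(f"mode = {mode} ({mode_count} of {len(colors)} cars)")

# Area codes are written with digits, but they are only labels.
# Storing them as strings makes it harder to average them by accident.
area_codes = [909, 213, 310, 909]            # hypothetical values
labels = [str(code) for code in area_codes]
most_common_code = Counter(labels).most_common(1)[0][0]
print(f"most common area code = {most_common_code}")
```

Counting and finding the mode rely only on whether two values are equal, which is the one comparison that remains meaningful at the nominal level.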

2. The ordinal level of measurement classifies data into categories that can be ranked, but differences between the ranks cannot be determined. Ordinal variables represent observations that can be categorized and rank ordered. For example:
  Letter grades, such as A (superior), B (good), C (average), D (poor), F (fail)
  Size of cars in the street: small, medium, and large
  Scoring in games: 1st, 2nd, 3rd, ...
  Class rank
  Order of finishing a horse race
  How much you prefer various vegetables

The values of ordinal variables can be:
  compared to see if they are equal or not
  compared to see if one is larger or smaller than another

The values of ordinal variables cannot be meaningfully:
  added or subtracted
  multiplied or divided
  averaged (the mean cannot be calculated)

3. The interval level of measurement is like the ordinal level, with the additional property that differences between units of data can be defined, but there is no meaningful zero. Interval variables represent observations that can be categorized, rank ordered, and have a unit of measure. A unit of measure implies that the difference between any two successive values is identical. With an interval-scaled variable, the value 0 does not represent the complete absence of the variable.

The values of interval variables can be:
  compared to see if they are equal or not
  compared to see if one is larger or smaller than another
  added or subtracted

The values of interval variables cannot be meaningfully:
  multiplied or divided (e.g., 60 °F is not twice as hot as 30 °F)

Examples of interval-level variables:
  Temperature in degrees Fahrenheit, which has no natural zero
  Calendar years
  IQ scores
  Shoe size

4. The ratio level of measurement is just like the interval level, except that there exists a natural zero. In addition, true ratios and differences both exist for the same variable. Ratio variables represent observations that can be categorized, rank ordered, have a unit of measure, and have a true zero. The true zero implies that a value of zero represents the complete absence of the variable.

The values of ratio variables can be:
  compared to see if they are equal or not
  compared to see if one is larger or smaller than another
  added or subtracted
  multiplied or divided
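One way to see why division is meaningful at the ratio level but not at the interval level is to change units and check whether a ratio survives. The sketch below is an illustration under stated assumptions: the conversion to kelvins (a temperature scale with a true zero) is not part of the notes, and the weights of 120 lb and 240 lb are illustrative values.

```python
def fahrenheit_to_kelvin(f):
    """Convert degrees Fahrenheit to kelvins (a scale with a true zero)."""
    return (f - 32) * 5 / 9 + 273.15

def pounds_to_kilograms(lb):
    """Convert pounds to kilograms (both scales share the same true zero)."""
    return lb * 0.45359237

# Interval scale: the apparent ratio 60/30 = 2 depends on where zero was placed.
temp_ratio_f = 60 / 30
temp_ratio_k = fahrenheit_to_kelvin(60) / fahrenheit_to_kelvin(30)

# Ratio scale: the ratio is the same in any unit.
weight_ratio_lb = 240 / 120
weight_ratio_kg = pounds_to_kilograms(240) / pounds_to_kilograms(120)

print(f"temperature ratio: {temp_ratio_f:.3f} in °F, {temp_ratio_k:.3f} in K")
print(f"weight ratio:      {weight_ratio_lb:.3f} in lb, {weight_ratio_kg:.3f} in kg")
```

The weight ratio stays at 2 after the unit change, while the temperature ratio does not, which is the sense in which 60 °F is not "twice as hot" as 30 °F.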

Examples of ratio-level variables:
  Weight
  Height
  Age
  Length
  Distance

Most students have trouble differentiating between the interval and ratio levels of measurement. Here is a simple test: if one number is twice the other, is the quantity being measured also twice the other quantity? For example, if you have two weights, 120 lb and 240 lb, it should be clear that 240 lb is twice as heavy as 120 lb, so weight is an example of a ratio level of measurement. However, if you have two temperatures, 30 degrees and 60 degrees, 60 degrees is not twice as hot as 30 degrees, so temperature is an example of an interval level of measurement.

Another test is that at the ratio level of measurement, zero means the absence of the quantity. A weight of 0 lb means that there is no weight, so weight is ratio. At the interval level, such as temperature, 0 degrees Fahrenheit does not mean the absence of heat, which is what temperature measures.

Population: consists of all units (subjects, objects, etc.) that are being studied.
Sample: a subset of the units of a population.
Parameter: a descriptive measure of the population, usually represented by Greek letters.
Statistic: a descriptive measure of a sample, usually represented by Roman letters.

Measure                       Sample (statistic)    Population (parameter)
Mean                          x̄                     µ
Variance                      s²                    σ²
Standard deviation            s                     σ
Correlation coefficient       r                     ρ
Proportion                    p̂                     p
Slope of simple regression    β̂₁                    β₁
Size                          n                     N

[Figure: Summary of Data Classifications]
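The distinction between parameters and statistics in the table above can be made concrete with a small simulation. This is a sketch only: the population of ages, the seed, and the sample size are hypothetical choices made for illustration. It relies on the standard `statistics` module, where `pvariance`/`pstdev` divide by N (population formulas) and `variance`/`stdev` divide by n - 1 (sample formulas).

```python
import random
import statistics

random.seed(1)  # fixed seed so the sketch is repeatable

# Hypothetical population: ages of N = 1000 people.
population = [random.randint(18, 65) for _ in range(1000)]
N = len(population)

# Parameters: computed from every unit in the population.
mu = statistics.mean(population)
sigma2 = statistics.pvariance(population)
sigma = statistics.pstdev(population)

# Statistics: computed from a sample of n units.
n = 50
sample = random.sample(population, n)
x_bar = statistics.mean(sample)
s2 = statistics.variance(sample)
s = statistics.stdev(sample)

print(f"population: N = {N}, mu = {mu:.2f}, sigma^2 = {sigma2:.2f}, sigma = {sigma:.2f}")
print(f"sample:     n = {n}, x_bar = {x_bar:.2f}, s^2 = {s2:.2f}, s = {s:.2f}")
```

In practice the population values are usually unknown, and the sample statistics serve as estimates of the corresponding parameters.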

Example 1: From a sample of students in your statistics class, you collect the following: the student's name, gender, SAT score, age, IQ, birth date (BD), and grade in a freshman-level math class. Classify each variable as qualitative or quantitative.
1. The variable student's name is ______
2. The variable student's gender is ______
3. The variable student's SAT score is ______
4. The variable student's age is ______
5. The variable student's IQ is ______
6. The variable student's BD is ______

Example 2: From a sample of students in your statistics class, you collect the following: the student's name, gender, SAT score, age, IQ, birth date (BD), and grade in a freshman-level math class. Use the levels of measurement (nominal, ordinal, interval, or ratio) to answer the following. Which scale of measurement?
1. The variable student's name is measured on the ______ scale
2. The variable student's gender is measured on the ______ scale
3. The variable student's SAT score is measured on the ______ scale
4. The variable student's age is measured on the ______ scale
5. The variable student's IQ is measured on the ______ scale
6. The variable student's BD is measured on the ______ scale

Example 3: A researcher claims that the average age of women who graduate from Loma Linda Medical School is about 27 years. To test his hypothesis, he randomly selected 200 female doctors who graduated from LLU medical school.
1. Describe the population.
2. Identify the variable of interest.
3. Is the variable quantitative or qualitative?
4. Is the variable discrete or continuous?
5. Identify the type of the variable.
6. Describe the sample.
7. Describe the inference.

Example 4: A researcher in LA County claims that men and women have different attitudes toward abortion. He randomly selected 500 men and 500 women and asked them whether they are anti-abortion.
1. Describe the population.
2. Identify the variable of interest.
3. Is the variable quantitative or qualitative?
4. Is the variable discrete or continuous?
5. Identify the type of the variable.
6. Describe the sample.
7. Describe the inference.

Example 5: Read the following article excerpt and answer the questions below.
A study in California (which also funds abortions for the poor) found that by 1990, among young white women, there was no difference in the rate of breast cancer between rich and poor.
1. Describe the population.
2. Identify the variable of interest.
3. Is the variable quantitative or qualitative?
4. Is the variable discrete or continuous?
5. Identify the type of the variable.
6. Describe the sample.
7. Describe the inference.

Methods of Sampling

There are many methods of sampling, but we will describe five common and basic ones:
a. Convenience sampling
b. Simple random sampling
c. Systematic sampling
d. Stratified sampling
e. Cluster sampling

Convenience sampling attempts to obtain a sample of convenient elements. Often, respondents are selected because they happen to be in the right place at the right time. For example:
  Use of students and members of social organizations
  Mall-intercept interviews without qualifying the respondents
  Department stores using charge account lists
  "People on the street" interviews

Simple Random Sampling (SRS)
Each element in the population has a known and equal probability of selection. Each possible sample of a given size (n) has a known and equal probability of being the sample actually selected. This implies that every element is selected independently of every other element.
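A simple random sample as described above can be drawn with Python's standard `random` module, whose `sample` function selects without replacement so that every subset of size n is equally likely. The sketch below is illustrative only; the function name `simple_random_sample`, the roster of 30 students, and the seed are hypothetical.

```python
import random

def simple_random_sample(population, n, seed=None):
    """Return n distinct elements; every subset of size n is equally likely."""
    rng = random.Random(seed)          # separate generator so the seed is local
    return rng.sample(population, n)   # sampling without replacement

# Hypothetical sampling frame: a class roster of 30 students.
students = [f"student_{i:02d}" for i in range(1, 31)]

chosen = simple_random_sample(students, 5, seed=42)
print(chosen)
```

Passing a seed makes the draw repeatable, which is convenient for checking work; omitting it gives a fresh random sample each time.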

Systematic Sampling
The sample is chosen by selecting a random starting point and then picking every i-th element in succession from the sampling frame. For example, if there are 1000 elements in the population and a sample of 100 is desired, the sampling interval is i = 10.

Stratified Sampling
A two-step process in which the population is first partitioned into subpopulations, or strata. The strata should be mutually exclusive and collectively exhaustive: every population element should be assigned to one and only one stratum, and no population elements should be omitted. Next, elements are selected from each stratum by a random procedure, usually SRS.

A major objective of stratified sampling is to increase precision without increasing cost. The elements within a stratum should be as homogeneous as possible, but the elements in different strata should be as heterogeneous as possible. The stratification variables should also be closely related to the characteristic of interest. Finally, the variables should decrease the cost of the stratification process by being easy to measure and apply.

In proportionate stratified sampling, the size of the sample drawn from each stratum is proportionate to the relative size of that stratum in the total population. In disproportionate stratified sampling, the size of the sample from each stratum is proportionate to the relative size of that stratum and to the standard deviation of the distribution of the characteristic of interest among all the elements in that stratum.
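Systematic sampling and proportionate stratified sampling, as described above, can be sketched as follows. This is a minimal illustration and not a production routine: the function names, the frame of 1000 numbered elements, and the four class-year strata are hypothetical, and the systematic version assumes the frame size is at least the sample size.

```python
import random

def systematic_sample(frame, n, rng):
    """Pick a random start in the first interval, then every k-th element."""
    k = len(frame) // n              # sampling interval
    start = rng.randrange(k)         # random starting point in 0 .. k-1
    return [frame[start + j * k] for j in range(n)]

def proportionate_stratified_sample(strata, n, rng):
    """Draw an SRS from each stratum, sized in proportion to the stratum."""
    total = sum(len(units) for units in strata.values())
    sample = {}
    for name, units in strata.items():
        # Rounding may make the total differ slightly from n in general.
        n_h = round(n * len(units) / total)
        sample[name] = rng.sample(units, n_h)
    return sample

rng = random.Random(0)

# 1000 elements and a desired sample of 100 give a sampling interval of 10.
frame = list(range(1, 1001))
sys_sample = systematic_sample(frame, 100, rng)
print(sys_sample[:5])

# Hypothetical strata: students partitioned by class year.
strata = {
    "freshman": list(range(0, 400)),
    "sophomore": list(range(400, 700)),
    "junior": list(range(700, 900)),
    "senior": list(range(900, 1000)),
}
strat_sample = proportionate_stratified_sample(strata, 50, rng)
print({name: len(units) for name, units in strat_sample.items()})
```

With strata of 400, 300, 200, and 100 units, a total sample of 50 is split 20, 15, 10, and 5. A disproportionate allocation would additionally weight each stratum by the standard deviation of the characteristic of interest, which this sketch does not attempt.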

Cluster Sampling
The target population is first divided into mutually exclusive and collectively exhaustive subpopulations, or clusters. Then a random sample of clusters is selected, based on a probability sampling technique such as SRS. For each selected cluster, either all the elements are included in the sample (one-stage) or a sample of elements is drawn probabilistically (two-stage).

Elements within a cluster should be as heterogeneous as possible, but the clusters themselves should be as homogeneous as possible. Ideally, each cluster should be a small-scale representation of the population.

In probability-proportionate-to-size sampling, the clusters are sampled with probability proportional to size. In the second stage, the probability of selecting a sampling unit in a selected cluster varies inversely with the size of the cluster.
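One-stage and two-stage cluster sampling, as described above, might be sketched like this. The clusters are selected by SRS; the function names, the 40 hypothetical classes of 25 students each, and the seed are assumptions made for illustration, and probability-proportionate-to-size selection is not implemented here.

```python
import random

def one_stage_cluster_sample(clusters, m, rng):
    """Select m clusters by SRS and keep every element of each one."""
    chosen = rng.sample(sorted(clusters), m)
    units = [unit for name in chosen for unit in clusters[name]]
    return chosen, units

def two_stage_cluster_sample(clusters, m, n_per_cluster, rng):
    """Select m clusters by SRS, then an SRS of elements within each."""
    chosen = rng.sample(sorted(clusters), m)
    units = [unit
             for name in chosen
             for unit in rng.sample(clusters[name], n_per_cluster)]
    return chosen, units

rng = random.Random(7)

# Hypothetical population: 40 classes (clusters) of 25 students each.
clusters = {
    f"class_{c:02d}": [f"class_{c:02d}_student_{i:02d}" for i in range(25)]
    for c in range(40)
}

chosen_1, units_1 = one_stage_cluster_sample(clusters, 10, rng)
chosen_2, units_2 = two_stage_cluster_sample(clusters, 10, 5, rng)
print(len(chosen_1), len(units_1))   # clusters selected, students interviewed
print(len(chosen_2), len(units_2))
```

The one-stage version interviews everyone in the selected classes, while the two-stage version interviews only a random subset within each selected class, trading some precision for lower cost.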

Review of Chapter 1

Determine whether the given values are from a discrete or continuous data set.
1. In a sample of 100 Pepsi cans, the average can size was found to be 11.98 oz.
2. In a survey of 1,011 adults, it is found that 450 of them have smoked at least once in their life.
3. In a survey of 3,289 adults, it is found that 45% of them have a garden at their home.
4. The average American drinks 2 cups of coffee per day.

Determine whether the given variables are qualitative or quantitative.
5. Area codes of the phone numbers of students in this class.
6. Social Security numbers of students in this class.
7. Nationalities of the professors teaching in this school.
8. Heights of students in this class.

Determine which of the four levels of measurement is most appropriate: nominal, ordinal, interval, or ratio.
9. Area codes of the phone numbers of students in this class.
10. Social Security numbers of students in this class.
11. Nationalities of the professors teaching in this school.
12. Heights of students in this class.
13. Ratings of good, average, or poor for today's lecture.
14. Current temperature of this classroom.
15. Numbers on the Lakers basketball players' jerseys.
16. The years of students' births.
17. Driver's license numbers.

Identify which of these types of sampling is used: random (SRS), systematic, stratified, cluster, or convenience.
18. A Los Angeles Times reporter gets a reaction to a breaking story by polling people as they pass the front of the Times building.
19. Dr. Ghamsary has randomly selected 5 students in his class.
20. The Orange County Commissioner of Jurors obtains a list of 55,014 car owners and constructs a pool of jurors by selecting every 50th name on the list.
21. In a Harris poll of 1,011 adults, the interview subjects were selected by using a computer to randomly generate telephone numbers that were then called.
22. A Ford Motor Company researcher has partitioned all registered cars into categories of compact, mid-size, and family-size. He is surveying 75 car owners from each category.
23. Motivated by a student who died from binge drinking, Chico State conducts a study of student drinking by randomly selecting 10 different classes and interviewing all of the students in each of those classes.
24. A statistics student obtains height/weight data by interviewing the members of his fraternity.
25. A UCLA researcher surveys all cardiac patients in each of 30 randomly selected hospitals.