Chapter 1: The Nature of Probability and Statistics



Similar documents
Elementary Statistics

SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question.

Concepts of Variables. Levels of Measurement. The Four Levels of Measurement. Nominal Scale. Greg C Elvers, Ph.D.

6. Decide which method of data collection you would use to collect data for the study (observational study, experiment, simulation, or survey):

Descriptive Statistics and Measurement Scales


Statistics Review PSY379

Determine whether the data are qualitative or quantitative. 8) the colors of automobiles on a used car lot Answer: qualitative

DATA COLLECTION AND ANALYSIS

DESCRIPTIVE RESEARCH DESIGNS

Measurement and Measurement Scales

Basic Concepts in Research and Data Analysis

Sampling. COUN 695 Experimental Design

II. DISTRIBUTIONS distribution normal distribution. standard scores

Business Statistics: Intorduction

Foundation of Quantitative Data Analysis

Chapter 2: Frequency Distributions and Graphs

Northumberland Knowledge

(Following Paper ID and Roll No. to be filled in your Answer Book) Roll No. METHODOLOGY

MATH 103/GRACEY PRACTICE QUIZ/CHAPTER 1. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Descriptive Inferential. The First Measured Century. Statistics. Statistics. We will focus on two types of statistical applications

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

A Guide to Understanding and Using Data for Effective Advocacy

Correlational Research. Correlational Research. Stephen E. Brock, Ph.D., NCSP EDS 250. Descriptive Research 1. Correlational Research: Scatter Plots

SAMPLING & INFERENTIAL STATISTICS. Sampling is necessary to make inferences about a population.

WHAT IS A JOURNAL CLUB?

STAT/MATH 3379: Dr. Manage Chapter Assignment Chapter 1: The Nature of Statistics-Solutions

DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.

Lecture 2: Types of Variables

Intro to GIS Winter Data Visualization Part I

STA-201-TE. 5. Measures of relationship: correlation (5%) Correlation coefficient; Pearson r; correlation and causation; proportion of common variance

Math and Science Bridge Program. Session 1 WHAT IS STATISTICS? 2/22/13. Research Paperwork. Agenda. Professional Development Website

Research Methods & Experimental Design

Best Practices in Data Visualizations. Vihao Pham January 29, 2014

Best Practices in Data Visualizations. Vihao Pham 2014

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Chapter 4: Probability and Counting Rules

Guided Reading 9 th Edition. informed consent, protection from harm, deception, confidentiality, and anonymity.

Midterm Review Problems

Statistics. Measurement. Scales of Measurement 7/18/2012

How To Write A Data Analysis

Chapter 1: Data and Statistics GBS221, Class January 28, 2013 Notes Compiled by Nicolas C. Rouse, Instructor, Phoenix College

AP STATISTICS REVIEW (YMS Chapters 1-8)

Introduction to Hypothesis Testing. Hypothesis Testing. Step 1: State the Hypotheses

MAT 155. Chapter 1 Introduction to Statistics. Key Concept. Basics of Collecting Data. 155S1.5_3 Collecting Sample Data.

Measurement. How are variables measured?

AP Stats- Mrs. Daniel Chapter 4 MC Practice

DESCRIPTIVE STATISTICS - CHAPTERS 1 & 2 1

Chapter Seven. Multiple regression An introduction to multiple regression Performing a multiple regression on SPSS

Inclusion and Exclusion Criteria

SOST 201 September 18-20, Measurement of Variables 2

A Correlation of. to the. South Carolina Data Analysis and Probability Standards

MARKETING RESEARCH AND MARKET INTELLIGENCE (MRM711S) FEEDBACK TUTORIAL LETTER SEMESTER `1 OF Dear Student

c. Construct a boxplot for the data. Write a one sentence interpretation of your graph.

Fairfield Public Schools

The Mozart effect Methods of Scientific Research

Correlation key concepts:

Practice#1(chapter1,2) Name

Crime Mapping Methods. Assigning Spatial Locations to Events (Address Matching or Geocoding)

How To Understand The Scientific Theory Of Evolution

1.1 What is Statistics?

Lecture Notes Module 1

Introduction to Statistics and Quantitative Research Methods

MBA 611 STATISTICS AND QUANTITATIVE METHODS

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Statistics 2014 Scoring Guidelines

Exploratory data analysis (Chapter 2) Fall 2011

Normal Distribution Lecture Notes

Introduction to Quantitative Methods

LEVEL ONE MODULE EXAM PART ONE [Clinical Questions Literature Searching Types of Research Levels of Evidence Appraisal Scales Statistic Terminology]

Pre-experimental Designs for Description. Y520 Strategies for Educational Inquiry

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. A) ±1.88 B) ±1.645 C) ±1.96 D) ±2.

IBM SPSS Direct Marketing 23

Premaster Statistics Tutorial 4 Full solutions

IBM SPSS Direct Marketing 22

Sample Size and Power in Clinical Trials

Chapter 8: Hypothesis Testing for One Population Mean, Variance, and Proportion

Descriptive Statistics

Course Syllabus MATH 110 Introduction to Statistics 3 credits

Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization. Learning Goals. GENOME 560, Spring 2012

Self-Check and Review Chapter 1 Sections

Measurement & Data Analysis. On the importance of math & measurement. Steps Involved in Doing Scientific Research. Measurement

Lecture 6 - Data Mining Processes

What is Statistic? OPRE 6301

2. METHODS OF DATA COLLECTION. Types of Data. Some examples from Wainer, Palmer and Bradlow (Chance):

4.1 Exploratory Analysis: Once the data is collected and entered, the first question is: "What do the data look like?"

Introduction; Descriptive & Univariate Statistics

STAT 350 Practice Final Exam Solution (Spring 2015)

CORRELATIONAL ANALYSIS: PEARSON S r Purpose of correlational analysis The purpose of performing a correlational analysis: To discover whether there

Welcome back to EDFR I m Jeff Oescher, and I ll be discussing quantitative research design with you for the next several lessons.

Classify the data as either discrete or continuous. 2) An athlete runs 100 meters in 10.5 seconds. 2) A) Discrete B) Continuous

Projects Involving Statistics (& SPSS)

Levels of measurement in psychological research:

Why Sample? Why not study everyone? Debate about Census vs. sampling

Descriptive Methods Ch. 6 and 7

Section 6.1 Discrete Random variables Probability Distribution

Transcription:

Chapter 1: The Nature of Probability and Statistics Learning Objectives Upon successful completion of Chapter 1, you will have applicable knowledge of the following concepts: Statistics: An Overview and Description Types of Variables Measurement Scales Sampling Methods or Methods to Select Subjects for Samples Experimental Studies I. An Overview and Description A. Statistics is the science of conducting studies to: Collect Organize Summarize Analyze Draw conclusions from data B. Where do you use statistics? Sports Business Research Public Health C. Why do we use statistics? To be able to read and understand statistical studies. To conduct research To become a better consumer and citizen. D. Two branches of statistics: Descriptive statistics (Unit 1) is a collection, organization, summarization, and presentation of data. Inferential statistics (Units 3 & 4) uses data to generalize from a sample to its population, conduct hypothesis tests to determine relationships among variables, and estimate parameters. Dr. Janet Winter, jmw11@psu.edu Stat 200 Page 1

E. Basic Vocabulary Review A variable is a characteristic or attribute of interest that can assume different values. It is the question asked. Data are the values that the variables have assumed (answers to the question). Data set is a collection of data values, where each value is called a data value or datum. The population is all subjects of interest to the study. The sample is a part of the population or a subgroup (subset) of the subjects from the population. A parameter is a numerical summary of all data from the population. (e.g. mean of the populationμ). A statistic is a numerical of the data from the sample (e.g. mean of the sample x ). Symbols for Specific Parameters Type Parameter Statistic Mean μ x Size N n Variance σ 2 s 2 Standard Deviation Proportion P p σ s II. Variables A. Types of Variables I. Qualitative Data can be placed in distinct categories, according to some characteristic or attribute. Note: data is used to find the count and proportion for each category. a) Examples: Hair color Gender TV ownership Do you own a car? Employment status (full time, part time, not employed) Social security number License plate number Computer account number Dr. Janet Winter, jmw11@psu.edu Stat 200 Page 2

II. Quantitative Data are numerical representing counts or measurements that can be ordered and ranked. Note: data values can be used to find the average, standard, deviations, and variance. a) Types of Quantitative Variables: 1) Discrete counts or data with space between its possible values. ex: shoe shize. 2) Continuous measures or data that can assume an infinite number of values between two endpoints. ex: foot length b) Examples of Quantitative Variables: How many TV s do you own (discrete) How many cars does your family own? (discrete) What is your salary for the year? (discrete) What is your weight? (continuous) How far do you live from campus? (continuous) What is your blood pressure? (continuous) B. Exercise with Types of Variables Directions: Identify each of the following variables as Qualitative; Quantitative, discrete; or Quantitative, continuous. Use the chart provided below to write in your answers. Hair color Computer password Shoe size Shirt size (S, M, L) Shirt size (10, 12, 14, etc.) License plate number Zip code Foot length Height Time to drive to campus Qualitative Quantitative, Discrete Quantitative, Continuous *Answer key is located at the end of the document. Dr. Janet Winter, jmw11@psu.edu Stat 200 Page 3

C. Measurement Scales for Variables I. Types of Measurement Scales a) Nominal only classifies data into mutually exclusive (non-overlapping), exhausting categories in which no order or ranking can be imposed on the data. Example: SS# b) Ordinal classifies data into categories that can be ranked; however, precise differences between the ranks do not exist. Example: Shirt size (S, M, L) c) Interval ranks data with precise differences between the data values; however, there is no meaningful zero. Example: Shoe size d) Ratio possesses all the characteristics of interval measurement and a true zero. Example: Foot length Note: This level of measurement is called the ratio level because the zero starting point makes ratio meaningful. II. Levels of Measurement Data Level Summary Example Nominal Ordinal Interval Ratio (Triola & Triola, 2006) Categories only. Data cannot be arranged in an ordering scheme. Categories are ordered, but differences can t be found or are meaningless. Differences are meaningful, but there is no natural starting point, and ratios are meaningless. There is a natural zero starting point and ratios are meaningful. Bear encounter states: 5 New York 20 Idaho 40 Wyoming Bears according to aggressiveness: 5 not aggressive 20 somewhat aggressive 40 highly aggressive Bear den temperatures: 5 F 20 F 40 F Bear migration distances: 5 miles 20 miles 40 miles Categories or names only An order is determined by not, somewhat, highly. 0 F doesn t mean no heat. 40 F is not twice as hot as 20 F. 40 miles is twice as far as 20 miles. Dr. Janet Winter, jmw11@psu.edu Stat 200 Page 4

III. Exercise 01 with Measurement Scales Directions: Identify each of the following variables as Nominal, Ordinal, Interval or Ratio. Use the chart provided below to write in your answers. Zip code Grade Eye color Rating Gender IQ SAT score Height Ranking Temperature (F, C) Weight Time Nominal Ordinal Interval Ratio *Answer key is located at the end of the document. D. Exercise 02 with Measurement Scales Transportation Table Directions: The chart shows the number of job-related injuries for each of the transportation industries for 1998. Refer to this chart to answer the following 5 questions. Industry Number of injuries Railroad 4520 Intercity 5100 Subway 6850 Trucking 7144 Airline 9950 1. What are the variables under study? 2. Classify each of the variables as Quantitative, continuous; Quantitative, discrete; or Qualitative. 3. Identify the level of measurement for each variable. 4. The railroad is shown as the safest transportation industry. Does that mean railroads have fewer accidents than the other industries? 5. What factors other than safety influence a person s choice of transportation? ***Answers are located at the end of this document. Dr. Janet Winter, jmw11@psu.edu Stat 200 Page 5

III. Samples (part of the population, a subgroup or subset of the population) A. Samples Methods (or ways to select subjects or participants) I. Random samples number each subject in the population; select the subjects whose numbers match with numbers from a random number table. II. Systematic samples number each subject in the population; select the subject with every kth number. III. Stratified samples divide the population into subgroups according to some characteristic that is important to the study, then sample randomly from each subgroup. IV. Cluster samples randomly select entire intact groups called a cluster that represents the population. B. Why Use Samples Instead of Populations Saves time and money Experiment can include more detail It is effective C. SRS (Simple Random Sample) Required for most statistical procedures If data is not collected correctly, the study is useless. Every possible sample of size n has the same chance of being selected. D. Ways to Collect Data from Participants I. Surveys the researcher asks questions using a personal interview, telephone interview, or written questions. II. Observational study the researcher observes and draws conclusions based on the observations. III. Experimental study the researcher manipulates one of the variables and determines how the manipulation influences other variables. E. Kinds of Variables in Experimental Studies: Variable is the characteristic of interest or the questions asked. Independent or explanatory variable are manipulated by the researcher. Dependent, outcome, or response variable changes because of the manipulation of the independent variable. Dr. Janet Winter, jmw11@psu.edu Stat 200 Page 6

F. Some Problems with Experimental Studies Hawthorne effect the subjects know they are participating in an experiment and change their behavior in ways that affect the results of the study. Confounding variable a variable that influences the dependent or outcome variable but cannot be separated from the independent variables (e.g., IQ, previous knowledge or experience with dependent variable). Experimenter effect the experimenter unintentionally influences the dependent variable or outcome of the experiment. G. Controlling Effects Single blind- subjects do not know if they are in the experiment or control group. Double blind- neither the participant nor the experimenter know who is in the experimental or control group. H. Errors Sampling error- caused by chance fluctuations; it is the difference between the sample statistic and population parameter Non-sampling error- caused when sample data are incorrectly collected IV.Some Misuses of Statistics (Read the sections in the textbook which describe the misuses of statistics) A. Suspect Samples Very small samples Biased samples Volunteer samples B. Other Issues Ambiguous averages Detached statistics Implied connections Misleading graphics Faculty survey questions Dr. Janet Winter, jmw11@psu.edu Stat 200 Page 7

V. Computers and Calculators Calculators and computers simplify statistical computations and save time. A calculator with statistical functions is required for this class; you will not be able to do the work in this course without it. The TI 83 is strongly recommended and will be the only calculator demonstrated in class. If you choose to use a different calculator, you will be responsible to learn how to use it on your own. VI.Conclusion The applications of statistics are many and varied. You encounter statistics reading newspapers or magazines, listening to the radio, or watching television. Statistics have improved health care, business, social science, and every aspect of life. ANSWER KEYS TO EXERCISES Types of Variables (or Types of Data) *Exercise: Identify each variable as Qualitative or Quantitative Qualitative Quantitative, Discrete Quantitative, Continuous Hair color Shoe size Foot length Computer password Shirt size (10, 12, 14, etc.) Height License plate number Time to drive to campus Shirt size (S, M, L) Zip code Exercise: Measurement Scales **Exercise: Identify each of the following variables as Nominal, Ordinal, Interval or Ratio. Use the chart provided below to write in your answers. Nominal Ordinal Interval Ratio Zip code Grade IQ Height Gender Rating SAT score Time Eye color Ranking Temperature (F, C) Weight Dr. Janet Winter, jmw11@psu.edu Stat 200 Page 8

Exercise: Measurement Scales - Transportation Table ***Exercise: The chart shows the number of job-related injuries for each of the transportation industries for 1998. Refer to this chart to answer the following 5 questions. Industry Number of injuries Railroad 4520 Intercity 5100 Subway 6850 Trucking 7144 Airline 9950 1. What are the variables under study? Answer: Industry and Number of injuries. 2. Classify each of the variables as Quantitative, continuous; Quantitative, discrete; or Qualitative. Answer: Industry is Qualitative. Number of injuries is Quantitative, discrete. 3. Identify the level of measurement for each variable. Answer: Industry is Nominal. Number of injuries is Ratio. Additional questions to consider (these were reflective questions): 4. The railroad is shown as the safest transportation industry. Does that mean railroads have fewer accidents than the other industries? 5. What factors other than safety influence a person s choice of transportation? Works Cited Triola, M.D., Marc M. and Mario F. Triola. Biostatistics for the Biologoical and Health Sciences. New York: Pearson Education, Inc., 2006. Dr. Janet Winter, jmw11@psu.edu Stat 200 Page 9