Descriptive Inferential. The First Measured Century. Statistics. Statistics. We will focus on two types of statistical applications



Similar documents
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question.

Chapter 1: The Nature of Probability and Statistics

Elementary Statistics

MARKETING RESEARCH AND MARKET INTELLIGENCE (MRM711S) FEEDBACK TUTORIAL LETTER SEMESTER `1 OF Dear Student

Sampling. COUN 695 Experimental Design

Introduction Qualitative Data Collection Methods... 7 In depth interviews... 7 Observation methods... 8 Document review... 8 Focus groups...

Survey Research. Classifying surveys on the basis of their scope and their focus gives four categories:

Why Sample? Why not study everyone? Debate about Census vs. sampling

Association Between Variables


6. Decide which method of data collection you would use to collect data for the study (observational study, experiment, simulation, or survey):

Chapter 1: Data and Statistics GBS221, Class January 28, 2013 Notes Compiled by Nicolas C. Rouse, Instructor, Phoenix College

SURVEY DESIGN: GETTING THE RESULTS YOU NEED

EMPOWERING YOURSELF AS A COMMITTEE MEMBER

SOST 201 September 18-20, Measurement of Variables 2

DATA COLLECTION AND ANALYSIS

Module 3: Correlation and Covariance

Descriptive Statistics and Measurement Scales

Basic Concepts in Research and Data Analysis

Chapter 8: Quantitative Sampling

Welcome back to EDFR I m Jeff Oescher, and I ll be discussing quantitative research design with you for the next several lessons.

CALCULATIONS & STATISTICS

IPDET Module 6: Descriptive, Normative, and Impact Evaluation Designs

Guided Reading 9 th Edition. informed consent, protection from harm, deception, confidentiality, and anonymity.

Measurement and Measurement Scales

How do we know what we know?

II. DISTRIBUTIONS distribution normal distribution. standard scores

Observing and describing the behavior of a subject without influencing it in any way.

Self-administered Questionnaire Survey

A QuestionPro Publication

Kaiser Family Foundation/New York Times Survey of Chicago Residents

Psychodynamic Workbook

How to Identify Real Needs WHAT A COMMUNITY NEEDS ASSESSMENT CAN DO FOR YOU:

Northumberland Knowledge

US Public: Keep Las Vegas in Las Vegas

Chapter 5 Conceptualization, Operationalization, and Measurement

Now, observe again the 10 digits we use to represent numbers Notice that not only is each digit different from every other

Evaluation: Designs and Approaches

Inclusion and Exclusion Criteria

Long-Term Care: Frequently Asked Questions About Long- Term Care Insurance

Economic inequality and educational attainment across a generation

Descriptive Statistics

Mind on Statistics. Chapter 4

Newspaper Multiplatform Usage

THE FIELD POLL. By Mark DiCamillo, Director, The Field Poll

Chapter Seven. Multiple regression An introduction to multiple regression Performing a multiple regression on SPSS

Statistics. Measurement. Scales of Measurement 7/18/2012

NATIONAL: TRUMP WIDENS NATIONAL LEAD

Long-Term Care Insurance:

How To Collect Data From A Large Group

Levels of Measurement. 1. Purely by the numbers numerical criteria 2. Theoretical considerations conceptual criteria

Notes - Gruber, Public Finance Chapter 20.3 A calculation that finds the optimal income tax in a simple model: Gruber and Saez (2002).

Determine whether the data are qualitative or quantitative. 8) the colors of automobiles on a used car lot Answer: qualitative

The Mozart effect Methods of Scientific Research

Chapter 2 Quantitative, Qualitative, and Mixed Research

STAT/MATH 3379: Dr. Manage Chapter Assignment Chapter 1: The Nature of Statistics-Solutions

Introduction to Quantitative Methods

Sampling and Sampling Distributions

Cash Flow Exclusive / September 2015

50 Tough Interview Questions

Equal marriage What the government says

PRODUCTION. 1The Surplus

Metrics-Led Sales Training

Post-Debate Overnight Poll Finds Trump Still Leading Pack; Carly Fiorina Winner of Debates

An Introduction to Secondary Data Analysis

THE FIELD POLL. By Mark DiCamillo, Director, The Field Poll

Economic impact of privacy on online behavioral advertising

AP Stats- Mrs. Daniel Chapter 4 MC Practice

Research Design. Recap. Problem Formulation and Approach. Step 3: Specify the Research Design

Specific learning outcomes (Course: Introduction to experimental research)

Attitudes to Mental Illness survey report

Math 251, Review Questions for Test 3 Rough Answers

Chapter Eight: Quantitative Methods

How the Survey was Conducted Nature of the Sample: NBC News/Marist New Hampshire Poll of 1,037 Adults

Research Methods & Experimental Design

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

Chapter 4 - Practice Problems 1

Correlational Research. Correlational Research. Stephen E. Brock, Ph.D., NCSP EDS 250. Descriptive Research 1. Correlational Research: Scatter Plots

MATH 103/GRACEY PRACTICE QUIZ/CHAPTER 1. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Midterm Review Problems

BIG DATA S BIG FAILURE: The struggles businesses face in accessing the information they need

Chapter 9 Assessing Studies Based on Multiple Regression

Fall 2011 National Survey of American Public Opinion on Climate Change Christopher Borick and Barry Rabe

Pre-experimental Designs for Description. Y520 Strategies for Educational Inquiry

SAMPLING DISTRIBUTIONS

see Designing Surveys by Ron Czaja and Johnny Blair for more information on surveys

Georgia Department of Education Common Core Georgia Performance Standards Framework Teacher Edition Coordinate Algebra Unit 4

Market Potential and Sales Forecasting

Market Research Methodology

When a Child Dies. A Survey of Bereaved Parents. Conducted by NFO Research, Inc. on Behalf of. The Compassionate Friends, Inc.

STA-201-TE. 5. Measures of relationship: correlation (5%) Correlation coefficient; Pearson r; correlation and causation; proportion of common variance

Poverty among ethnic groups

Levels of measurement in psychological research:

Transcription:

Introduction: Statistics, Data and Statistical Thinking The First Measured Century FREC 408 Dr. Tom Ilvento 213 Townsend Hall ilvento@udel.edu http://www.udel.edu/frec/ilvento http://www.pbs.org/fmc/index.htm Statistics Statistics (Def 1.1 p24) is the science of data It refers to Collecting data Classifying, summarizing, and organizing data Analysis of data Interpretation of data Statistics Statistics is both a field of study and a set of tools used by many disciplines Social Sciences Biological Sciences Physical Sciences We will focus on two types of statistical applications Descriptive Inferential Descriptive Statistics Descriptive statistics uses summary measures, graphs, and measures of association to show relationships in data. The focus is on describing the data With an emphasis on parsimony 1

Descriptive Statistics Rather than looking at a set of numbers, 0, 0, 2, 2, 3, 3, 3, 4, 5, 2, 1, 3, 2, 2, 1, 1, 3, 1, 1, 2, 5, 7, 8, 10, 12 Descriptive Statistics we want to find summary measures which describe the data adequately and succinctly Be they a Percentage Average Range from highest to lowest mode Descriptive Statistics Descriptive Statistics also involve relationships between variables or sets of variables And they can involve very sophisticated techniques regression, principle components, factor analysis, Logistic Regression, Probit Analysis Inferential Statistics Inferential statistics takes it a step further Now we use some of the same techniques to make estimates, decisions, predictions, or generalizations about a population from a smaller subset or sample Inferential Statistics Inferential statistics are a powerful tool for research It enables us to make statements about a large group from a much smaller sample. We can survey 1,000 people and make statements about 280 million people Did the public care if George W. Bush used cocaine in his 20s? A Time/CNN Poll found: If Bush did use cocaine in his 20s, should that disqualify him from being President? Yes 11% No 84% 100% 80% 60% 40% 20% 0% Yes No 2

Let s look closer at this survey example It was based on a telephone poll of 942 adult Americans taken for Time/CNN on August 19 th by Yankelovich Partners, Inc. The sampling error is ± 3.3% Here s my interpretation The survey is designed to represent adult Americans in August of 1999 Because we are taking a sample, we have some error associated with our estimate. What does this mean? Here s my interpretation Since the sample was taken randomly, we have a method to estimate the error of our estimate In this case, we are reasonably sure that the true percentage is within ± 3.3% points of our estimate Which means our interval is 7.7% to 14.3% We need some terms A Population (Def 1.6 P29) is the total number of units involved in the research question. The units are the members (or elements) of the population. Populations could be: People Animals Plants Courses Objects A POPULATION IS DEFINED BY Purpose of the study The units and elements involved Geographic coverage Time frame Population Example If I was interested in understanding current household consumption of chicken in the Mid-Atlantic states, I might define the population as: All households in in the Mid-Atlantic states (DE, MD, PA, NJ, NY) in the Fall of 2002 3

Does time matter for a population? The DOW over a One-Year Period (October 2002 to Sept. 2003) The Time/CNN poll asked: Should a candidate have to answer questions about whether he used cocaine in the past? June 60% Yes August 48% Yes 80% 60% 40% 20% 0% Should Candidates Answer about Cocaine Use? June August Yes No The DOW over a Five-Year Period Sampling When we collect data on all elements in a population, we take a census However, sometimes it is difficult to get information on the entire population So we take a sample of the population A sample (Def 1.8 P30) is a subset of the units or elements of a population Why Sample? It saves time Money Other resources (computation time) It may actually be impossible to collect information on everyone Every corn stalk in a field Every dog who suffers from heart worm Recent Census Debate Every 10 years we take a census It is mandated in the Constitution However, the Census Bureau knows that it doesn t get a complete count - some groups are difficult to contact So, the Census Bureau wants to take a really good sample to estimate the undercount, and then adjust the counts to reflect the missing people 4

More on Sample Samples are also defined in the terms we used for populations purpose of the research, the units and elements involved, the geographic coverage, and the time frame More on Sampling A valuable property of a sample is that is representative of the population. The sample characteristics resemble those possessed by the population Inferential statistics require a sample to be representative of the population, And that can be done through a random process More Terms A random sample (Def 1.11 p41) is when each element or unit has the same chance of being selected If we select a random sample of 1,000 from a population of 100,000, Each unit has a 1,000/100,000 or 1/100 th chance of being selected More Terms A variable (Def 1.3 p24) is a characteristic of an individual unit of the population To be a variable the characteristics must vary It can t all be the same; otherwise, it s a constant. More Terms Measurement is the process of assigning a number to variables of the individual units Some measurement seems relatively straight-forward years of age, dollars of income, cholesterol counts, parts per million of a chemical Measurement Other concepts are more difficult to measure Attitudes Emotions Intelligence LOVE 5

Measurement The process of measurement is often complex don t take it for granted It always comes with some error And perhaps Bias Measurement With measurement we must also deal with Validity are we measuring what we think we are measuring Reliability is the measuring device consistent Types of Data - the Book Quantitative data (Def 1.4 p24) are measures that are recorded on a naturally occurring scale Qualitative data (Def 1.5 p24) does not follow in natural numerical scale and thus are classified into categories Types of Data I will use a more elaborate description of levels of measurement Nominal Ordinal Continuous Levels of Measurement Nominal (or categorical) no implied order or superiority Men and Women Race Species or genuses Levels of Measurement Ordinal an implied order or rank, but the distance between units is not well specified Ranking Strongly agree to Strongly disagree On a scale from one to ten.. 6

Levels of Measurement Continuous (combination of interval and ratio) data that is measured on a scale where we can say something about the magnitude between numbers Age Income Years of School Why consider our level of measurement? Because our statistical techniques are predicated on certain levels of measurement. Each technique/formula assumes a certain level is used. Misusing a statistical technique on a variable can lead to results that are biased or misleading. Sources of Data Data from a published source also known as existing data. Someone else collected it and makes it available to you Census of Population Current Population Survey Sports statistics Caution data decisions are out of your control Sources of Data A designed Experiment where the researcher has strict control over the units (people, objects and events). Treatment and Control Groups Randomized designs An experimental design allows you to control more factors and to extract more information from the data Sources of Data Surveys are where a researcher samples a group of people, asks a set of questions, and records the answers Face-to-Face Telephone Mail Internet Social Surveys are extremely popular today Sources of Data Observational Studies are when the researcher observes the units in their natural setting and records the variables of interest. Animal studies in natural habitats Studies of children s behaviors Observational Studies must deal with a number of methodological issues 7

Shere Hite Report Example Shere Hite began her work in 1968 on permissive sexual attitudes in the U.S. Her work tended to be controversial, not only for her topics, but because of her methods of collecting and analyzing data A second report was even more controversial in 1988, Women and Love: A Cultural Revolution in Progress Key findings from Hite s 1988 book 84% of woman were not emotionally satisfied with their relationship 95% reported emotional and psychological harassment from their partners 70% of women married for 5 years or more were having extra-martial affairs Only 13% of women married for more than two years were in love. Shere Hite Survey Methodology Her survey was a mail survey: Mailed to 100,000 women in the U.S. over 7 years The mailing list was a combination from a wide variety of organizations which were asked to circulate them to members. The groups tended to over-represent feminist groups and women in troubled circumstances Approximately 4,500 people responded, a 4.5% response rate. Shere Hite Survey Hite s survey used 127 open-ended questions The instructions read: It is not necessary to answer every question! Feel free to skip around and answer those questions that you choose. The questions involved a complex set of issues with sub-questions and follow-ups Statistical Critiques Sample was not random or representative of the population of all women Low response rate reflected a bias towards those most angry or eager to answer the survey Encouraging skipping questions would also lead to bias Open ended questions are often difficult to summarize Critical Thinking and Statistics Statistics involves making critical decisions and rational thought to how a set of data are: Sampled Measured Collected Analyzed Interpreted 8