# Chapter 11 Introduction to Survey Sampling and Analysis Procedures

Save this PDF as:

Size: px
Start display at page:

## Transcription

1 Chapter 11 Introduction to Survey Sampling and Analysis Procedures Chapter Table of Contents OVERVIEW SurveySampling SurveyDataAnalysis DESIGN INFORMATION FOR SURVEY PROCEDURES AN EXAMPLE OF USING THE SURVEY PROCEDURES REFERENCES...156

2 148 Chapter 11. Introduction to Survey Sampling and Analysis Procedures

3 Chapter 11 Introduction to Survey Sampling and Analysis Procedures Overview This chapter introduces the SAS/STAT procedures for survey sampling and describes how you can use these procedures to analyze survey data. Researchers often use sample survey methodology to obtain information about a large population by selecting and measuring a sample from that population. Due to variability among items, researchers apply scientific probability-based designs to select the sample. This reduces the risk of a distorted view of the population and allows statistically valid inferences to be made from the sample. Refer to Cochran (1977), Kalton (1983), and Kish (1965) for more information on statistical sampling. You can use the SURVEYSELECT procedure to select probability-based samples from a study population. Many SAS/STAT procedures, such as the MEANS and GLM procedures, can compute sample means and estimate regression relationships. However, in most of these procedures, statistical inference is based on the assumption that the sample is drawn from an infinite population by simple random sampling. If the sample is actually selected from a finite population using a complex design, these procedures generally do not calculate the estimates and their variances correctly. The SURVEYMEANS and SURVEYREG procedures do properly analyze survey data, taking into account the sample design. These procedures use the Taylor expansion method to estimate sampling errors of estimators based on complex sample designs. The following table briefly describes the sampling and analysis procedures in SAS/STAT software. SURVEYSELECT Design Accommodated stratification clustering replication multistage sampling unequal probabilities of selection

4 150 Chapter 11. Introduction to Survey Sampling and Analysis Procedures Sampling Methods simple random sampling unrestricted random sampling (with replacement) systematic sequential selection probability proportional to size (PPS) with and without replacement PPS systematic PPS for two units per stratum sequential PPS with minimum replacement SURVEYMEANS Design Accommodated Available Statistics stratification clustering unequal weighting population total population mean proportion standard error confident limit t test SURVEYREG Design Accommodated Available Analysis stratification clustering unequal weighting fit linear regression model regression coefficients covariance matrix significance tests estimable functions contrasts The following sections contain brief descriptions of these procedures. Survey Sampling The SURVEYSELECT procedure provides a variety of methods for selecting probability-based random samples. The procedure can select a simple random sample or a sample according to a complex multistage sample design that includes stratification, clustering, and unequal probabilities of selection. With probability sampling, each unit in the survey population has a known, positive probability of selection. This property of probability sampling avoids selection bias and enables you to use statistical theory to make valid inferences from the sample to the survey population.

5 Survey Data Analysis 151 PROC SURVEYSELECT provides methods for both equal probability sampling and sampling with probability proportional to size (PPS). In PPS sampling, a unit s selection probability is proportional to its size measure. PPS sampling is often used in cluster sampling, where you select clusters (groups of sampling units) of varying size in the first stage of selection. Available PPS methods include without replacement, with replacement, systematic, and sequential with minimum replacement. The procedure can apply these methods for stratified and replicated sample designs. See Chapter 63, The SURVEYSELECT Procedure, for more information. Survey Data Analysis The SURVEYMEANS and SURVEYREG procedures perform statistical analysis for survey data. These analytical procedures take into account the design used to select the sample. The sample design can be a complex sample design with stratification, clustering, and unequal weighting. You can use the SURVEYMEANS procedure to compute the following statistics: population total estimate and its standard deviation and corresponding t test population mean estimate and its standard error and corresponding t test proportion estimate for a categorical variable and corresponding t test (1, )% confidence limits for the population total estimates, the population mean estimates, and the proportion estimates data summary information PROC SURVEYREG fits linear models for survey data and computes regression coefficients and their variance-covariance matrix. The procedure also provides significance tests for the model effects and for any specified estimable linear functions of the model parameters. PROC SURVEYMEANS presently does not perform domain analysis (subgroup analysis). However, note that you can produce a domain analysis with PROC SUR- VEYREG (see Example 62.7 on page 3269). This capability will be available in a future release of the SURVEYMEANS procedure. Variance Estimation The SURVEYMEANS and SURVEYREG procedures use the Taylor expansion method to estimate sampling errors of estimators based on complex sample designs. This method obtains a linear approximation for the estimator and then uses the variance estimate for this approximation to estimate the variance of the estimate itself (Woodruff 1971, Fuller 1975). When there are clusters, or primary sampling units (PSUs), in the sample design, the procedures estimate the variance from the variation among the PSUs. When the design is stratified, the procedures pool stratum variance estimates to compute the overall variance estimate. For a multistage sample design, the variance estimation method depends only on the first stage of the sample design. Thus, the required input includes only firststage cluster (PSU) and first-stage stratum identification. You do not need to input

7 Design Information for Survey Procedures 153 Stratification Stratified sampling involves selecting samples independently within strata, which are nonoverlapping subgroups of the survey population. Stratification controls the distribution of the sample size in the strata. It is widely used in practice to meet a variety of survey objectives. For example, with stratification you can ensure adequate sample sizes for subgroups of interest, including small subgroups, or you can use stratification to improve the precision of overall estimates. To improve precision, units within strata should be as homogeneous as possible for the characteristics of interest. Clustering Cluster sampling involves selecting clusters, which are groups of sampling units. For example, clusters may be schools, hospitals, or geographical areas, and sampling units may be students, patients, or citizens. Cluster sampling can provide efficiency in frame construction and other survey operations. However, it can also result in a loss in precision of your estimates, compared to a nonclustered sample of the same size. To minimize this effect, units within clusters should be as heterogeneous as possible for the characteristics of interest. Multistage Sampling In multistage sampling, you select an initial or first-stage sample based on groups of elements in the population, called primary sampling units or PSUs. Then you create a second-stage sample by drawing a subsample from each selected PSU in the first-stage sample. By repeating this operation, you can select a higherstage sample. If you include all the elements from a selected primary sampling unit, then the twostage sampling is a cluster sampling. Sampling Weights Sampling weights, or survey weights, are positive values associated with each unit in your sample. Ideally, the weight of a sampling unit should be the frequency that the sampling unit represents in the target population. Therefore, the sum of the weights over the sample should estimate the population size N. If you normalize the weights such that the sum of the weights over the sample equals the population size N, then the weighted sum of a characteristic y estimates the population total value Y. Often, sampling weights are the reciprocals of the selection probabilities for the sampling units. When you use PROC SURVEYSELECT, the procedure generates the sampling weight component for each stage of the design, and you can multiply these sampling weight components to obtain the final sampling weights. Sometimes, sampling weights also include nonresponse adjustments, post-sampling stratification, or regression adjustments using supplemental information. When the sampling units have unequal weights, you must provide the weights to the survey analysis procedures. If you do not specify sampling weights, the procedures use equal weights in the analysis.

8 154 Chapter 11. Introduction to Survey Sampling and Analysis Procedures Population Totals and Sampling Rates The ratio of the sample size (the number of sampling units in the sample) n and the population size (the total number of sampling units in the target population) N is written as f = n N This ratio is called the sampling rate or the sampling fraction. If you select a sample without replacement, the extra efficiency compared to selecting a sample with replacement can be measured by the finite population correction (fpc) factor, (1, f ). If your analysis should include a finite population correction factor, you can input either the sampling rate or the population total. Otherwise, the procedures do not use the fpc when computing variance estimates. For fairly small sampling fractions, it is appropriate to ignore this correction. Refer to Cochran (1977) and Kish (1965). As stated in the section Variance Estimation on page 151, for a multistage sample design, the variance estimation method depends only on the first stage of the sample design. Therefore, if you are specifying the sampling rate, you should input the firststage sampling rate, which is the ratio of the number of PSUs in the sample to the total number of PSUs in the target population. An Example of Using the Survey Procedures This section demonstrates how you can use the survey procedures to select a probability-based sample, compute descriptive statistics from the sample, perform regression analysis, and make inferences about income and expenditures of a group of households in North Carolina and South Carolina. The goals of the survey are to estimate total income and total basic living expenses investigate the linear relationship between income and living expenses Sample Selection To select a sample with PROC SURVEYSELECT, you input a SAS data set that contains the sampling frame, or list of units from which the sample is to be selected. You also specify the selection method, the desired sample size or sampling rate, and other selection parameters. In this example, the sample design is a stratified simple random sampling design, with households as the sampling units. The sampling frame (the list of the group of the households) is stratified by State and Region. Within strata, households are selected by simple random sampling. Using this design, the following PROC SURVEYS- ELECT statements select a probability sample of households from the HHSample data set.

9 An Example of Using the Survey Procedures 155 proc surveyselect data=hhsample out=sample method=srs n=(3, 5, 3, 6, 2); strata State Region; run; The STRATA statement names the stratification variables State and Region. In the PROC SURVEYSELECT statement, the DATA= option names the SAS data set HHSample as the input data set (the sampling frame) from which to select the sample. The OUT= option stores the sample in the SAS data set named Sample. The METHOD=SRS option specifies simple random sampling as the sample selection method. The N= option specifies the stratum sample sizes. The SURVEYSELECT procedure then selects a stratified random sample of households and produces the output data set Sample, which contains the selected households together with their selection probabilities and sampling weights. The data set Sample also contains the sampling unit identification variable Id and the stratification variables State and Region from the data set HHSample. Survey Data Analysis You can use the SURVEYMEANS and SURVEYREG procedures to estimate population values and to perform regression analyses for survey data. The following example briefly shows the capabilities of each procedure. See Chapter 61, The SUR- VEYMEANS Procedure, and Chapter 62, The SURVEYREG Procedure, for more detailed information. To estimate the total income and expenditure in the population from the sample, you specify the input data set containing the sample, the statistics to be computed, the variables to be analyzed, and any stratification variables. The statements to compute the descriptive statistics are as follows: proc surveymeans data=sample sum clm; var Income Expense; strata State Region; weight Weight; run; The PROC SURVEYMEANS statement invokes the procedure, specifies the input data set, and requests estimates of population totals and their standard deviations for the analysis variables (SUM), and confidence limits for the estimates (CLM). The VAR statement specifies the two analysis variables, Income and Expense. The STRATA statement identifies State and Region as the stratification variables in the sample design. The WEIGHT statement specifies the sampling weight variable Weight. You can also use the SURVEYREG procedure to perform regression analysis for sample survey data. Suppose that, in order to explore the relationship between the total income and the total basic living expenses of a household in the survey population, you choose the following linear model to describe the relationship. Expense = + Income + error

10 156 Chapter 11. Introduction to Survey Sampling and Analysis Procedures The following statements fit this linear model. proc surveyreg data=sample; strata State Region ; model Expense = Income; weight Weight; run; In the PROC SURVEYREG statement, the DATA= option specifies the input sample survey data as Sample. The STRATA statement identifies the stratification variables as State and Region. The MODEL statement specifies the model, with Expense as the dependent variable and Income as the independent variable. The WEIGHT statement specifies the sampling weight variable Weight. References Cochran, W. G. (1977), Sampling Techniques, Third Edition, New York: John Wiley & Sons, Inc. Fuller, W. A. (1975), Regression Analysis for Sample Survey, Sankhya, 37(3), Series C, Hansen, M. H., Hurwitz, W. N., and Madow, W. G. (1953), Sample Survey Methods and Theory, Volumes I and II, New York: John Wiley & Sons, Inc. Kalton, G. (1983), Introduction to Survey Sampling, SAGE University Paper series on Quantitative Applications in the Social Sciences, series no , Beverly Hills and London: SAGE Publications, Inc. Kish, L. (1965), Survey Sampling, New York: John Wiley & Sons, Inc. Lee, E. S., Forthoffer, R. N., and Lorimor, R. J. (1989), Analyzing Complex Survey Data, Sage University Paper series on Quantitative Applications in the Social Sciences, series no , Beverly Hills and London: Sage Publications, Inc. Sarndal, C.E., Swenson, B., and Wretman, J. (1992), Model Assisted Survey Sampling, New York: Springer-Verlag Inc. Wolter, K. M. (1985), Introduction to Variance Estimation, New York: Springer- Verlag Inc. Woodruff, R. S. (1971), A Simple Method for Approximating the Variance of a Complicated Estimate, Journal of the American Statistical Association, 66,

11 The correct bibliographic citation for this manual is as follows: SAS Institute Inc., SAS/STAT User s Guide, Version 8, Cary,NC: SAS Institute Inc., SAS/STAT User s Guide, Version 8 Copyright 1999 by SAS Institute Inc., Cary,NC, USA. ISBN All rights reserved. Produced in the United States of America. No part of this publication may be reproduced, stored in aretrieval system, or transmitted, in any form or by any means, electronic, mechanical, photocopying, or otherwise, without the prior written permission of the publisher,sas Institute Inc. U.S. Government Restricted Rights Notice. Use, duplication, or disclosure of the software and related documentation by the U.S. government is subject to the Agreement with SAS Institute and the restrictions set forth in FAR Commercial Computer Software-Restricted Rights (June 1987). SAS Institute Inc., SAS Campus Drive, Cary,North Carolina st printing, October 1999 SAS and all other SAS Institute Inc. product or service names are registered trademarks or trademarks of SAS Institute Inc. in the USA and other countries. indicates USA registration. Other brand and product names are registered trademarks or trademarks of their respective companies. The Institute is aprivate company devoted to the support and further development of its software and related services.

### New SAS Procedures for Analysis of Sample Survey Data

New SAS Procedures for Analysis of Sample Survey Data Anthony An and Donna Watts, SAS Institute Inc, Cary, NC Abstract Researchers use sample surveys to obtain information on a wide variety of issues Many

### The SURVEYFREQ Procedure in SAS 9.2: Avoiding FREQuent Mistakes When Analyzing Survey Data ABSTRACT INTRODUCTION SURVEY DESIGN 101 WHY STRATIFY?

The SURVEYFREQ Procedure in SAS 9.2: Avoiding FREQuent Mistakes When Analyzing Survey Data Kathryn Martin, Maternal, Child and Adolescent Health Program, California Department of Public Health, ABSTRACT

### IBM SPSS Complex Samples 22

IBM SPSS Complex Samples 22 Note Before using this information and the product it supports, read the information in Notices on page 51. Product Information This edition applies to version 22, release 0,

### INTRODUCTION TO SURVEY DATA ANALYSIS THROUGH STATISTICAL PACKAGES

INTRODUCTION TO SURVEY DATA ANALYSIS THROUGH STATISTICAL PACKAGES Hukum Chandra Indian Agricultural Statistics Research Institute, New Delhi-110012 1. INTRODUCTION A sample survey is a process for collecting

### Survey Data Analysis in Stata

Survey Data Analysis in Stata Jeff Pitblado Associate Director, Statistical Software StataCorp LP Stata Conference DC 2009 J. Pitblado (StataCorp) Survey Data Analysis DC 2009 1 / 44 Outline 1 Types of

### Chapter XXI Sampling error estimation for survey data* Donna Brogan Emory University Atlanta, Georgia United States of America.

Chapter XXI Sampling error estimation for survey data* Donna Brogan Emory University Atlanta, Georgia United States of America Abstract Complex sample survey designs deviate from simple random sampling,

### Sampling Error Estimation in Design-Based Analysis of the PSID Data

Technical Series Paper #11-05 Sampling Error Estimation in Design-Based Analysis of the PSID Data Steven G. Heeringa, Patricia A. Berglund, Azam Khan Survey Research Center, Institute for Social Research

### Accounting for Multi-stage Sample Designs in Complex Sample Variance Estimation

Accounting for Multi-stage Sample Designs in Complex Sample Variance Estimation Brady T. West, Michigan Program in Survey Methodology Nationally representative samples of large populations often have complex

### Survey Analysis: Options for Missing Data

Survey Analysis: Options for Missing Data Paul Gorrell, Social & Scientific Systems, Inc., Silver Spring, MD Abstract A common situation researchers working with survey data face is the analysis of missing

### Chapter 19 Statistical analysis of survey data. Abstract

Chapter 9 Statistical analysis of survey data James R. Chromy Research Triangle Institute Research Triangle Park, North Carolina, USA Savitri Abeyasekera The University of Reading Reading, UK Abstract

### Community Tracking Study. Comparison of Selected Statistical Software Packages for Variance Estimation in the CTS Surveys

Community Tracking Study Comparison of Selected Statistical Software Packages for Variance Estimation in the CTS Surveys Elizabeth Schaefer Frank Potter Stephen Williams Nuria Diaz-Tena James D. Reschovsky

### ANALYTIC AND REPORTING GUIDELINES

ANALYTIC AND REPORTING GUIDELINES The National Health and Nutrition Examination Survey (NHANES) Last Update: December, 2005 Last Correction, September, 2006 National Center for Health Statistics Centers

### Introduction to Sampling. Dr. Safaa R. Amer. Overview. for Non-Statisticians. Part II. Part I. Sample Size. Introduction.

Introduction to Sampling for Non-Statisticians Dr. Safaa R. Amer Overview Part I Part II Introduction Census or Sample Sampling Frame Probability or non-probability sample Sampling with or without replacement

### Comparison of Variance Estimates in a National Health Survey

Comparison of Variance Estimates in a National Health Survey Karen E. Davis 1 and Van L. Parsons 2 1 Agency for Healthcare Research and Quality, 540 Gaither Road, Rockville, MD 20850 2 National Center

### Comparing Alternate Designs For A Multi-Domain Cluster Sample

Comparing Alternate Designs For A Multi-Domain Cluster Sample Pedro J. Saavedra, Mareena McKinley Wright and Joseph P. Riley Mareena McKinley Wright, ORC Macro, 11785 Beltsville Dr., Calverton, MD 20705

### Survey Inference for Subpopulations

American Journal of Epidemiology Vol. 144, No. 1 Printed In U.S.A Survey Inference for Subpopulations Barry I. Graubard 1 and Edward. Korn 2 One frequently analyzes a subset of the data collected in a

### Software for Analysis of YRBS Data

Youth Risk Behavior Surveillance System (YRBSS) Software for Analysis of YRBS Data June 2014 Where can I get more information? Visit www.cdc.gov/yrbss or call 800 CDC INFO (800 232 4636). CONTENTS Overview

### Analysis of Survey Data Using the SAS SURVEY Procedures: A Primer

Analysis of Survey Data Using the SAS SURVEY Procedures: A Primer Patricia A. Berglund, Institute for Social Research - University of Michigan Wisconsin and Illinois SAS User s Group June 25, 2014 1 Overview

### UNIX Operating Environment

97 CHAPTER 14 UNIX Operating Environment Specifying File Attributes for UNIX 97 Determining the SAS Release Used to Create a Member 97 Creating a Transport File on Tape 98 Copying the Transport File from

### IPUMS User Note: Issues Concerning the Calculation of Standard Errors (i.e., variance estimation)using IPUMS Data Products

IPUMS User Note: Issues Concerning the Calculation of Standard Errors (i.e., variance estimation)using IPUMS Data Products by Michael Davern and Jeremy Strief Producing accurate standard errors is essential

### Youth Risk Behavior Survey (YRBS) Software for Analysis of YRBS Data

Youth Risk Behavior Survey (YRBS) Software for Analysis of YRBS Data CONTENTS Overview 1 Background 1 1. SUDAAN 2 1.1. Analysis capabilities 2 1.2. Data requirements 2 1.3. Variance estimation 2 1.4. Survey

### Paper PO06. Randomization in Clinical Trial Studies

Paper PO06 Randomization in Clinical Trial Studies David Shen, WCI, Inc. Zaizai Lu, AstraZeneca Pharmaceuticals ABSTRACT Randomization is of central importance in clinical trials. It prevents selection

### Life Table Analysis using Weighted Survey Data

Life Table Analysis using Weighted Survey Data James G. Booth and Thomas A. Hirschl June 2005 Abstract Formulas for constructing valid pointwise confidence bands for survival distributions, estimated using

### Accounting for complex survey design in modeling usual intake. Kevin W. Dodd, PhD National Cancer Institute

Accounting for complex survey design in modeling usual intake Kevin W. Dodd, PhD National Cancer Institute Slide 1 Hello and welcome to today s webinar, the fourth in the Measurement Error Webinar Series.

### Chapter 25 Specifying Forecasting Models

Chapter 25 Specifying Forecasting Models Chapter Table of Contents SERIES DIAGNOSTICS...1281 MODELS TO FIT WINDOW...1283 AUTOMATIC MODEL SELECTION...1285 SMOOTHING MODEL SPECIFICATION WINDOW...1287 ARIMA

### Using Repeated Measures Techniques To Analyze Cluster-correlated Survey Responses

Using Repeated Measures Techniques To Analyze Cluster-correlated Survey Responses G. Gordon Brown, Celia R. Eicheldinger, and James R. Chromy RTI International, Research Triangle Park, NC 27709 Abstract

### For Technical Assistance with HCUP Products: Phone: HCUP

HCUP Methods Series Contact Information: Healthcare Cost and Utilization Project (HCUP) Agency for Healthcare Research and Quality 5600 Fishers Lane Room 07W17B Mail Stop 7W25B Rockville, MD 20857 http://www.hcup-us.ahrq.gov

### DBF Chapter. Note to UNIX and OS/390 Users. Import/Export Facility CHAPTER 7

97 CHAPTER 7 DBF Chapter Note to UNIX and OS/390 Users 97 Import/Export Facility 97 Understanding DBF Essentials 98 DBF Files 98 DBF File Naming Conventions 99 DBF File Data Types 99 ACCESS Procedure Data

### Sampling techniques and weighting procedures for complex survey designs. The school cohorts of the National Educational Panel Study (NEPS)

Sampling techniques and weighting procedures for complex survey designs The school cohorts of the National Educational Panel Study (NEPS) Dissertation Presented to the Faculty for Social Sciences, Economics,

### ESTIMATION OF THE EFFECTIVE DEGREES OF FREEDOM IN T-TYPE TESTS FOR COMPLEX DATA

m ESTIMATION OF THE EFFECTIVE DEGREES OF FREEDOM IN T-TYPE TESTS FOR COMPLEX DATA Jiahe Qian, Educational Testing Service Rosedale Road, MS 02-T, Princeton, NJ 08541 Key Words" Complex sampling, NAEP data,

### Chapter 63 The SURVEYSELECT Procedure

Chapter 63 The SURVEYSELECT Procedure Chapter Table of Contents OVERVIEW...3275 GETTING STARTED...3276 Simple Random Sampling...3277 StratifiedSampling...3279 Stratified Sampling with Control Sorting...3282

### Robert L. Santos, Temple University

n ONE APPROACH TO OVERSAMPLING BLACKS AND HISPANICS: THE NATIONAL ALCOHOL SURVEY Robert L. Santos, Temple University I. Introduction The 1984 National Alcohol Survey (NAS) is the first national household

### Variance Estimation Guidance, NHIS 2006-2014 (Adapted from the 2006-2014 NHIS Survey Description Documents) Introduction

June 9, 2015 Variance Estimation Guidance, NHIS 2006-2014 (Adapted from the 2006-2014 NHIS Survey Description Documents) Introduction The data collected in the NHIS are obtained through a complex, multistage

### The construction and use of sample design variables in EU SILC. A user s perspective

The construction and use of sample design variables in EU SILC. A user s perspective Report prepared for Eurostat Abstract: Currently, EU SILC is the single most important data source on income and living

### Statistical and Methodological Issues in the Analysis of Complex Sample Survey Data: Practical Guidance for Trauma Researchers

Journal of Traumatic Stress, Vol. 2, No., October 2008, pp. 440 447 ( C 2008) Statistical and Methodological Issues in the Analysis of Complex Sample Survey Data: Practical Guidance for Trauma Researchers

### Approaches for Analyzing Survey Data: a Discussion

Approaches for Analyzing Survey Data: a Discussion David Binder 1, Georgia Roberts 1 Statistics Canada 1 Abstract In recent years, an increasing number of researchers have been able to access survey microdata

### Survey Data Analysis in Stata

Survey Data Analysis in Stata Jeff Pitblado Associate Director, Statistical Software StataCorp LP 2009 Canadian Stata Users Group Meeting Outline 1 Types of data 2 2 Survey data characteristics 4 2.1 Single

### Sampling Designs. 1. Simple random sampling (SRS) Steps: The method is not very different from winning a lottery.

Sampling Designs 1. Simple random sampling (SRS) Steps: (1) Assign a single number to each element in the sampling frame. (2) Use random numbers to select elements into the sample until the desired number

### SAMPLING METHODS. Chapter 5

SAMPLING METHODS Chapter 5 1 LEARNING OBJECTIVES Reasons for sampling Different sampling methods Probability & non probability sampling Advantages & disadvantages of each sampling method 2 SAMPLING A sample

### Sampling in Marketing Research

Sampling in Marketing Research 1 Basics of sampling I A sample is a part of a whole to show what the rest is like. Sampling helps to determine the corresponding value of the population and plays a vital

### 1.1 What is statistics? Data Collection. Important Definitions. What is data? Descriptive Statistics. Inferential Statistics

1.1 What is statistics? Data Collection Chapter 1 A science (bet you thought it was a math) of Collecting of data (Chap. 1) Organizing data (Chap. 2) Summarizing data (Chap. 2, 3) Analyzing data (Chap.

### Elementary Statistics

Elementary Statistics Chapter 1 Dr. Ghamsary Page 1 Elementary Statistics M. Ghamsary, Ph.D. Chap 01 1 Elementary Statistics Chapter 1 Dr. Ghamsary Page 2 Statistics: Statistics is the science of collecting,

### Statistics 522: Sampling and Survey Techniques. Topic 5. Consider sampling children in an elementary school.

Topic Overview Statistics 522: Sampling and Survey Techniques This topic will cover One-stage Cluster Sampling Two-stage Cluster Sampling Systematic Sampling Topic 5 Cluster Sampling: Basics Consider sampling

### National Endowment for the Arts. A Technical Research Manual

2012 SPPA PUBLIC-USE DATA FILE USER S GUIDE A Technical Research Manual Prepared by Timothy Triplett Statistical Methods Group Urban Institute September 2013 Table of Contents Introduction... 3 Section

### Workshop on Using the National Survey of Children s s Health Dataset: Practical Applications

Workshop on Using the National Survey of Children s s Health Dataset: Practical Applications Julian Luke Stephen Blumberg Centers for Disease Control and Prevention National Center for Health Statistics

### 1) Overview 2) Sample or Census 3) The Sampling Design Process i. Define the Target Population ii. Determine the Sampling Frame iii.

1) Overview 2) Sample or Census 3) The Sampling Design Process i. Define the Target Population ii. Determine the Sampling Frame iii. Select a Sampling Technique iv. Determine the Sample Size v. Execute

Chapter 8 Hypothesis Tests Chapter Table of Contents Introduction...157 One-Sample t-test...158 Paired t-test...164 Two-Sample Test for Proportions...169 Two-Sample Test for Variances...172 Discussion

### Random Effects Models for Longitudinal Survey Data

Analysis of Survey Data. Edited by R. L. Chambers and C. J. Skinner Copyright 2003 John Wiley & Sons, Ltd. ISBN: 0-471-89987-9 CHAPTER 14 Random Effects Models for Longitudinal Survey Data C. J. Skinner

### Survey Sampling. Know How No 9 guidance for research and evaluation in Fife. What this is about? Who is it for? What do you need to know?

guidance for research and evaluation in Fife What this is about? Sampling allows you to draw conclusions about a particular population by examining a part of it. When carrying out a survey, it is not usually

### Measuring change with the Belgian Survey on Income and Living Conditions (SILC): taking account of the sampling variance

Measuring change with the Belgian Survey on Income and Living Conditions (SILC): taking account of the sampling variance Report Financed by FPS Social Security Belgium (Ref.: 2013/BOEP/UA-EU2020) December

### Annex 5 GUIDELINES FOR SAMPLING AND SURVEYS FOR CDM PROJECT ACTIVITIES AND PROGRAMME OF ACTIVITIES. (Version 02.0) CONTENTS

Page 1 GUIDELINES FOR SAMPLING AND SURVEYS FOR CDM PROJECT ACTIVITIES AND PROGRAMME OF ACTIVITIES (Version 0.0) CONTENTS Paragraphs Page I. Introduction... 1 4 II. Common Types of Sampling Approaches...

### Guide to Operating SAS IT Resource Management 3.5 without a Middle Tier

Guide to Operating SAS IT Resource Management 3.5 without a Middle Tier SAS Documentation The correct bibliographic citation for this manual is as follows: SAS Institute Inc. 2014. Guide to Operating SAS

### Introduction to SUDAAN for survey data

Center for Studies in Demography & Ecology Anita Rocha alrocha@u.washington.edu David Chae Introduction to SUDAAN for survey data This course and other resources This is a course on how to begin using

### Northumberland Knowledge

Northumberland Knowledge Know Guide How to Analyse Data - November 2012 - This page has been left blank 2 About this guide The Know Guides are a suite of documents that provide useful information about

### Annex 6 BEST PRACTICE EXAMPLES FOCUSING ON SAMPLE SIZE AND RELIABILITY CALCULATIONS AND SAMPLING FOR VALIDATION/VERIFICATION. (Version 01.

Page 1 BEST PRACTICE EXAMPLES FOCUSING ON SAMPLE SIZE AND RELIABILITY CALCULATIONS AND SAMPLING FOR VALIDATION/VERIFICATION (Version 01.1) I. Introduction 1. The clean development mechanism (CDM) Executive

### Sampling. by David A. Freedman Department of Statistics University of California Berkeley, CA 94720

Sampling by David A. Freedman Department of Statistics University of California Berkeley, CA 94720 The basic idea in sampling is extrapolation from the part to the whole from the sample to the population.

### Designing a Sampling Method for a Survey of Iowa High School Seniors

Designing a Sampling Method for a Survey of Iowa High School Seniors Kyle A. Hewitt and Michael D. Larsen, Iowa State University Department of Statistics, Snedecor Hall, Ames, Iowa 50011-1210, larsen@iastate.edu

### Analyzing the Server Log

87 CHAPTER 7 Analyzing the Server Log Audience 87 Introduction 87 Starting the Server Log 88 Using the Server Log Analysis Tools 88 Customizing the Programs 89 Executing the Driver Program 89 About the

### SAS Software to Fit the Generalized Linear Model

SAS Software to Fit the Generalized Linear Model Gordon Johnston, SAS Institute Inc., Cary, NC Abstract In recent years, the class of generalized linear models has gained popularity as a statistical modeling

### TYPES OF SAMPLING. 2. Types of Non-probability Sample: There are the following four types of nonprobability PROBABILITY SAMPLING

TYPES OF SAMPLING 1. Types or Techniques Probability Sampling: There are a number of techniques of taking Probability sample. But here only six important techniques have been discussed as follows: 1. Simple

### Types of Probability Samples

Types of Probability Samples Simple Random Systematic Random Stratified Random Random Cluster Stratified Cluster Complex Multi-stage Random (various kinds) Simple Random Sampling Each element in the population

### OS/2: TELNET Access Method

259 CHAPTER 18 OS/2: TELNET Access Method SAS Support for TELNET on OS/2 259 SAS/CONNECT 259 System and Software Requirements for SAS/CONNECT 259 Local Host Tasks 260 Configuring Local and Remote Host

Chapter 12 Sample Size and Power Calculations Chapter Table of Contents Introduction...253 Hypothesis Testing...255 Confidence Intervals...260 Equivalence Tests...264 One-Way ANOVA...269 Power Computation

### Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

Business Course Text Bowerman, Bruce L., Richard T. O'Connell, J. B. Orris, and Dawn C. Porter. Essentials of Business, 2nd edition, McGraw-Hill/Irwin, 2008, ISBN: 978-0-07-331988-9. Required Computing

### Chapter 8: Quantitative Sampling

Chapter 8: Quantitative Sampling I. Introduction to Sampling a. The primary goal of sampling is to get a representative sample, or a small collection of units or cases from a much larger collection or

Chapter 32 Histograms and Bar Charts Chapter Table of Contents VARIABLES...470 METHOD...471 OUTPUT...472 REFERENCES...474 467 Part 3. Introduction 468 Chapter 32 Histograms and Bar Charts Bar charts are

### Sample design for educational survey research

Quantitative research methods in educational planning Series editor: Kenneth N.Ross Module Kenneth N. Ross 3 Sample design for educational survey research UNESCO International Institute for Educational

### Supporting Online Material for Achard (RE 1070656) scheduled for 8/9/02 issue of Science

Supporting Online Material for Achard (RE 1070656) scheduled for 8/9/02 issue of Science Materials and Methods Overview Forest cover change is calculated using a sample of 102 observations distributed

### The Science and Art of Market Segmentation Using PROC FASTCLUS Mark E. Thompson, Forefront Economics Inc, Beaverton, Oregon

The Science and Art of Market Segmentation Using PROC FASTCLUS Mark E. Thompson, Forefront Economics Inc, Beaverton, Oregon ABSTRACT Effective business development strategies often begin with market segmentation,

### Why Sample? Why not study everyone? Debate about Census vs. sampling

Sampling Why Sample? Why not study everyone? Debate about Census vs. sampling Problems in Sampling? What problems do you know about? What issues are you aware of? What questions do you have? Key Sampling

### Permuted-block randomization with varying block sizes using SAS Proc Plan Lei Li, RTI International, RTP, North Carolina

Paper PO-21 Permuted-block randomization with varying block sizes using SAS Proc Plan Lei Li, RTI International, RTP, North Carolina ABSTRACT Permuted-block randomization with varying block sizes using

### Here alx is an estimate of the incompleteness of

SURVEYS WITH OVERLAPPING FRAMES - PROBLEMS IN APPLICATION Introduction Frederic A. Vogel, Statistical Reporting Service, U. S. Dept. of Agriculture A multiple frame survey may be defined as one relying

### Supporting Statement Part B. Collections of Information Employing Statistical Methods

Supporting Statement Part B Collections of Information Employing Statistical Methods Overview This field test will use a probability sample of each Program s eligible participants. Because the purpose

### Statistical & Technical Team

Statistical & Technical Team A Practical Guide to Sampling This guide is brought to you by the Statistical and Technical Team, who form part of the VFM Development Team. They are responsible for advice

### Weighting European Social Survey Data

Weighting European Social Survey Data 25th April 2014 http://www.europeansocialsurvey.org/ Contents II 1 Do analyses conducted with ESS data need to be weighted? 1 2 What weights are there to apply? 1

### Sampling: What is it? Quantitative Research Methods ENGL 5377 Spring 2007

Sampling: What is it? Quantitative Research Methods ENGL 5377 Spring 2007 Bobbie Latham March 8, 2007 Introduction In any research conducted, people, places, and things are studied. The opportunity to

### MISSING DATA TECHNIQUES WITH SAS. IDRE Statistical Consulting Group

MISSING DATA TECHNIQUES WITH SAS IDRE Statistical Consulting Group ROAD MAP FOR TODAY To discuss: 1. Commonly used techniques for handling missing data, focusing on multiple imputation 2. Issues that could

### Table 1" Cancer Analysis No Yes Diff. S EE p-value OLS 62.8 61.3 1.5 0.6 0.013. Design- Based 63.6 62.7 0.9 0.9 0.29

Epidemiologic Studies Utilizing Surveys: Accounting for the Sampling Design Edward L. Korn, Barry I. Graubard Edward L. Korn, Biometric Research Branch, National Cancer Institute, EPN-739, Bethesda, MD

### Organizing Your Approach to a Data Analysis

Biost/Stat 578 B: Data Analysis Emerson, September 29, 2003 Handout #1 Organizing Your Approach to a Data Analysis The general theme should be to maximize thinking about the data analysis and to minimize

### Visualization of Complex Survey Data: Regression Diagnostics

Visualization of Complex Survey Data: Regression Diagnostics Susan Hinkins 1, Edward Mulrow, Fritz Scheuren 3 1 NORC at the University of Chicago, 11 South 5th Ave, Bozeman MT 59715 NORC at the University

### Basic Research Progress

Mgt 540 Research Methods Sampling Issues 1 Basic Research Progress Explorative - Descriptive (Qualitative) 1. Framework / Domain extant knowledge for reference 2. Research Design 3. Data collection / presentation

### Cluster Sampling: Single stage cluster sampling

Chapter 6 Cluster Sampling: Single stage cluster sampling 6.1 Introduction Element sampling designs discussed in Chapter 3 and Chapter 4 are not always feasible when there is no sampling frame for the

### Statistical methods for the comparison of dietary intake

Appendix Y Statistical methods for the comparison of dietary intake Jianhua Wu, Petros Gousias, Nida Ziauddeen, Sonja Nicholson and Ivonne Solis- Trapala Y.1 Introduction This appendix provides an outline

### James E. Bartlett, II is Assistant Professor, Department of Business Education and Office Administration, Ball State University, Muncie, Indiana.

Organizational Research: Determining Appropriate Sample Size in Survey Research James E. Bartlett, II Joe W. Kotrlik Chadwick C. Higgins The determination of sample size is a common task for many organizational

### Need for Sampling. Very large populations Destructive testing Continuous production process

Chapter 4 Sampling and Estimation Need for Sampling Very large populations Destructive testing Continuous production process The objective of sampling is to draw a valid inference about a population. 4-

### Course Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics

Course Text Business Statistics Lind, Douglas A., Marchal, William A. and Samuel A. Wathen. Basic Statistics for Business and Economics, 7th edition, McGraw-Hill/Irwin, 2010, ISBN: 9780077384470 [This

### Session 2.4: Clustering and Single Stage Cluster Sampling

Second Regional Training Course on Sampling Methods for Producing Core Data Items for Agricultural and Rural Statistics Module 2: Review of Basics of Sampling Methods Session 2.4: Clustering and Single

### Analysis of Complex Survey Data

Paper Number ST-01 Analysis of Complex Survey Data Varma Nadimpalli, Westat, Rockville, Maryland ABSTRACT Analysis of dose-response relationships has long been the main focus of clinical and epidemiological

### National Longitudinal Study of Adolescent Health. Strategies to Perform a Design-Based Analysis Using the Add Health Data

National Longitudinal Study of Adolescent Health Strategies to Perform a Design-Based Analysis Using the Add Health Data Kim Chantala Joyce Tabor Carolina Population Center University of North Carolina

### When to Move a SAS File between Hosts

3 CHAPTER Moving and Accessing SAS Files between Hosts When to Move a SAS File between Hosts 3 When to Access a SAS File on a Remote Host 3 Host Types Supported According to SAS Release 4 Avoiding and

### Sampling ecommerce Data from the Web: Methodological and Practical Issues

Sampling ecommerce Data from the Web: Methodological and Practical Issues Galit Shmueli 1, Wolfgang Jank 1 & Ravi Bapna 2 1 Dept. of Decision & Information Technologies R. H. Smith School of Business University

### 6 Regression With Survey Data From Complex Samples

6 Regression With Survey Data From Complex Samples Secondary analysis of data from large national surveys figures prominently in social science and public health research, and these surveys use complex

### A composite estimator for stratified two stage cluster sampling

Communications for Statistical Applications and Methods 2016, Vol. 23, No. 1, 47 http://dx.doi.org/.31/csam.2016.23.1.047 Print ISSN 2287-7843 / Online ISSN 2383-477 A composite estimator for stratified

### Double Sampling: What is it?

FOR 474: Forest Inventory Techniques Double Sampling What is it? Why is it Used? Double Sampling: What is it? In many cases in forestry it is too expensive or difficult to measure what you want (e.g. total