GUIDANCE ON HOW TO ANALYZE DATA FROM A SCHOOL-BASED ORAL HEALTH SURVEY JULY 2013
|
|
|
- Britton Cameron
- 10 years ago
- Views:
Transcription
1 GUIDANCE ON HOW TO ANALYZE DATA FROM A SCHOOL-BASED ORAL HEALTH SURVEY JULY 2013 Due to the technical nature of this topic, this information will be most helpful to data analysts, epidemiologists and statisticians. Has your state, territory or local health agency conducted a school-based oral health survey? If yes, then you probably have questions about how to appropriately weight and analyze the data to best represent the target population of your survey. The purpose of this document is to provide a basic framework for how to appropriately analyze data from a statewide school-based oral health survey that employed a complex sampling design. Because no one method is appropriate for all states/territories, we encourage you to read this document, and then contact ASTDD for additional guidance on analyzing data for your state/territory. Although this document is geared towards states and territories, the techniques are appropriate for other jurisdictions such as counties. This topic is important because most oral health surveys employ a complex sampling design that may include stratification, unequal selection probabilities and clustering. To obtain valid point estimates, standard errors, and confidence intervals, analysis must account for the sampling design. Simply doing a weighted analysis using statistical procedures like SAS Proc Freq is not appropriate because the variance estimation in such programs use formulas appropriate for simple random sampling rather than complex sampling. These formulas do not account for stratification or clustering and may result in biased point estimates of population parameters (in an unweighted analysis) and/or underestimation of standard errors and confidence intervals for point estimates. This document is limited to a discussion of data weighting and analysis. For additional information on how to conduct and use data from a school-based oral health survey, please refer to the Basic Screening Survey (BSS) tools developed by the Association of State and Territorial Dental Directors (ASTDD). These tools are available at the following website: Do you want to submit the data to the National Oral Health Surveillance System (NOHSS)? NOHSS ( is a collaborative effort between CDC's Division of Oral Health and ASTDD. NOHSS is designed to monitor the burden of oral disease, use of the oral health care delivery system, and the status of community water fluoridation on both a national and state level. NOHSS tracks oral health surveillance indicators based on data sources and surveillance capacity available to most states. The Council of State and Territorial Epidemiologists (CSTE) and the Chronic Disease Directors (CDD) were instrumental in developing the framework for chronic disease surveillance indicators, including the oral health indicators in NOHSS. If you follow the guidance provided in this document and ASTDD s sampling guidance document, your oral health data will meet the specifications for inclusion in NOHSS. Because no one method is appropriate for all states/territories, we encourage you to read this document, and then contact ASTDD for additional guidance on analyzing data for your state/territory. If you follow the guidance provided in this document and ASTDD s sampling guidance document, your oral health data will meet the specifications for inclusion in NOHSS. Only oral health survey data that meet the following specifications are included in the NOHSS data system: The data are from a statewide probability sample of elementary schools. 1
2 If a complex sampling scheme is used, the data must be weighted for the sampling scheme. The NOHSS Advisory Committee strongly suggests that, at minimum, 3rd grade children be screened. Grades K-2 as well as Head Start may also be screened and are included in the NOHSS website. While it is not mandatory that data be adjusted for non-response, the NOHSS Advisory Committee strongly encourages states to adjust for non-response whenever possible. Both unadjusted and adjusted data may be submitted. If you follow the guidance in this document your estimates will be adjusted for both the sampling scheme and non-response. NOTE: In some cases a state may be unable to follow this guidance. For example, because of small school size and confidentiality issues, an IRB may require that school identifiers not be included in the dataset. If you encounter such issues, please contact ASTDD for additional guidance. Once our oral health data has been collected, what steps do I need to take to prepare the data for analysis? There are several steps you should take to prepare for the analysis phase of your survey. Carefully review each step and decide if it is appropriate for your situation. Step 1: Enter the data into an electronic file that can be exported to an appropriate statistical package. Good options for data entry systems include Epi Info ( and Microsoft Access. To minimize or eliminate data entry errors it is important to have a very smart data entry system that can make a variety of checks on the data while it is being entered. Typically a useful data entry system checks each field for valid values, inconsistencies in data across fields, skip patterns, etc. A Steps to Take to Prepare for Analysis Enter the data Clean the data Determine number screened at each school Determine number in sampling interval Calculate weight factor good data entry system, just like a good form, should be designed to be self-explanatory and easy to use. Examples of data entry systems using Epi Info and Access are available from ASTDD. Step 2: Clean the data file. If you used a smart data entry system, there should be very few data entry errors. Make sure that each record includes the appropriate school code. A school code is necessary for calculating the weight factor that will be used in the analysis. For additional information on data cleaning and preemptive data cleaning techniques, refer to the following brief: www2.sas.com/proceedings/sugi26/p pdf. Once you have selected the data entry system and statistical software package to use, it may be helpful to read briefs or reference books specific to that system or package. Step 3: Determine how many children were screened at each school. This can be accomplished by generating a frequency distribution for school codes. The number of children screened at each school will be used as the denominator in the weight factor calculation. Step 4: Go back to the file you used to select the sample and determine how many children were enrolled in each sampling interval. Link the sampling interval information to the participating school codes. The number of children in the sampling interval will be partially dependent on the type of sampling strategy you used. If you used a probability proportional to size (PPS) sampling strategy, the number of children in the sampling interval will be the same for each sampling interval in a given strata. Refer to Example 1 and Table 1 (page 10). If you used a non-pps sampling strategy, the number of children in the sampling interval will generally be different for each sampling interval. Refer to Example 2 and Table 2 (page 12). Refer to the school survey sampling guidance developed by ASTDD for additional information. The sampling guidance is available at the following site: 2
3 Step 5: Calculate the weight factor using the following formula. Each child in a particular school and grade will have the same weight factor. Weight = (# of children in sampling interval) / (# of children screened in sampling interval) o This formula reflects the reduction of the overall probability calculation of: (# enrolled in school/# in sampling interval)* (# children invited to participate /# enrolled in school) * (# screened/# children invited to participate) = # of children screened in sampling interval/# of children in sampling interval o Note that number enrolled in school cancels out in the first and second terms and number of children invited to participate cancels out in the second and third terms o The analysis weight is the inverse of the reduced probability term: (# of children in sampling interval) / (# of children screened in sampling interval) NOTE: The number of children in the sampling interval is based on the sampling frame used for selecting the sample which generally will be from the school year prior to the data collection year. These numbers would be expected to be very close to current numbers. What statistical software package and program code should I use? Analysis of data from surveys that employ a complex sampling design, such as a school-based oral health survey, must account for the sampling design. Several statistical software packages are either (1) designed specifically to analyze complex sample survey data or (2) have special procedures or modules to correctly analyze complex sample survey data including SUDAAN, SAS, STATA, SPSS, Epi Info and R. All of these packages are appropriate for the analysis of school-based oral health survey data; your decision for which package to use will probably be based on availability, familiarity or cost. Both Epi Info and R are available at no cost to the user. To help you with the analysis process, we have created sample program code for each of the packages listed and have compared results from each package based on a sample data set from a recent state oral health survey of kindergarten and 3 rd grade children. Information about each statistical software package, except R, was excerpted from Software for Analysis of YRBS Data (CDC 2012). Definition of variables used in the sample program code: Grade K=kindergarten, 3=third grade Race 1=white, 2=black Cluster a unique number for each school, primary sampling unit (PSU) Strata geographic region of the state, stratification variable used in selecting sample Weight analysis weight factor (# children in sampling interval / # children screened in interval) Untreated does the child have untreated decay (0=No, 1=Yes) Treated does the child have treated decay (0=No, 1=Yes) Experience does the child have treated and/or untreated decay (0=No, 1=Yes). You will need to create the variable Experience from Treated and Untreated o If Untreated is missing and Treated is missing then Experience should be coded missing o If Untreated=0 and Treated=0 then Experience=0 o If Untreated=1 or Treated=1 then Experience=1 Sealants does the child have dental sealants (0=No, 1=Yes) General items that deserve caution: Missing data: Each software package uses its own special coding for missing data, for example. in SAS and N/A in R. These special codes for missing data translate to numeric values in calculations, sometimes very small and sometimes very large. Take care in recoding or creating new variables to be sure that missing data are categorized as you intended. Subpopulation analyses, By statement dropping observations from data set: With complex sample data, to get estimates for a subpopulation, such as male or Hispanic children, the statistical software 3
4 requires information about the sampling design, the strata, and primary sampling units (PSUs). Dropping observations from the data set for children who are not in the subpopulation can result in loss of information on some strata and PSUs, resulting in estimates that do not account for the correct number of strata and PSUs. Using a By statement to get estimates for males and females in SAS, for example, is equivalent to doing the analysis once dropping all of the females, and then again dropping all of the males. The By statement is NOT the recommended way to get estimates for subpopulations for many of the software packages. Sudaan version 11 is an exception. Check the documentation for your preferred statistical software to be sure you are using the correct syntax for proper subpopulation analysis. SAS sample code: SAS versions 8 and higher include special sample survey procedures that are appropriate for analyzing complex survey data. These sample survey procedures use SAS syntax that will be familiar to those who already use SAS. There are three sample design statements in SAS: CLUSTER, where the name of the primary sampling unit (PSU) is placed; STRATA, where the name of the stratification variable is placed; and WEIGHT, where the name of the analysis weight variable is placed. Variables may be numeric or character. The input data file does not need to be sorted by stratum and/or PSU variables before analysis. Univariate analysis (data not presented): PROC SURVEYFREQ ; STRATA strata ; CLUSTER cluster ; WEIGHT weight ; TABLES untreated experience sealants / cl ; RUN ; Table 3: PROC SURVEYFREQ ; STRATA strata ; CLUSTER cluster ; WEIGHT weight ; TABLES grade*untreated / row cl ; RUN ; PROC SURVEYFREQ ; STRATA strata ; CLUSTER cluster ; WEIGHT weight ; TABLES grade*experience / row cl ; RUN ; PROC SURVEYFREQ ; STRATA strata ; CLUSTER cluster ; WEIGHT weight ; TABLES grade*sealants / row cl ; RUN ; Table 4: PROC SURVEYFREQ ; STRATA strata ; 4
5 CLUSTER cluster ; WEIGHT weight ; TABLES grade*race*sealants / row cl ; RUN ; Epi Info sample code: Epi Info includes a module for complex sample survey analysis. The analytic capabilities of Epi Info are limited and are oriented towards public health field work applications. Sample design information is entered into the appropriate box (Weight, PSU, Stratify by) in the dialog box that appears once an analysis (Complex Sample Frequencies, Complex Sample Tables, Complex Sample Means) has been selected. You can also use the syntax codes below. Variables may be numeric or character. The input data file does not need to be sorted by stratum and/or PSU variables before analysis. IMPORTANT NOTE: As of July 2013, Epi Info does not have the ability to appropriately generate subpopulation analyses. Using the Select statement will drop observations and may impact information about the sampling design, strata, and primary sampling units (PSUs). Dropping observations from the data set for children who are not in the subpopulation may result in loss of information on some strata and PSUs, resulting in estimates that do not account for the correct number of strata and PSUs. Univariate analysis (data not presented): FREQ untreated experience sealants STRATAVAR=strata WEIGHTVAR=weight PSUVAR=cluster Table 3: TABLES grade untreated STRATAVAR=strata WEIGHTVAR=weight PSUVAR=cluster TABLES grade experience STRATAVAR=strata WEIGHTVAR=weight PSUVAR=cluster TABLES grade sealants STRATAVAR=strata WEIGHTVAR=weight PSUVAR=cluster Table 4: SELECT grade= 3 TABLES race sealants STRATAVAR=strata WEIGHTVAR=weight PSUVAR=cluster R sample code: R is an open source, freely available software. Users develop R packages for specific purposes. Analysis of complex sample survey data requires the package survey developed by Thomas Lumley at University of Washington. Details of R can be found at the R-Project website and further details of the survey package can be found at Thomas Lumley s website Variables may be numeric or character. The input data file does not need to be sorted by stratum and/or PSU variables before analysis. #Describe the sample design to R BSS <- svydesign(id=~cluster, strat=~strata, weight=~weight, data=dat2) Univariate analysis (data not presented): uniana <- svymean(~untreated+experience+sealants, BSS,na.rm = TRUE) #Calculating Confidence Intervals t1<- ftable(uniana) UniTab = data.frame(mean = t1[,1], CintLow = t1[,1]-1.96*t1[,2], CintHigh = t1[,1]+1.96*t1[,2]) #Rounding Table Values UniTab[,1:3] <- round(100*unitab[,1:3],1) 5
6 Table 3: #Estimate proportions and standard errors within groups vun<-svyby(~untreated, ~Grade, svymean, design=bss, keep.names=false, na.rm = TRUE) vexp<-svyby(~experience, ~Grade, svymean, design=bss, keep.names=false, na.rm = TRUE) vseal<-svyby(~sealants, ~Grade, svymean, design=bss, keep.names=false, na.rm = TRUE) #Calculating the Confidence Intervals v1 = vexp[,1:2] v1$untreated = vun$untreated v1$cll.untreated= v1$untreated *vun$se.Untreated v1$clu.untreated= v1$untreated *vun$se.Untreated v1$experience = vexp$experience v1$cll.experience= v1$experience *vexp$se.Experience v1$clu.experience= v1$experience *vexp$se.Experience v1$sealants = vseal$sealants v1$cll.sealants= v1$sealants *vseal$se.Sealants v1$clu.sealants= v1$sealants *vseal$se.Sealants #Rounding Table Values v1[,2:10] <- round(100*v1[,2:10],1) Table 4: v3seal<-svyby(~sealants, ~Grade+Race, svymean, design=bss, keep.names=false, na.rm = TRUE) # Calculating Confidence Interval and Rounding v3seal$cintlow=v3seal$sealants *v3Seal$se.Sealants v3seal$cinthigh=v3seal$sealants *v3Seal$se.Sealants v3seal[,3:6] <- round(100*v3seal[,3:6],1) SUDAAN sample code: SUDAAN is specifically designed to analyze complex sample survey data. The user describes the sample survey design in three statements: (1) by specifying an option for the DESIGN keyword on the PROC statement, (2) by specifying the stratification and clustering (PSU) variables on the NEST design statement, and (3) by specifying the analysis weight variable on the WEIGHT design statement. All variables must be numeric. For this example, grade was changed for K and 3 to 0 and 3. Data should be sorted by the variables that appear on the NEST statement before analysis, otherwise procedure syntax must contain the NOTSORTED option. Univariate analysis (data not presented): proc descript data=bss design=wr conf_lim=95; nest strata cluster; weight weight; var EXPERIENCE UNTREATED SEALANTS; catlevel 1 1 1; run; Table 3: proc descript data=bss design=wr conf_lim=95; nest strata cluster; weight weight; 6
7 class GRADE; var EXPERIENCE UNTREATED SEALANTS; catlevel 1 1 1; tables GRADE; run; Table 4: proc descript data=bss design=wr conf_lim=95; nest strata cluster; weight weight; class RACE; var SEALANTS; catlevel 1; tables RACE; subpopn GRADE=3; run; STATA sample code: STATA versions 7 or higher offers the capability to perform many statistical procedures on complex sample survey data. When performing menu-driven analyses, sample design information is entered into boxes on the MAIN (PSU and stratification variables) and WEIGHT (analysis weight variable) tabs of the dialogue box that appears after declare survey design for data set is chosen from the Survey Data Analysis menu. If syntax is written the information is included on the SVYSET statement. The survey design descriptors only need to be entered once at the beginning of the analysis session. Although variables in STATA data sets can be numeric or character, all variables used in an analysis must be numeric. The input data file does not need to be sorted by stratum and/or PSU variables before analysis. Univariate analysis (data not presented): Svyset Cluster [pweight = Weight], strata(strata) Svy linearized : proportion Sealants Untreated Experience Table 3: Svy linearized : proportion Sealants Untreated Experience, over(grade) Table 4: Svy linearized : proportion Sealants Untreated Experience, over(race Grade) SPSS sample code: SPSS has an add-on module, SPSS Complex Samples, which includes sample selection and analysis of complex sample survey data. When performing menu-driven analysis, the sample design information is entered into a dialogue box when preparing for analysis and is saved as a sampling plan for the data set. Once the sampling plan has been created, it will be opened along with the data set at the beginning of an SPSS session. You can also use the following syntax code. Variables may be numeric or character. The input data file does not need to be sorted by stratum and/or PSU variables before analysis. *Analysis Preparation Wizard. CSPLAN ANALYSIS /PLAN FILE='...\astdd test sample.csaplan' /PLANVARS ANALYSISWEIGHT=Weight /SRSESTIMATOR TYPE=WOR /PRINT PLAN /DESIGN STRATA=Strata CLUSTER=Cluster 7
8 /ESTIMATOR TYPE=WR. Univariate analysis (data not presented): * Complex Samples Frequencies. CSTABULATE /PLAN FILE='...\astdd test sample.csaplan' /TABLES VARIABLES=Untreated experience Sealants /CELLS POPSIZE TABLEPCT /STATISTICS CIN(95) /MISSING SCOPE=TABLE CLASSMISSING=EXCLUDE. Table 3: * Complex Samples Frequencies. CSTABULATE /PLAN FILE='...\astdd test sample.csaplan' /TABLES VARIABLES=Untreated /SUBPOP TABLE=Grade DISPLAY=LAYERED /CELLS POPSIZE TABLEPCT /STATISTICS SE CIN(95) /MISSING SCOPE=TABLE CLASSMISSING=EXCLUDE. * Complex Samples Frequencies. CSTABULATE /PLAN FILE='...\astdd test sample.csaplan' /TABLES VARIABLES=experience /SUBPOP TABLE=Grade DISPLAY=LAYERED /CELLS POPSIZE TABLEPCT /STATISTICS SE CIN(95) /MISSING SCOPE=TABLE CLASSMISSING=EXCLUDE. * Complex Samples Frequencies. CSTABULATE /PLAN FILE='...\astdd test sample.csaplan' /TABLES VARIABLES=Sealants /SUBPOP TABLE=Grade DISPLAY=LAYERED /CELLS POPSIZE TABLEPCT /STATISTICS SE CIN(95) /MISSING SCOPE=TABLE CLASSMISSING=EXCLUDE. Table 4: * Complex Samples Crosstabs. CSTABULATE /PLAN FILE='...\astdd test sample.csaplan' /TABLES VARIABLES=Race BY Sealants /SUBPOP TABLE=Grade DISPLAY=LAYERED /CELLS POPSIZE ROWPCT /STATISTICS SE CIN(95) /MISSING SCOPE=TABLE CLASSMISSING=EXCLUDE. 8
9 Comparison of results: Tables 3 and 4 compare the results from each of the aforementioned statistical software packages from a recent state oral health survey of kindergarten and 3 rd grade children. Table 3 presents the prevalence of decay experience, untreated decay, and dental sealants by grade while the prevalence of dental sealants among 3 rd grade children by race is presented in Table 4. Each of the six statistical software packages described in this document can be used to appropriately analyze data from a school-based survey, although Epi Info may not appropriately account for the correct number of strata and PSUs in subpopulation analyses. Are there other things that I should consider or be aware of? Yes, there are a variety of other issues that may impact your analysis or how you report your survey reports. Following is a short list that you should review. If you have additional questions or concerns please contact ASTDD. Finite population correction: If more than 10% of children from within any given strata are selected, you may want to consider using a finite population correction factor, which reduces variance yielding smaller standard errors and confidence intervals. For additional information on finite population correction refer to Introduction to Survey Sampling. No data for a given sampling interval: If a school refuses to participate, we encourage you to select a replacement school from the same sampling interval. Unfortunately, circumstances may result in an inability to screen the original or a replacement school in a given sampling interval. If this happens you should clearly report that you were not able to screen children in a sampling interval along with what that interval represented. For example, if you selected 70 schools but you only have data for 69,report that you are missing data from one sampling interval that represents children from region 3 attending schools where 30-40% of the children are eligible for the free or reduced price school lunch (FRL) program. Reporting response rates: For each school in your survey, you should collect the number of children enrolled in the grade of interest on the day of the screening (or the number invited to participate if you did not invite all children in a given grade). Your response rate for the survey will be the number screened divided by the number enrolled or invited to participate. Stratifying results by school FRL status: Many states use school FRL status as a surrogate measure of socioeconomic status. We recommend using the current year FRL status of the school if stratifying the results by FRL status; this information can be obtained from the school on the day of the screening. Limitations of survey: When preparing your survey report, it is important to clearly state any limitations of the survey including representativeness and response rate. Confidence intervals: Confidence intervals are important because they provide context for understanding the precision or exactness of a point estimate. The wider the confidence interval, the less exact the point value estimate becomes. Take, for example, a point estimate of 40% for the prevalence of dental caries experience. If the confidence interval of this point estimate is 35%-45%, then we can have greater certainty that the true prevalence is near 40% than if the confidence interval was 10-70%. For your data to be included in NOHSS, confidence intervals must be included (unless you screened all children in your target population). Where can I get additional help? ASTDD can help you with the survey analysis process. Please contact us if you have any questions. Association of State & Territorial Dental Directors Kathy Phipps, Data and Surveillance Coordinator Phone: , [email protected] Acknowledgements Supported by Cooperative Agreement 5U58DP from the Centers for Disease Control and Prevention. Its contents are solely the responsibility of the authors and do not necessarily represent the official views of 9
10 CDC. ASTDD would like to thank Kathy Phipps, Michael Manz, Laurie Barker, Eugenio Beltran, Mei Lin, Liang Wei and Srdjan Lesaja for their assistance in developing and reviewing this guidance. References and additional resources Software for Analysis of YRBS Data, Division of Adolescent and School Health, Centers for Disease Control and Prevention, Brogan D, Sampling error estimation for survey data. In: United Nations, Department of Economic and Social Affairs, Household Sample Surveys in Developing and Transition Countries, Available at: Kalton G. Introduction to survey sampling. Quantitative Applications in the Social Sciences Sage Publications, Beverly Hills. Series/Number Isbn Kish L. Statistical Design for Research John Wiley & Sons, Hoboken. 10
11 Example #1 Weight factor calculation for a survey that used systematic PPS sampling with implicit stratification by region, urban/rural status and F/R lunch This example shows the steps from sample selection to weight factor calculation when a probability proportional to size (PPS) sampling strategy is used. Based on available resources, the decision was made to include 70 schools in the Utopia oral health survey of 3 rd grade children. The following sampling steps were employed: The sampling frame list was sorted by region then by urban/rural status within each region Schools were then sorted by percent of children participating in the FRL program within urban/rural school categories. Calculations used for selecting the systematic PPS sample: Sampling interval for sampling = (total 3 rd grade enrollment) / (# of schools to be screened) o 53,320 / 70 = Random start = random number between 0 and interval (761.7) = o This is the first school selection number o There are a variety of methods for selecting a random number including, but not limited to, Excel and Select the school with the 148 th child. Add the sampling interval (761.7) to 148 to get the next school (909.7). Continue adding the sampling interval repeatedly until all 70 school selections are made , 909.7, , , , , , These numbers are matched to the cumulative enrollment numbers in the sampling list. The schools with enrollment intervals containing the sample selection numbers are selected into the sample. The sampling frame list and the selected schools are shown in Table 1. Weight factor calculation (Table 1): Weight = (number of children in sampling interval) / (number of children screened) When PPS sampling is used the number of children in the sampling interval will always be the sampling interval used when selecting the sample, in this case,
12 Table 1: Systematic PPS sampling with implicit stratification by region, urban/rural and F/R lunch Region Urban/ Rural School Name Free/Reduced Lunch Percent 3rd Grade Enrollment Cumulative Enrollment Selected School # of Children in Sampling Interval (A) Number Screened (B) 1 Rural KEOWEE 37.1% Rural WALHALLA 39.8% Rural RAVENEL 52.3% Rural LAKEVIEW 52.4% Rural NINETY SIX 52.7% Rural NORTHSIDE 52.8% Rural MCCORMICK 53.9% Rural FAIR-OAK 55.7% Rural HICKORY TAVERN 56.8% Rural HOLLYWOOD 57.2% Rural CHEROKEE TRAIL 57.8% Rural PINECREST 60.0% Rural MERRYWOOD 60.1% 90 1,083 1 Rural DIAMOND HILL 60.3% 38 1,121 1 Rural SPRINGFIELD 60.9% 89 1,210 1 Rural TAMASSEE-SALEM 61.7% 41 1,251 1 Rural WESTMINSTER 67.2% 62 1,313 1 Rural HODGES 67.5% 36 1,349 1 Rural LAURENS 67.7% 92 1,441 1 Rural GRAY COURT OWINGS 68.7% 58 1,499 1 Rural E B MORSE 68.7% 95 1,594 1 Rural CLINTON 69.0% 87 1,681 1, Rural WESTWOOD 69.8% 120 1,801 1 Rural ORCHARD PARK 71.5% 61 1,862 1 Rural WARE SHOALS PRIMARY 72.0% 55 1,917 1 Rural JOANNA-WOODSON 72.2% 50 1,967 1 Rural JAMES M BROWN 73.9% 98 2,065 1 Rural OAKLAND 74.8% 78 2,143 1 Rural BLUE RIDGE ELEMENTARY 76.6% 90 2,233 1 Rural WATERLOO 77.5% 37 2,270 1 Rural EASTSIDE 78.9% 67 2,337 1 Rural WOODFIELDS 80.8% 97 2,434 2, Rural SALUDA 81.0% 107 2,541 1 Rural MATHEWS 83.3% 84 2,625 1 Rural JOHN C CALHOUN 89.9% 36 2,661 1 Rural FORD 92.8% 81 2,742 1 Urban MIDWAY SCHL 16.7% 142 2,884 1 Urban WREN 26.5% 100 2,984 1 Urban WRIGHT 28.6% 28 3,012 1 Urban POWDERSVILLE 31.2% 173 3,185 1 Urban CONCORD 31.5% 133 3,318 3, Urban HUNT MEADOWS 39.8% 75 3,393 1 Urban MT LEBANON 41.9% 55 3,448 1 Urban MERRIWETHER 48.4% 120 3,568 1 Urban SPEARMAN 51.3% 60 3,628 1 Urban LA FRANCE 51.3% 52 3,680 1 Urban BELTON 52.2% 160 3,840 1 Urban STARR 53.2% 57 3,897 1 Urban CENTERVILLE 55.5% 117 4,014 3, Urban WEST PELZER 55.6% 69 4,083 1 Urban HONEA PATH 55.6% 97 4,180 1 Urban CEDAR GROVE 55.9% 90 4,270 1 Urban PALMETTO 61.9% 90 4,360 1 Urban TOWNVILLE 63.5% 36 4,396 1 Urban W E PARKER 64.9% 88 4,484 1 Urban MCLEES 65.1% 118 4,602 1 Urban IVA 67.2% 70 4,672 1 Urban CALHOUN ACADEMY 67.6% 120 4,792 4, Urban WHITEHALL 69.9% 75 4,867 1 Urban NEW PROSPECT 72.3% 66 4,933 1 Urban JOHNSTON 73.5% 49 4,982 1 Urban HOMELAND PARK 74.7% 52 5,034 1 Urban PENDLETON 74.7% 52 5,086 1 Urban FLAT ROCK. 76.5% 67 5,153 1 Urban NEVITT FOREST SCHOOL 80.7% 66 5,219 1 Urban DOUGLAS 82.4% 48 5,267 1 Urban VARENNES ACADEMY 90.8% 65 5,332 2 Rural SPARTANBURG 43.2% 45 5,377 2 Rural LOCKHART 58.8% 23 5,400 2 Rural BUFFALO 70.9% 103 5,503 Weight (A)/(B) 12
13 Example #2 Weight factor calculation for a survey that used systematic non-pps sampling with implicit stratification by region, urban/rural status and F/R lunch This example shows the steps from sample selection to weight factor calculation when a non-pps sampling strategy is used. Based on available resources, the decision was made to include 70 schools in the Utopia oral health survey of 3 rd grade children. The following sampling steps were employed: The sampling frame list was sorted by region then by urban/rural status within each region Schools were then sorted by percent of children participating in the FRL program within urban/rural school categories. Calculations used for selecting the systematic non-pps sample: Sampling interval for sampling = (number of schools in sampling frame) / (# of schools to be screened) o 700 / 70 = 10.0 Random start = random number between 1 and interval (10) = 6.0 o This is the first school selection number o There are a variety of methods for selecting a random number including, but not limited to, Excel and Select the 6 th school. Add the sampling interval (10.0) to 6 to get the next school (16.0). Continue adding the sampling interval repeatedly until all 70 school selections are made. 6.0, 16.0, 26.0, 36.0, 46.0, 56.0, 66.0, These numbers are matched to the sequential number of schools in the sampling list to identify the schools selected into the sample. The sampling frame list and the selected schools are shown in Table 2. Weight factor calculation (Table 2): Weight = (number of children in sampling interval) / (number of children screened) When non-pps sampling is used the number of children in the sampling interval will vary from one interval to another. The number of children in the sampling interval is the total of all 3 rd grade children in the given interval. NOTE: In this example, dividing the number of schools by the number of schools to screen produced a whole number. Please contact ASTDD if you need more information on how to appropriately calculate weights if a fractional sampling interval was used. 13
14 Table 2: Systematic sampling (non-pps) with implicit stratification by region, urban/rural status and F/R lunch Region Urban/ Rural School Name Free/Reduced Lunch Percent 3rd Grade Enrollment Cumulative # of Schools Sampling Interval Selected School # of Children in Sampling Interval (A) Number Screened (B) 1 Rural KEOWEE 37.1% Rural WALHALLA 39.8% Rural RAVENEL 52.3% Rural LAKEVIEW 52.4% Rural NINETY SIX 52.7% Rural NORTHSIDE 52.8% Rural MCCORMICK 53.9% Rural FAIR-OAK 55.7% Rural HICKORY TAVERN 56.8% Rural HOLLYWOOD 57.2% Rural CHEROKEE TRAIL 57.8% Rural PINECREST 60.0% Rural MERRYWOOD 60.1% Rural DIAMOND HILL 60.3% Rural SPRINGFIELD 60.9% Rural TAMASSEE-SALEM 61.7% Rural WESTMINSTER 67.2% Rural HODGES 67.5% Rural LAURENS 67.7% Rural GRAY COURT OWINGS 68.7% Rural E B MORSE 68.7% Rural CLINTON 69.0% Rural WESTWOOD 69.8% Rural ORCHARD PARK 71.5% Rural WARE SHOALS 72.0% Rural JOANNA-WOODSON 72.2% Rural JAMES M BROWN 73.9% Rural OAKLAND 74.8% Rural BLUE RIDGE 76.6% Rural WATERLOO 77.5% Rural EASTSIDE 78.9% Rural WOODFIELDS 80.8% Rural SALUDA 81.0% Rural MATHEWS 83.3% Rural JOHN C CALHOUN 89.9% Rural FORD 92.8% Urban MIDWAY 16.7% Urban WREN 26.5% Urban WRIGHT 28.6% Urban POWDERSVILLE 31.2% Urban CONCORD 31.5% Urban HUNT MEADOWS 39.8% Urban MT LEBANON 41.9% Urban MERRIWETHER 48.4% Urban SPEARMAN 51.3% Urban LA FRANCE 51.3% Urban BELTON 52.2% Urban STARR 53.2% Urban CENTERVILLE 55.5% Urban WEST PELZER 55.6% Urban HONEA PATH 55.6% Urban CEDAR GROVE 55.9% Urban PALMETTO 61.9% Urban TOWNVILLE 63.5% Urban W E PARKER 64.9% Urban MCLEES 65.1% Urban IVA 67.2% Urban CALHOUN ACADEMY 67.6% Urban WHITEHALL 69.9% Urban NEW PROSPECT 72.3% Urban JOHNSTON 73.5% Urban HOMELAND PARK 74.7% Urban PENDLETON 74.7% Urban FLAT ROCK 76.5% Urban NEVITT FOREST 80.7% Urban DOUGLAS 82.4% Urban VARENNES ACADEMY 90.8% Rural SPARTANBURG 43.2% Rural LOCKHART 58.8% Rural BUFFALO 70.9% Weight (A)/(B) 14
15 Table 3: Results from analyses of an oral health dataset using SAS, Epi Info, R, SUDAAN, SPSS and Stata Kindergarten 3 rd Grade Oral health variable 95% CI 95% CI 95% CI 95% CI Estimated % Estimated % Lower limit Upper limit Lower limit Upper limit Decay experience (% yes) SAS Epi Info R SUDAAN SPSS Stata Untreated decay (% yes) SAS Epi Info R SUDAAN SPSS Stata Dental sealants (% yes) SAS Epi Info R NA SUDAAN SPSS Stata Table 4: Results from analyses of an oral health dataset using SAS, Epi Info, R, SUDAAN, SPSS and Stata 3 rd Grade Students Non-Hispanic White Non-Hispanic Black Oral health variable 95% CI 95% CI 95% CI 95% CI Estimated % Estimated % Lower limit Upper limit Lower limit Upper limit Dental sealants (% yes) SAS Epi Info R SUDAAN SPSS Stata
Software for Analysis of YRBS Data
Youth Risk Behavior Surveillance System (YRBSS) Software for Analysis of YRBS Data June 2014 Where can I get more information? Visit www.cdc.gov/yrbss or call 800 CDC INFO (800 232 4636). CONTENTS Overview
Youth Risk Behavior Survey (YRBS) Software for Analysis of YRBS Data
Youth Risk Behavior Survey (YRBS) Software for Analysis of YRBS Data CONTENTS Overview 1 Background 1 1. SUDAAN 2 1.1. Analysis capabilities 2 1.2. Data requirements 2 1.3. Variance estimation 2 1.4. Survey
The SURVEYFREQ Procedure in SAS 9.2: Avoiding FREQuent Mistakes When Analyzing Survey Data ABSTRACT INTRODUCTION SURVEY DESIGN 101 WHY STRATIFY?
The SURVEYFREQ Procedure in SAS 9.2: Avoiding FREQuent Mistakes When Analyzing Survey Data Kathryn Martin, Maternal, Child and Adolescent Health Program, California Department of Public Health, ABSTRACT
Variance Estimation Guidance, NHIS 2006-2014 (Adapted from the 2006-2014 NHIS Survey Description Documents) Introduction
June 9, 2015 Variance Estimation Guidance, NHIS 2006-2014 (Adapted from the 2006-2014 NHIS Survey Description Documents) Introduction The data collected in the NHIS are obtained through a complex, multistage
Sampling Error Estimation in Design-Based Analysis of the PSID Data
Technical Series Paper #11-05 Sampling Error Estimation in Design-Based Analysis of the PSID Data Steven G. Heeringa, Patricia A. Berglund, Azam Khan Survey Research Center, Institute for Social Research
Workshop on Using the National Survey of Children s s Health Dataset: Practical Applications
Workshop on Using the National Survey of Children s s Health Dataset: Practical Applications Julian Luke Stephen Blumberg Centers for Disease Control and Prevention National Center for Health Statistics
Chapter 11 Introduction to Survey Sampling and Analysis Procedures
Chapter 11 Introduction to Survey Sampling and Analysis Procedures Chapter Table of Contents OVERVIEW...149 SurveySampling...150 SurveyDataAnalysis...151 DESIGN INFORMATION FOR SURVEY PROCEDURES...152
National Longitudinal Study of Adolescent Health. Strategies to Perform a Design-Based Analysis Using the Add Health Data
National Longitudinal Study of Adolescent Health Strategies to Perform a Design-Based Analysis Using the Add Health Data Kim Chantala Joyce Tabor Carolina Population Center University of North Carolina
Chapter XXI Sampling error estimation for survey data* Donna Brogan Emory University Atlanta, Georgia United States of America.
Chapter XXI Sampling error estimation for survey data* Donna Brogan Emory University Atlanta, Georgia United States of America Abstract Complex sample survey designs deviate from simple random sampling,
INTRODUCTION TO SURVEY DATA ANALYSIS THROUGH STATISTICAL PACKAGES
INTRODUCTION TO SURVEY DATA ANALYSIS THROUGH STATISTICAL PACKAGES Hukum Chandra Indian Agricultural Statistics Research Institute, New Delhi-110012 1. INTRODUCTION A sample survey is a process for collecting
Complex Survey Design Using Stata
Complex Survey Design Using Stata 2010 This document provides a basic overview of how to handle complex survey design using Stata. Probability weighting and compensating for clustered and stratified samples
Survey Data Analysis in Stata
Survey Data Analysis in Stata Jeff Pitblado Associate Director, Statistical Software StataCorp LP Stata Conference DC 2009 J. Pitblado (StataCorp) Survey Data Analysis DC 2009 1 / 44 Outline 1 Types of
Comparing Alternate Designs For A Multi-Domain Cluster Sample
Comparing Alternate Designs For A Multi-Domain Cluster Sample Pedro J. Saavedra, Mareena McKinley Wright and Joseph P. Riley Mareena McKinley Wright, ORC Macro, 11785 Beltsville Dr., Calverton, MD 20705
New SAS Procedures for Analysis of Sample Survey Data
New SAS Procedures for Analysis of Sample Survey Data Anthony An and Donna Watts, SAS Institute Inc, Cary, NC Abstract Researchers use sample surveys to obtain information on a wide variety of issues Many
Count Your Smiles. Count Your Smiles
Count Your Smiles 2011-2012 Count Count Your Your Smiles Smiles i ACKNOWLEDGEMENTS State of Michigan Governor Rick Snyder May 2011 Michigan Department of Community Health Director Olga Dazzo Public Health
Survey Analysis: Options for Missing Data
Survey Analysis: Options for Missing Data Paul Gorrell, Social & Scientific Systems, Inc., Silver Spring, MD Abstract A common situation researchers working with survey data face is the analysis of missing
WHO STEPS Surveillance Support Materials. STEPS Epi Info Training Guide
STEPS Epi Info Training Guide Department of Chronic Diseases and Health Promotion World Health Organization 20 Avenue Appia, 1211 Geneva 27, Switzerland For further information: www.who.int/chp/steps WHO
Logistic (RLOGIST) Example #3
Logistic (RLOGIST) Example #3 SUDAAN Statements and Results Illustrated PREDMARG (predicted marginal proportion) CONDMARG (conditional marginal proportion) PRED_EFF pairwise comparison COND_EFF pairwise
Life Table Analysis using Weighted Survey Data
Life Table Analysis using Weighted Survey Data James G. Booth and Thomas A. Hirschl June 2005 Abstract Formulas for constructing valid pointwise confidence bands for survival distributions, estimated using
The Research Data Centres Information and Technical Bulletin
Catalogue no. 12-002 X No. 2014001 ISSN 1710-2197 The Research Data Centres Information and Technical Bulletin Winter 2014, vol. 6 no. 1 How to obtain more information For information about this product
Constructing a Table of Survey Data with Percent and Confidence Intervals in every Direction
Constructing a Table of Survey Data with Percent and Confidence Intervals in every Direction David Izrael, Abt Associates Sarah W. Ball, Abt Associates Sara M.A. Donahue, Abt Associates ABSTRACT We examined
California s State Oral Health Infrastructure: Opportunities for Improvement and Funding
California s State Oral Health Infrastructure: Opportunities for Improvement and Funding joel diringer, jd, mph, and kathy r. phipps, drph, rdh abstract California has virtually no statewide dental public
Best Practices in Using Large, Complex Samples: The Importance of Using Appropriate Weights and Design Effect Compensation
A peer-reviewed electronic journal. Copyright is retained by the first or sole author, who grants right of first publication to the Practical Assessment, Research & Evaluation. Permission is granted to
Failure to take the sampling scheme into account can lead to inaccurate point estimates and/or flawed estimates of the standard errors.
Analyzing Complex Survey Data: Some key issues to be aware of Richard Williams, University of Notre Dame, http://www3.nd.edu/~rwilliam/ Last revised January 24, 2015 Rather than repeat material that is
Introduction to Sampling. Dr. Safaa R. Amer. Overview. for Non-Statisticians. Part II. Part I. Sample Size. Introduction.
Introduction to Sampling for Non-Statisticians Dr. Safaa R. Amer Overview Part I Part II Introduction Census or Sample Sampling Frame Probability or non-probability sample Sampling with or without replacement
Teacher Attrition and Mobility:
NCES 2014-077 U.S. DEPARTMENT OF EDUCATION Teacher Attrition and Mobility: Results From the 2012 13 Teacher Follow-up Survey First Look Teacher Attrition and Mobility: Results From the 2012 13 Teacher
Robert L. Santos, Temple University
n ONE APPROACH TO OVERSAMPLING BLACKS AND HISPANICS: THE NATIONAL ALCOHOL SURVEY Robert L. Santos, Temple University I. Introduction The 1984 National Alcohol Survey (NAS) is the first national household
CMS oral Health Initiative an opportunity for Dental Hygienists April 23 rd 2015 Nancy Gurzick, RDH, BSDH, MA
CMS oral Health Initiative an opportunity for Dental Hygienists April 23 rd 2015 Nancy Gurzick, RDH, BSDH, MA Course objectives Part I Federal and State Oral Health Programs Understanding Medicaid/Federal
Accounting for complex survey design in modeling usual intake. Kevin W. Dodd, PhD National Cancer Institute
Accounting for complex survey design in modeling usual intake Kevin W. Dodd, PhD National Cancer Institute Slide 1 Hello and welcome to today s webinar, the fourth in the Measurement Error Webinar Series.
State of Mississippi. Oral Health Plan
State of Mississippi Oral Health Plan 2006 2010 Vision Statement: We envision a Mississippi where every child enjoys optimal oral health; where prevention and health education are emphasized and treatment
Using Repeated Measures Techniques To Analyze Cluster-correlated Survey Responses
Using Repeated Measures Techniques To Analyze Cluster-correlated Survey Responses G. Gordon Brown, Celia R. Eicheldinger, and James R. Chromy RTI International, Research Triangle Park, NC 27709 Abstract
Need for Sampling. Very large populations Destructive testing Continuous production process
Chapter 4 Sampling and Estimation Need for Sampling Very large populations Destructive testing Continuous production process The objective of sampling is to draw a valid inference about a population. 4-
Organizing Your Approach to a Data Analysis
Biost/Stat 578 B: Data Analysis Emerson, September 29, 2003 Handout #1 Organizing Your Approach to a Data Analysis The general theme should be to maximize thinking about the data analysis and to minimize
Paper RIV15 SAS Macros to Produce Publication-ready Tables from SAS Survey Procedures
Paper RIV15 SAS Macros to Produce Publication-ready Tables from SAS Survey Procedures ABSTRACT Emma L. Frazier, Centers for Disease Control, Atlanta, Georgia Shuyan Zhang, ICF International, Atlanta, Georgia
2013 State of Colorado Distracted Driver Study
2013 State of Colorado Distracted Driver Study Colorado Department of Transportation SEAT BE L STUDY T INSTITUTE OF TRANSPORTATION MANAGEMENT EXECUTIVE SUMMARY The Institute of Transportation Management
Guidelines for Analyzing Add Health Data. Ping Chen Kim Chantala
Guidelines for Analyzing Add Health Data Ping Chen Kim Chantala Carolina Population Center University of North Carolina at Chapel Hill Last Update: March 2014 Table of Contents Overview... 3 Understanding
Northumberland Knowledge
Northumberland Knowledge Know Guide How to Analyse Data - November 2012 - This page has been left blank 2 About this guide The Know Guides are a suite of documents that provide useful information about
2014 National Study of Long-Term Care Providers (NSLTCP) Adult Day Services Center Survey. Restricted Data File. July 2015
2014 National Study of Long-Term Care Providers (NSLTCP) Adult Day Services Center Survey Restricted Data File July 2015 Data Description and Usage (Readme) Table of Contents Data File... 4 Documentation...
IBM SPSS Direct Marketing 22
IBM SPSS Direct Marketing 22 Note Before using this information and the product it supports, read the information in Notices on page 25. Product Information This edition applies to version 22, release
How to Analyze ANES Survey Data
How to Analyze ANES Survey Data May 2010 (Revised September 28, 2010) Matthew DeBell Stanford University American National Election Studies How to Analyze ANES Survey Data This document may be freely copied
Survey Data Analysis in Stata
Survey Data Analysis in Stata Jeff Pitblado Associate Director, Statistical Software StataCorp LP 2009 Canadian Stata Users Group Meeting Outline 1 Types of data 2 2 Survey data characteristics 4 2.1 Single
FACILITATOR/MENTOR GUIDE
FACILITATOR/MENTOR GUIDE Descriptive analysis variables table shells hypotheses Measures of association methods design justify analytic assess calculate analysis problem stratify confounding statistical
SPSS and AM statistical software example.
A detailed example of statistical analysis using the NELS:88 data file and ECB, to perform a longitudinal analysis of 1988 8 th graders in the year 2000: SPSS and AM statistical software example. Overall
6 Regression With Survey Data From Complex Samples
6 Regression With Survey Data From Complex Samples Secondary analysis of data from large national surveys figures prominently in social science and public health research, and these surveys use complex
6. CONDUCTING SURVEY DATA ANALYSIS
49 incorporating adjustments for the nonresponse and poststratification. But such weights usually are not included in many survey data sets, nor is there the appropriate information for creating such replicate
Measuring change with the Belgian Survey on Income and Living Conditions (SILC): taking account of the sampling variance
Measuring change with the Belgian Survey on Income and Living Conditions (SILC): taking account of the sampling variance Report Financed by FPS Social Security Belgium (Ref.: 2013/BOEP/UA-EU2020) December
Data The estimates presented in the tables originate from the 2013 SCS to the NCVS. The SCS collects information about student and school
This document reports data from the 2013 School Crime Supplement (SCS) of the National Crime Victimization Survey (NCVS). 1 The Web Tables show the extent to which with different personal characteristics
Statistical and Methodological Issues in the Analysis of Complex Sample Survey Data: Practical Guidance for Trauma Researchers
Journal of Traumatic Stress, Vol. 2, No., October 2008, pp. 440 447 ( C 2008) Statistical and Methodological Issues in the Analysis of Complex Sample Survey Data: Practical Guidance for Trauma Researchers
Logistic (RLOGIST) Example #7
Logistic (RLOGIST) Example #7 SUDAAN Statements and Results Illustrated EFFECTS UNITS option EXP option SUBPOPX REFLEVEL Input Data Set(s): SAMADULTED.SAS7bdat Example Using 2006 NHIS data, determine for
IBM SPSS Direct Marketing 23
IBM SPSS Direct Marketing 23 Note Before using this information and the product it supports, read the information in Notices on page 25. Product Information This edition applies to version 23, release
health watch Marijuana Use Among Adolescents in Colorado: Results from the 2013 Healthy Kids Colorado Survey Background March 2015 No.
Health Statistics & Evaluation Branch Vital Statistics Health Surveys & Analysis Evaluation Survey Research health watch March 15 No. 95 Marijuana Use Among Adolescents in Colorado: Results from the 13
SECTION 3.2: MOTOR VEHICLE TRAFFIC CRASHES
SECTION 3.2: MOTOR VEHICLE TRAFFIC CRASHES 1,155 Deaths* 4,755 Hospitalizations 103,860 ED Visits *SOURCE: OHIO DEPARTMENT OF HEALTH, VITAL STATISTICS SOURCE: OHIO HOSPITAL ASSOCIATION CHAPTER HIGHLIGHTS:
IBM SPSS Statistics 20 Part 4: Chi-Square and ANOVA
CALIFORNIA STATE UNIVERSITY, LOS ANGELES INFORMATION TECHNOLOGY SERVICES IBM SPSS Statistics 20 Part 4: Chi-Square and ANOVA Summer 2013, Version 2.0 Table of Contents Introduction...2 Downloading the
The construction and use of sample design variables in EU SILC. A user s perspective
The construction and use of sample design variables in EU SILC. A user s perspective Report prepared for Eurostat Abstract: Currently, EU SILC is the single most important data source on income and living
Elementary Statistics
Elementary Statistics Chapter 1 Dr. Ghamsary Page 1 Elementary Statistics M. Ghamsary, Ph.D. Chap 01 1 Elementary Statistics Chapter 1 Dr. Ghamsary Page 2 Statistics: Statistics is the science of collecting,
Household Survey Data Basics
Household Survey Data Basics Jann Lay Kiel Institute for the World Economy Overview I. Background II. Household surveys Design Content Quality Availability I. Background Not new, household survey data
SECOND NATIONAL JUVENILE ONLINE VICTIMIZATION INCIDENCE STUDY (NJOV-2)
SECOND NATIONAL JUVENILE ONLINE VICTIMIZATION INCIDENCE STUDY (NJOV-2) NDACAN Dataset Number 162 USER'S GUIDE National Data Archive on Child Abuse and Neglect Bronfenbrenner Center for Translational Research
The Effects of Demographics on Consumer Perceptions of Identity Theft in Rural and Urban Settings
The Effects of Demographics on Consumer Perceptions of Identity Theft in Rural and Urban Settings Axton Betz, Ph.D. Student, Iowa State University 1 Identity theft is a serious crime in which an individual
Oral Health Program. Strategic Plan. U.S. Department of Health and Human Services Centers for Disease Control and Prevention
Oral Health Program Strategic Plan 2011 2014 U.S. Department of Health and Human Services Centers for Disease Control and Prevention Centers for Disease Control and Prevention National Center for Chronic
ESTIMATION OF THE EFFECTIVE DEGREES OF FREEDOM IN T-TYPE TESTS FOR COMPLEX DATA
m ESTIMATION OF THE EFFECTIVE DEGREES OF FREEDOM IN T-TYPE TESTS FOR COMPLEX DATA Jiahe Qian, Educational Testing Service Rosedale Road, MS 02-T, Princeton, NJ 08541 Key Words" Complex sampling, NAEP data,
SAMPLE DESIGN RESEARCH FOR THE NATIONAL NURSING HOME SURVEY
SAMPLE DESIGN RESEARCH FOR THE NATIONAL NURSING HOME SURVEY Karen E. Davis National Center for Health Statistics, 6525 Belcrest Road, Room 915, Hyattsville, MD 20782 KEY WORDS: Sample survey, cost model
Create Custom Tables in No Time
SPSS Custom Tables 17.0 Create Custom Tables in No Time Easily analyze and communicate your results with SPSS Custom Tables, an add-on module for the SPSS Statistics product line Share analytical results
An Introduction to Secondary Data Analysis
research methodology series An Introduction to Secondary Data Analysis Natalie Koziol, MA CYFS Statistics and Measurement Consultant Ann Arthur, MS CYFS Statistics and Measurement Consultant Outline Overview
Public Use Microdata Sample, Guam
Public Use Microdata Sample, Guam 2000 Census of Population and Housing 2000 Issued January 2005 PUMS/03-GUAM Technical Documentation U.S. Department of Commerce Economics and Statistics Administration
Confidence Intervals in Public Health
Confidence Intervals in Public Health When public health practitioners use health statistics, sometimes they are interested in the actual number of health events, but more often they use the statistics
IBM SPSS Statistics for Beginners for Windows
ISS, NEWCASTLE UNIVERSITY IBM SPSS Statistics for Beginners for Windows A Training Manual for Beginners Dr. S. T. Kometa A Training Manual for Beginners Contents 1 Aims and Objectives... 3 1.1 Learning
Using the American Community Survey Data
Using the American Community Survey Data The ACS website is accessible from www.census.gov/acs. In the middle of the screen choose In-depth Data Go directly to the ACS Datasets Tab The ACS Datasets Tab
SPSS: Getting Started. For Windows
For Windows Updated: August 2012 Table of Contents Section 1: Overview... 3 1.1 Introduction to SPSS Tutorials... 3 1.2 Introduction to SPSS... 3 1.3 Overview of SPSS for Windows... 3 Section 2: Entering
A COMPREHENSIVE SOFTWARE PACKAGE FOR SURVEY DATA ANALYSIS
SUDAAN: A COMPREHENSIVE SOFTWARE PACKAGE FOR SURVEY DATA ANALYSIS Lisa M. LaVange, Babubhai V. Shah, Beth G. Barnwell and Joyce F. Killinger Lisa M. LaVan~e. Research Triangle Institute KEYWORDS: variance
Farm Business Survey - Statistical information
Farm Business Survey - Statistical information Sample representation and design The sample structure of the FBS was re-designed starting from the 2010/11 accounting year. The coverage of the survey is
Statistical & Technical Team
Statistical & Technical Team A Practical Guide to Sampling This guide is brought to you by the Statistical and Technical Team, who form part of the VFM Development Team. They are responsible for advice
Using Microsoft Excel to Manage and Analyze Data: Some Tips
Using Microsoft Excel to Manage and Analyze Data: Some Tips Larger, complex data management may require specialized and/or customized database software, and larger or more complex analyses may require
Analysis of categorical data: Course quiz instructions for SPSS
Analysis of categorical data: Course quiz instructions for SPSS The dataset Please download the Online sales dataset from the Download pod in the Course quiz resources screen. The filename is smr_bus_acd_clo_quiz_online_250.xls.
Instructions for Analyzing Data from CAHPS Surveys:
Instructions for Analyzing Data from CAHPS Surveys: Using the CAHPS Analysis Program Version 3.6 The CAHPS Analysis Program...1 Computing Requirements...1 Pre-Analysis Decisions...2 What Does the CAHPS
Power Calculation Using the Online Variance Almanac (Web VA): A User s Guide
Power Calculation Using the Online Variance Almanac (Web VA): A User s Guide Larry V. Hedges & E.C. Hedberg This research was supported by the National Science Foundation under Award Nos. 0129365 and 0815295.
Appendix I: Methodology
Appendix I: Methodology SSRS METHODOLOGY SSRS conducted a survey of Muslims and Jews for the Institute for Social Policy and Understanding from January 18 through January 27, 2016. The study investigated
Issue Brief: Expanding Access to Oral Health Care in Idaho
Issue Brief: Expanding Access to Oral Health Care in Idaho Oral diseases can be attributed to bacterial infections that, if left untreated, can affect other systems of the body. A growing number of studies
Comparison of Variance Estimates in a National Health Survey
Comparison of Variance Estimates in a National Health Survey Karen E. Davis 1 and Van L. Parsons 2 1 Agency for Healthcare Research and Quality, 540 Gaither Road, Rockville, MD 20850 2 National Center
Survey, Statistics and Psychometrics Core Research Facility University of Nebraska-Lincoln. Log-Rank Test for More Than Two Groups
Survey, Statistics and Psychometrics Core Research Facility University of Nebraska-Lincoln Log-Rank Test for More Than Two Groups Prepared by Harlan Sayles (SRAM) Revised by Julia Soulakova (Statistics)
Characteristics of Public and Private Elementary and Secondary School Teachers in the United States:
NCES 2013-314 U.S. DEPARTMENT OF EDUCATION Characteristics of Public and Private Elementary and Secondary School Teachers in the United States: Results From the 2011 12 Schools and Staffing Survey First
