Constructing a Table of Survey Data with Percent and Confidence Intervals in every Direction

Size: px
Start display at page:

Download "Constructing a Table of Survey Data with Percent and Confidence Intervals in every Direction"

Transcription

1 Constructing a Table of Survey Data with Percent and Confidence Intervals in every Direction David Izrael, Abt Associates Sarah W. Ball, Abt Associates Sara M.A. Donahue, Abt Associates ABSTRACT We examined a survey sample consisting of treated and not-treated respondents. We show how, using SAS macros based on PROC SURVEYFREQ, the user can easily construct a table that presents survey findings of interest: the unweighted sample, unweighted sample percent (column percent ) and weighted sample for characteristics/variables of interest (rows). We then show how to use the macros to compute a weighted column percent and the weighted treatment ratio (weighted row percent), with respective confidence intervals. We demonstrate the application of the macros to two types of variables: those representing single-select survey questions (i.e., survey question with one response allowed) and those representing survey questions that allow the respondent to choose more than one response. INTRODUCTION Our assumptions are as follows: 1) The survey data includes calculated survey weights and a variable to identify treated vs. not-treated respondents (TX); 2) The survey data includes variables that characterize a respondent s demographic characteristics, such as age, gender, education, and health insurance; 3) The survey design uses stratification and clustering and thus the survey data include the variables strata and cluster. We constructed a table of survey data based on the following shell: Demographic characteristic Sample Sample % Total Weighted sample Age Gender Male Female Race/Ethnicity 1

2 Demographic characteristic Sample Sample % Non-Hispanic white Weighted sample Non-Hispanic black Hispanic Non-Hispanic other Education High school graduate or less Some college or Associate degree Bachelor's degree Master's degree or above Insurance* Private medical insurance Medicare Medicaid Other public insurance No Health Insurance CI: confidence interval *Respondents may select more than one type of insurance Note that the variables age, gender, race/ethnicity, and education represent single-select survey questions, in contrast to insurance, which represents a survey question for which respondents may select more than one response. Sample % is the column percent based on the unweighted sample ; it totals 100 for all categories of the variables that represent each single-select survey question. is the column percent based on the weighted sample ; it also totals 100 for all categories of variables that represent each single-select survey question. Percent treated is the weighted percent of a given category s population that is identified as treated (weighted row percent). For a multiple choice survey item, such as insurance, the sums of Sample % and may be greater than 100 because respondents may select more than one response. Thus, an individual respondent may be counted in more than one category of the variable representing insurance. SAS MACROS TO CALCULATE PERCENT AND CONFIDENCE INTERVALS IN EVERY DIRECTION 1. The first macro TOTAL computes the first Total row of the table and is driven by two procedures: PROC SUMMARY gives us the sample and weighted sample : PROC SUMMARY nway data= ourdata noprint; var N final_wgt; output out=out(drop=_: ) sum = n wgt_n; PROC SURVEYFREQ calculates the total percent of treated respondents and the lower and upper limits of the 95% confidence interval: PROC SURVEYFREQ data=outdata nosummary; tables TX/cl nostd ; 2

3 ods output OneWay = Tot (keep =TX Frequency WgtFreq Percent LowerCL UpperCL ); strata strata; cluster cluster; weight final_wgt; As expected, the sample percent and the weighted percent for the total row are 100. The macro %TOTAL results in the data set total, which carries all needed values for the first row. 2. The second macro (%SINGLE) is intended to calculate the column values for single-select survey questions (such as age, gender, etc.). The macro call looks like the following: %SINGLE (var, charact, fmt); where var is a reported variable (age, for example), charact is the label that precedes the categories in the leftmost column of the table shell ( Race/Ethnicity, for example), and fmt is a user format with which the categories of the variable will be printed. For the variables that represent single-select survey questions in the above table shell the macro calls look like the following: %SINGLE (age, %NRBQUOTE (Age in years), agef); %SINGLE (sex, %NRBQUOTE (Gender), sexf); %SINGLE (race_ethn, %NRBQUOTE (Race/Ethnicity), racef); %SINGLE (education, %NRBQUOTE (Education), educationf); Here and below we use %NRBQUOTE macro function to accommodate various symbols in the labels, such as,, &, %, etc. Each macro call ultimately creates the data set with the name of the variable the macro processes. This data set contains all the numbers needed to fill the table shell. To combine these data sets for printing we use the following data step: data combined_single; set age sex, race_ethn, education; The core of the macro %SINGLE contains two PROC SURVEYFREQ s and one PROC FREQ. The first PROC SURVEYFREQ calculates the weighted percent of a given category s population that is identified as treated (weighted row percent), with a 95% confidence interval: PROC SURVEYFREQ data=f nosummary; tables &var*tx/cl row nostd; ods output CrossTabs = goriz; strata strata; cluster cluster; weight final_wgt; The data set goriz has all components ( ) of the estimates for all categories of the variable. The second PROC SURVEYFREQ calculates the unweighted and weighted sample for each category of the variable, as well as the weighted column percent and its 95% confidence interval: PROC SURVEYFREQ data=f nosummary ; tables &var/cl nostd; ods output OneWay = vertic(keep = &var frequency wgtfreq percent LowerCL UpperCL rename = (frequency=n wgtfreq = wgt_n )); 3

4 strata strata; cluster cluster; weight final_wgt; The data set vertic has all components (Sample, Weighted sample, ) of the estimates for all categories of the variable. Finally, to calculate the unweighted percent for each category of the variable, we use PROC FREQ (unfortunately, PROC SURVEYFREQ does not calculate the unweighted percent), as follows: PROC FREQ data=f; tables &var/noprint out=unw (keep = &var percent rename = (percent = unw_pct)); The data set unw has all components (Sample %) of the estimates for all categories of the variable. 3. The third macro (%MULTY) is intended to calculate the column values for each category of those survey questions for which respondents may select more than one response ( multiple response items, such as insurance). As a rule, a multiple response item in the SAS data set includes several variables that represent the individual response options. For insurance (shown in the table shell) the variables are I1-I5. Each variable can be selected (1) or not selected (0). Contrary to the way we approached single response items by processing all categories of the variable in one macro call, the %MULTY macro calculates the values of the columns for each variable (I1-I5) separately. The macro call looks like the following: %MULTY (var, text); where var is a variable representing a response option (for example, I1) and text is the name we would like to assign to this variable in the left most column of the table shell (for example, Private medical insurance for I1). For the insurance multiple response item in the above table shell the macro calls look like the following: %MULTY (I1, %NRBQUOTE (Private medical insurance)); %MULTY (I2, %NRBQUOTE (Medicare)); %MULTY (I3, %NRBQUOTE (Medicaid)); %MULTY (I4, %NRBQUOTE (Other public insurance)); %MULTY (I5, %NRBQUOTE (No Health Insurance)); Each macro call ultimately creates a data set with the name of the variable it processes preceded by the prefix r_ that contains all of the numbers to fill the table shell. To combine those data sets for printing we use the following data step: data combined_multy; set r_i1-r_i5; Unlike in the macro %SINGLE, however, the user must assign the title of the multiple response item ( Insurance in our case) to the variables representing the response options. This can be done by creating a dummy data set as follows: data dummy; 4

5 length characteristic $100; characteristic='insurance'; output; After assigning a title, the data set containing all values for each variable that represents an individual response option of a multiple response item is created: data combined_multy; set dummy combined_multy; At the core of the macro %MULTY are essentially the same PROC SUREVYFREQ and PROC FREQ as described above; however, the user should remember that contrary to %SINGLE, %MULTY only works with dichotomized variables (with values 1 and 0 ) and only the level 1 (selected) is the object of the estimate. RESULTS Finally, the user combines the data sets total, combined_single, and combined_multy and then prints the dataset in the format of the table shell. The resulting table for the example described is presented below. Weighted sample Demographic characteristic Sample Sample % Total ( 47.1, 52.6) Age ( 31.7, 37.6) 48.2( 42.3, 54.0) ( 25.3, 30.0) 51.3( 46.4, 56.2) ( 19.8, 23.2) 50.4( 46.6, 54.1) ( 11.2, 14.8) 49.9( 42.6, 57.2) ( 2.9, 3.5) 52.2( 48.2, 56.2) Gender Male ( 46.6, 52.1) 50.0( 46.1, 53.9) Female ( 47.9, 53.4) 49.7( 45.9, 53.5) Race/Ethnicity Non-Hispanic white ( 48.0, 53.5) 51.1( 47.8, 54.3) Non-Hispanic black ( 21.0, 26.4) 48.4( 41.5, 55.3) Hispanic ( 19.0, 23.7) 48.0( 41.6, 54.3) Non-Hispanic other ( 3.2, 5.2) 52.7( 40.8, 64.6) Education High school graduate or less ( 28.0, 33.1) 49.7( 44.7, 54.8) Some college or Associate degree ( 17.6, 22.0) 53.2( 47.1, 59.4) Bachelor's degree ( 27.2, 32.2) 45.4( 40.5, 50.4) Master's degree or above ( 17.8, 22.1) 53.3( 47.3, 59.3) Insurance* Private medical insurance ( 68.9, 73.8) 49.9(46.6,53.2) Medicare ( 28.1, 33.1) 50.0(45.2,54.9) Medicaid ( 17.8, 22.2) 50.7(44.5,56.9) 5

6 Weighted sample Demographic characteristic Sample Sample % Other public insurance ( 18.2, 22.7) 50.0(43.8,56.1) No Health Insurance ( 11.5, 15.1) 51.1(43.8,58.4) CI: confidence interval *Respondents may select more than one type of insurance FLEXIBILITY How flexible is our table? Suppose we need to replace education with marital status and place marital status after insurance. We would write the following statements: %SINGLE(age, %NRBQUOTE (Age in years), agef); %SINGLE(sex, %NRBQUOTE (Gender), sexf); %SINGLE(race_ethn, %NRBQUOTE (Race/Ethnicity), racef); /* %SINGLE(education, %NRBQUOTE (Education), educationf); OLD LINE COMMENTED */ %SINGLE(marital_status, %NRBQUOTE (Marital status),maritalf); /* NEW LINE */ and then construct the data set for printing like this: data forprint; set total age sex race_ethn combined_multy marital_status; where combined_multy is the combined insurance data created earlier. Could not be easier! What if the format of the table is different? For example, a table might require parallel columns for two separate groups of survey respondents (males and females in the example below). Males Females Demographic characteristic Total Sample Sample Age Race/Ethnicity Non-Hispanic white Non-Hispanic black Hispanic Non-Hispanic other 6

7 No worry! Using the variable indicating the category of the group of survey respondents (in this example, Gender ) apply the macros presented above (TOTAL, %SINGLE, %MULTY, as needed]) to the first group ( Males ), renaming the macros output data sets with a marker for the group (e.g., age_male). Then apply the same macros to the second group ( Females ). Before combining the resulting data sets using the data combined_single step outlined above, merge the two data sets by each individual variable (in the above example, there will be two data sets each for age and race_ethn). After merging, the data sets that now contain output from the two separate groups of survey respondents can be combined using the data combined_single step outlined above. For the table shell presented above, do not forget to drop the unweighted percent. As needed, apply the other macros and combine all data sets to print in the format of the table shell. To print this kind of table one can use PROC REPORT rather than PROC PRINT. Done! DISCLAIMER All the numbers in the table are based on randomly generated data and, therefore, have nothing in common with any survey data we have dealt with in our work. CONTACT INFORMATION David Izrael Abt Associates Inc, david_izrael@abtassoc.com 7

Paper RIV15 SAS Macros to Produce Publication-ready Tables from SAS Survey Procedures

Paper RIV15 SAS Macros to Produce Publication-ready Tables from SAS Survey Procedures Paper RIV15 SAS Macros to Produce Publication-ready Tables from SAS Survey Procedures ABSTRACT Emma L. Frazier, Centers for Disease Control, Atlanta, Georgia Shuyan Zhang, ICF International, Atlanta, Georgia

More information

Enrollment under the Medicaid Expansion and Health Insurance Exchanges. A Focus on Those with Behavioral Health Conditions in Maine

Enrollment under the Medicaid Expansion and Health Insurance Exchanges. A Focus on Those with Behavioral Health Conditions in Maine Enrollment under the Medicaid Expansion and Health Insurance Exchanges A Focus on Those with Behavioral Health Conditions in Maine Data Sources National Survey on Drug Use and Health Sponsored by SAMHSA

More information

Enrollment under the Medicaid Expansion and Health Insurance Exchanges. A Focus on Those with Behavioral Health Conditions in Indiana

Enrollment under the Medicaid Expansion and Health Insurance Exchanges. A Focus on Those with Behavioral Health Conditions in Indiana Enrollment under the Medicaid Expansion and Health Insurance Exchanges A Focus on Those with Behavioral Health Conditions in Indiana Data Sources National Survey on Drug Use and Health Sponsored by SAMHSA

More information

Analysis of Survey Data Using the SAS SURVEY Procedures: A Primer

Analysis of Survey Data Using the SAS SURVEY Procedures: A Primer Analysis of Survey Data Using the SAS SURVEY Procedures: A Primer Patricia A. Berglund, Institute for Social Research - University of Michigan Wisconsin and Illinois SAS User s Group June 25, 2014 1 Overview

More information

Enrollment under the Medicaid Expansion and Health Insurance Exchanges. A Focus on Those with Behavioral Health Conditions in Florida

Enrollment under the Medicaid Expansion and Health Insurance Exchanges. A Focus on Those with Behavioral Health Conditions in Florida Enrollment under the Medicaid Expansion and Health Insurance Exchanges A Focus on Those with Behavioral Health Conditions in Florida Data Sources National Survey on Drug Use and Health Sponsored by SAMHSA

More information

Enrollment under the Medicaid Expansion and Health Insurance Exchanges. A Focus on Those with Behavioral Health Conditions in Washington

Enrollment under the Medicaid Expansion and Health Insurance Exchanges. A Focus on Those with Behavioral Health Conditions in Washington Enrollment under the Medicaid Expansion and Health Insurance Exchanges A Focus on Those with Behavioral Health Conditions in Washington Data Sources National Survey on Drug Use and Health Sponsored by

More information

Enrollment under the Medicaid Expansion and Health Insurance Exchanges. A Focus on Those with Behavioral Health Conditions in Georgia

Enrollment under the Medicaid Expansion and Health Insurance Exchanges. A Focus on Those with Behavioral Health Conditions in Georgia Enrollment under the Medicaid Expansion and Health Insurance Exchanges A Focus on Those with Behavioral Health Conditions in Georgia Data Sources National Survey on Drug Use and Health Sponsored by SAMHSA

More information

Enrollment under the Medicaid Expansion and Health Insurance Exchanges. A Focus on Those with Behavioral Health Conditions in Idaho

Enrollment under the Medicaid Expansion and Health Insurance Exchanges. A Focus on Those with Behavioral Health Conditions in Idaho Enrollment under the Medicaid Expansion and Health Insurance Exchanges A Focus on Those with Behavioral Health Conditions in Idaho Data Sources National Survey on Drug Use and Health Sponsored by SAMHSA

More information

Enrollment under the Medicaid Expansion and Health Insurance Exchanges. A Focus on Those with Behavioral Health Conditions in New Hampshire

Enrollment under the Medicaid Expansion and Health Insurance Exchanges. A Focus on Those with Behavioral Health Conditions in New Hampshire Enrollment under the Medicaid Expansion and Health Insurance Exchanges A Focus on Those with Behavioral Health Conditions in New Hampshire Data Sources National Survey on Drug Use and Health Sponsored

More information

Figure 1.1 Percentage of persons without health insurance coverage: all ages, United States, 1997-2001

Figure 1.1 Percentage of persons without health insurance coverage: all ages, United States, 1997-2001 Figure 1.1 Percentage of persons without health insurance coverage: all ages, United States, 1997-2001 DATA SOURCE: Family Core component of the 1997-2001 National Health Interview Surveys. The estimate

More information

PROC LOGISTIC: Traps for the unwary Peter L. Flom, Independent statistical consultant, New York, NY

PROC LOGISTIC: Traps for the unwary Peter L. Flom, Independent statistical consultant, New York, NY PROC LOGISTIC: Traps for the unwary Peter L. Flom, Independent statistical consultant, New York, NY ABSTRACT Keywords: Logistic. INTRODUCTION This paper covers some gotchas in SAS R PROC LOGISTIC. A gotcha

More information

Methodologies for Converting Microsoft Excel Spreadsheets to SAS datasets

Methodologies for Converting Microsoft Excel Spreadsheets to SAS datasets Methodologies for Converting Microsoft Excel Spreadsheets to SAS datasets Karin LaPann ViroPharma Incorporated ABSTRACT Much functionality has been added to the SAS to Excel procedures in SAS version 9.

More information

Methods for Interaction Detection in Predictive Modeling Using SAS Doug Thompson, PhD, Blue Cross Blue Shield of IL, NM, OK & TX, Chicago, IL

Methods for Interaction Detection in Predictive Modeling Using SAS Doug Thompson, PhD, Blue Cross Blue Shield of IL, NM, OK & TX, Chicago, IL Paper SA01-2012 Methods for Interaction Detection in Predictive Modeling Using SAS Doug Thompson, PhD, Blue Cross Blue Shield of IL, NM, OK & TX, Chicago, IL ABSTRACT Analysts typically consider combinations

More information

Simulate PRELOADFMT Option in PROC FREQ Ajay Gupta, PPD, Morrisville, NC

Simulate PRELOADFMT Option in PROC FREQ Ajay Gupta, PPD, Morrisville, NC ABSTRACT PharmaSUG 2015 - Paper QT33 Simulate PRELOADFMT Option in PROC FREQ Ajay Gupta, PPD, Morrisville, NC In Pharmaceuticals/CRO industries, table programing is often started when only partial data

More information

Workshop on Using the National Survey of Children s s Health Dataset: Practical Applications

Workshop on Using the National Survey of Children s s Health Dataset: Practical Applications Workshop on Using the National Survey of Children s s Health Dataset: Practical Applications Julian Luke Stephen Blumberg Centers for Disease Control and Prevention National Center for Health Statistics

More information

WHO STEPS Surveillance Support Materials. STEPS Epi Info Training Guide

WHO STEPS Surveillance Support Materials. STEPS Epi Info Training Guide STEPS Epi Info Training Guide Department of Chronic Diseases and Health Promotion World Health Organization 20 Avenue Appia, 1211 Geneva 27, Switzerland For further information: www.who.int/chp/steps WHO

More information

Data Presentation. Paper 126-27. Using SAS Macros to Create Automated Excel Reports Containing Tables, Charts and Graphs

Data Presentation. Paper 126-27. Using SAS Macros to Create Automated Excel Reports Containing Tables, Charts and Graphs Paper 126-27 Using SAS Macros to Create Automated Excel Reports Containing Tables, Charts and Graphs Tugluke Abdurazak Abt Associates Inc. 1110 Vermont Avenue N.W. Suite 610 Washington D.C. 20005-3522

More information

EXST SAS Lab Lab #4: Data input and dataset modifications

EXST SAS Lab Lab #4: Data input and dataset modifications EXST SAS Lab Lab #4: Data input and dataset modifications Objectives 1. Import an EXCEL dataset. 2. Infile an external dataset (CSV file) 3. Concatenate two datasets into one 4. The PLOT statement will

More information

Effective Use of SQL in SAS Programming

Effective Use of SQL in SAS Programming INTRODUCTION Effective Use of SQL in SAS Programming Yi Zhao Merck & Co. Inc., Upper Gwynedd, Pennsylvania Structured Query Language (SQL) is a data manipulation tool of which many SAS programmers are

More information

How to set the main menu of STATA to default factory settings standards

How to set the main menu of STATA to default factory settings standards University of Pretoria Data analysis for evaluation studies Examples in STATA version 11 List of data sets b1.dta (To be created by students in class) fp1.xls (To be provided to students) fp1.txt (To be

More information

Technical Notes for HCAHPS Star Ratings

Technical Notes for HCAHPS Star Ratings Overview of HCAHPS Star Ratings Technical Notes for HCAHPS Star Ratings As part of the initiative to add five-star quality ratings to its Compare Web sites, the Centers for Medicare & Medicaid Services

More information

IBM SPSS Statistics for Beginners for Windows

IBM SPSS Statistics for Beginners for Windows ISS, NEWCASTLE UNIVERSITY IBM SPSS Statistics for Beginners for Windows A Training Manual for Beginners Dr. S. T. Kometa A Training Manual for Beginners Contents 1 Aims and Objectives... 3 1.1 Learning

More information

Survey Analysis: Options for Missing Data

Survey Analysis: Options for Missing Data Survey Analysis: Options for Missing Data Paul Gorrell, Social & Scientific Systems, Inc., Silver Spring, MD Abstract A common situation researchers working with survey data face is the analysis of missing

More information

Using the American Community Survey Data

Using the American Community Survey Data Using the American Community Survey Data The ACS website is accessible from www.census.gov/acs. In the middle of the screen choose In-depth Data Go directly to the ACS Datasets Tab The ACS Datasets Tab

More information

HEALTH INSURANCE COVERAGE STATUS. 2009-2013 American Community Survey 5-Year Estimates

HEALTH INSURANCE COVERAGE STATUS. 2009-2013 American Community Survey 5-Year Estimates S2701 HEALTH INSURANCE COVERAGE STATUS 2009-2013 American Community Survey 5-Year Estimates Supporting documentation on code lists, subject definitions, data accuracy, and statistical testing can be found

More information

FACILITATOR/MENTOR GUIDE

FACILITATOR/MENTOR GUIDE FACILITATOR/MENTOR GUIDE Descriptive analysis variables table shells hypotheses Measures of association methods design justify analytic assess calculate analysis problem stratify confounding statistical

More information

Instructions for Analyzing Data from CAHPS Surveys:

Instructions for Analyzing Data from CAHPS Surveys: Instructions for Analyzing Data from CAHPS Surveys: Using the CAHPS Analysis Program Version 3.6 The CAHPS Analysis Program...1 Computing Requirements...1 Pre-Analysis Decisions...2 What Does the CAHPS

More information

Health Services Research Utilizing Electronic Health Record Data: A Grad Student How-To Paper

Health Services Research Utilizing Electronic Health Record Data: A Grad Student How-To Paper Paper 3485-2015 Health Services Research Utilizing Electronic Health Record Data: A Grad Student How-To Paper Ashley W. Collinsworth, ScD, MPH, Baylor Scott & White Health and Tulane University School

More information

Northumberland Knowledge

Northumberland Knowledge Northumberland Knowledge Know Guide How to Analyse Data - November 2012 - This page has been left blank 2 About this guide The Know Guides are a suite of documents that provide useful information about

More information

Chartpack. August 2008

Chartpack. August 2008 Chartpack Examining Sources of Coverage Among Medicare Beneficiaries: Supplemental Insurance, Medicare Advantage, and Prescription Drug Coverage Findings from the Medicare Current Beneficiary Survey, 2006

More information

Enrollment Data Undergraduate Programs by Race/ethnicity and Gender (Fall 2008) Summary Data Undergraduate Programs by Race/ethnicity

Enrollment Data Undergraduate Programs by Race/ethnicity and Gender (Fall 2008) Summary Data Undergraduate Programs by Race/ethnicity Enrollment Data Undergraduate Programs by Race/ethnicity and Gender (Fall 8) Summary Data Undergraduate Programs by Race/ethnicity The following tables and figures depict 8, 7, and 6 enrollment data for

More information

Counting the Ways to Count in SAS. Imelda C. Go, South Carolina Department of Education, Columbia, SC

Counting the Ways to Count in SAS. Imelda C. Go, South Carolina Department of Education, Columbia, SC Paper CC 14 Counting the Ways to Count in SAS Imelda C. Go, South Carolina Department of Education, Columbia, SC ABSTRACT This paper first takes the reader through a progression of ways to count in SAS.

More information

HOUSEHOLDS WITH HIGH LEVELS OF NET ASSETS

HOUSEHOLDS WITH HIGH LEVELS OF NET ASSETS HOUSEHOLDS WITH HIGH LEVELS OF NET ASSETS Report to the Consumer Federation of America and Providian Financial Corp. Catherine P. Montalto, Ph.D. Associate Professor Consumer and Textile Sciences Department

More information

Introduction to SAS Business Intelligence/Enterprise Guide Alex Dmitrienko, Ph.D., Eli Lilly and Company, Indianapolis, IN

Introduction to SAS Business Intelligence/Enterprise Guide Alex Dmitrienko, Ph.D., Eli Lilly and Company, Indianapolis, IN Paper TS600 Introduction to SAS Business Intelligence/Enterprise Guide Alex Dmitrienko, Ph.D., Eli Lilly and Company, Indianapolis, IN ABSTRACT This paper provides an overview of new SAS Business Intelligence

More information

Example: Find the expected value of the random variable X. X 2 4 6 7 P(X) 0.3 0.2 0.1 0.4

Example: Find the expected value of the random variable X. X 2 4 6 7 P(X) 0.3 0.2 0.1 0.4 MATH 110 Test Three Outline of Test Material EXPECTED VALUE (8.5) Super easy ones (when the PDF is already given to you as a table and all you need to do is multiply down the columns and add across) Example:

More information

Using the Magical Keyword "INTO:" in PROC SQL

Using the Magical Keyword INTO: in PROC SQL Using the Magical Keyword "INTO:" in PROC SQL Thiru Satchi Blue Cross and Blue Shield of Massachusetts, Boston, Massachusetts Abstract INTO: host-variable in PROC SQL is a powerful tool. It simplifies

More information

Guido s Guide to PROC FREQ A Tutorial for Beginners Using the SAS System Joseph J. Guido, University of Rochester Medical Center, Rochester, NY

Guido s Guide to PROC FREQ A Tutorial for Beginners Using the SAS System Joseph J. Guido, University of Rochester Medical Center, Rochester, NY Guido s Guide to PROC FREQ A Tutorial for Beginners Using the SAS System Joseph J. Guido, University of Rochester Medical Center, Rochester, NY ABSTRACT PROC FREQ is an essential procedure within BASE

More information

Paper AD11 Exceptional Exception Reports

Paper AD11 Exceptional Exception Reports Paper AD11 Exceptional Exception Reports Gary McQuown Data and Analytic Solutions Inc. http://www.dasconsultants.com Introduction This paper presents an overview of exception reports for data quality control

More information

Descriptive Statistics Categorical Variables

Descriptive Statistics Categorical Variables Descriptive Statistics Categorical Variables 3 Introduction... 41 Computing Frequency Counts and Percentages... 42 Computing Frequencies on a Continuous Variable... 44 Using Formats to Group Observations...

More information

Utilizing Clinical SAS Report Templates with ODS Sunil Kumar Gupta, Gupta Programming, Simi Valley, CA

Utilizing Clinical SAS Report Templates with ODS Sunil Kumar Gupta, Gupta Programming, Simi Valley, CA Utilizing Clinical SAS Report Templates with ODS Sunil Kumar Gupta, Gupta Programming, Simi Valley, CA ABSTRACT SAS progrannners often have the responsibility of supporting the reporting needs of the Clinical

More information

Healthcare Utilization by Individuals with Criminal Justice Involvement: Results of a National Survey

Healthcare Utilization by Individuals with Criminal Justice Involvement: Results of a National Survey Healthcare Utilization by Individuals with Criminal Justice Involvement: Results of a National Survey Frank JW, Linder JA, Becker WC, Fiellin DA, Wang EA Background U.S. criminal justice population is

More information

Main Effects and Interactions

Main Effects and Interactions Main Effects & Interactions page 1 Main Effects and Interactions So far, we ve talked about studies in which there is just one independent variable, such as violence of television program. You might randomly

More information

SPSS and AM statistical software example.

SPSS and AM statistical software example. A detailed example of statistical analysis using the NELS:88 data file and ECB, to perform a longitudinal analysis of 1988 8 th graders in the year 2000: SPSS and AM statistical software example. Overall

More information

KEY FEATURES OF SOURCE CONTROL UTILITIES

KEY FEATURES OF SOURCE CONTROL UTILITIES Source Code Revision Control Systems and Auto-Documenting Headers for SAS Programs on a UNIX or PC Multiuser Environment Terek Peterson, Alliance Consulting Group, Philadelphia, PA Max Cherny, Alliance

More information

SPSS Workbook 1 Data Entry : Questionnaire Data

SPSS Workbook 1 Data Entry : Questionnaire Data TEESSIDE UNIVERSITY SCHOOL OF HEALTH & SOCIAL CARE SPSS Workbook 1 Data Entry : Questionnaire Data Prepared by: Sylvia Storey s.storey@tees.ac.uk SPSS data entry 1 This workbook is designed to introduce

More information

Creating Dynamic Reports Using Data Exchange to Excel

Creating Dynamic Reports Using Data Exchange to Excel Creating Dynamic Reports Using Data Exchange to Excel Liping Huang Visiting Nurse Service of New York ABSTRACT The ability to generate flexible reports in Excel is in great demand. This paper illustrates

More information

Emailing Automated Notification of Errors in a Batch SAS Program Julie Kilburn, City of Hope, Duarte, CA Rebecca Ottesen, City of Hope, Duarte, CA

Emailing Automated Notification of Errors in a Batch SAS Program Julie Kilburn, City of Hope, Duarte, CA Rebecca Ottesen, City of Hope, Duarte, CA Emailing Automated Notification of Errors in a Batch SAS Program Julie Kilburn, City of Hope, Duarte, CA Rebecca Ottesen, City of Hope, Duarte, CA ABSTRACT With multiple programmers contributing to a batch

More information

Drawing a histogram using Excel

Drawing a histogram using Excel Drawing a histogram using Excel STEP 1: Examine the data to decide how many class intervals you need and what the class boundaries should be. (In an assignment you may be told what class boundaries to

More information

Cohort Analysis for Genetic Epidemiology (C. A.G. E.) User Reference Manual

Cohort Analysis for Genetic Epidemiology (C. A.G. E.) User Reference Manual Cohort Analysis for Genetic Epidemiology (C. A.G. E.) User Reference Manual CAGE is a UNIX based program, which calculates the standardized cancer incidence ratios (Observed / Expected) with 95% confidence

More information

Chapter 5 Analysis of variance SPSS Analysis of variance

Chapter 5 Analysis of variance SPSS Analysis of variance Chapter 5 Analysis of variance SPSS Analysis of variance Data file used: gss.sav How to get there: Analyze Compare Means One-way ANOVA To test the null hypothesis that several population means are equal,

More information

Paper PO06. Randomization in Clinical Trial Studies

Paper PO06. Randomization in Clinical Trial Studies Paper PO06 Randomization in Clinical Trial Studies David Shen, WCI, Inc. Zaizai Lu, AstraZeneca Pharmaceuticals ABSTRACT Randomization is of central importance in clinical trials. It prevents selection

More information

SUGI 29 Statistics and Data Analysis

SUGI 29 Statistics and Data Analysis Paper 194-29 Head of the CLASS: Impress your colleagues with a superior understanding of the CLASS statement in PROC LOGISTIC Michelle L. Pritchard and David J. Pasta Ovation Research Group, San Francisco,

More information

The SURVEYFREQ Procedure in SAS 9.2: Avoiding FREQuent Mistakes When Analyzing Survey Data ABSTRACT INTRODUCTION SURVEY DESIGN 101 WHY STRATIFY?

The SURVEYFREQ Procedure in SAS 9.2: Avoiding FREQuent Mistakes When Analyzing Survey Data ABSTRACT INTRODUCTION SURVEY DESIGN 101 WHY STRATIFY? The SURVEYFREQ Procedure in SAS 9.2: Avoiding FREQuent Mistakes When Analyzing Survey Data Kathryn Martin, Maternal, Child and Adolescent Health Program, California Department of Public Health, ABSTRACT

More information

Death Data: CDC Wonder, Texas Health Data, and VitalWeb

Death Data: CDC Wonder, Texas Health Data, and VitalWeb Death Data: CDC Wonder, Texas Health Data, and VitalWeb Evidence-Based Public Health Practice Step 2: Quantify the Issue This handout demonstrates how to access CDC Wonder, Texas Health Data, and VitalWeb

More information

ABSTRACT INTRODUCTION STUDY DESCRIPTION

ABSTRACT INTRODUCTION STUDY DESCRIPTION ABSTRACT Paper 1675-2014 Validating Self-Reported Survey Measures Using SAS Sarah A. Lyons MS, Kimberly A. Kaphingst ScD, Melody S. Goodman PhD Washington University School of Medicine Researchers often

More information

Adverse Impact Ratio for Females (0/ 1) = 0 (5/ 17) = 0.2941 Adverse impact as defined by the 4/5ths rule was not found in the above data.

Adverse Impact Ratio for Females (0/ 1) = 0 (5/ 17) = 0.2941 Adverse impact as defined by the 4/5ths rule was not found in the above data. 1 of 9 12/8/2014 12:57 PM (an On-Line Internet based application) Instructions: Please fill out the information into the form below. Once you have entered your data below, you may select the types of analysis

More information

Youth Risk Behavior Survey (YRBS) Software for Analysis of YRBS Data

Youth Risk Behavior Survey (YRBS) Software for Analysis of YRBS Data Youth Risk Behavior Survey (YRBS) Software for Analysis of YRBS Data CONTENTS Overview 1 Background 1 1. SUDAAN 2 1.1. Analysis capabilities 2 1.2. Data requirements 2 1.3. Variance estimation 2 1.4. Survey

More information

Software for Analysis of YRBS Data

Software for Analysis of YRBS Data Youth Risk Behavior Surveillance System (YRBSS) Software for Analysis of YRBS Data June 2014 Where can I get more information? Visit www.cdc.gov/yrbss or call 800 CDC INFO (800 232 4636). CONTENTS Overview

More information

SPSS Manual for Introductory Applied Statistics: A Variable Approach

SPSS Manual for Introductory Applied Statistics: A Variable Approach SPSS Manual for Introductory Applied Statistics: A Variable Approach John Gabrosek Department of Statistics Grand Valley State University Allendale, MI USA August 2013 2 Copyright 2013 John Gabrosek. All

More information

Salary. Cumulative Frequency

Salary. Cumulative Frequency HW01 Answering the Right Question with the Right PROC Carrie Mariner, Afton-Royal Training & Consulting, Richmond, VA ABSTRACT When your boss comes to you and says "I need this report by tomorrow!" do

More information

Using SAS to Examine Health-Promoting Life Style Activities of Upper Division Nursing Students at USC

Using SAS to Examine Health-Promoting Life Style Activities of Upper Division Nursing Students at USC SESUG 2015 Paper PO-46 Using SAS to Examine Health-Promoting Life Style Activities of Upper Division Nursing Students at USC Abbas S. Tavakoli, DrPH, MPH, ME, Mary Boyd, Phd, RN, ABSTRACT Health promotion

More information

Using Excel s PivotTable to Analyze Learning Assessment Data

Using Excel s PivotTable to Analyze Learning Assessment Data Using Excel s PivotTable to Analyze Learning Assessment Data Assessment Office University of Hawaiʻiat Mānoa Feb 13, 2013 1 Mission: Improve student learning through program assessment 2 1 Learning Outcomes

More information

Is it statistically significant? The chi-square test

Is it statistically significant? The chi-square test UAS Conference Series 2013/14 Is it statistically significant? The chi-square test Dr Gosia Turner Student Data Management and Analysis 14 September 2010 Page 1 Why chi-square? Tests whether two categorical

More information

Christianna S. Williams, University of North Carolina at Chapel Hill, Chapel Hill, NC

Christianna S. Williams, University of North Carolina at Chapel Hill, Chapel Hill, NC Christianna S. Williams, University of North Carolina at Chapel Hill, Chapel Hill, NC ABSTRACT Have you used PROC MEANS or PROC SUMMARY and wished there was something intermediate between the NWAY option

More information

Jessica S. Banthin and Thomas M. Selden. Agency for Healthcare Research and Quality Working Paper No. 06005. July 2006

Jessica S. Banthin and Thomas M. Selden. Agency for Healthcare Research and Quality Working Paper No. 06005. July 2006 Income Measurement in the Medical Expenditure Panel Survey Jessica S. Banthin and Thomas M. Selden Agency for Healthcare Research and Quality Working Paper No. 06005 July 2006 Suggested citation: Banthin

More information

Survey, Statistics and Psychometrics Core Research Facility University of Nebraska-Lincoln. Log-Rank Test for More Than Two Groups

Survey, Statistics and Psychometrics Core Research Facility University of Nebraska-Lincoln. Log-Rank Test for More Than Two Groups Survey, Statistics and Psychometrics Core Research Facility University of Nebraska-Lincoln Log-Rank Test for More Than Two Groups Prepared by Harlan Sayles (SRAM) Revised by Julia Soulakova (Statistics)

More information

Utilizing Clinical SAS Report Templates Sunil Kumar Gupta Gupta Programming, Thousand Oaks, CA

Utilizing Clinical SAS Report Templates Sunil Kumar Gupta Gupta Programming, Thousand Oaks, CA Utilizing Clinical SAS Report Templates Sunil Kumar Gupta Gupta Programming, Thousand Oaks, CA ABSTRACT SAS programmers often have the responsibility of supporting the reporting needs of the Clinical Affairs

More information

Health Care and Life Sciences

Health Care and Life Sciences Sensitivity, Specificity, Accuracy, Associated Confidence Interval and ROC Analysis with Practical SAS Implementations Wen Zhu 1, Nancy Zeng 2, Ning Wang 2 1 K&L consulting services, Inc, Fort Washington,

More information

Directions for using SPSS

Directions for using SPSS Directions for using SPSS Table of Contents Connecting and Working with Files 1. Accessing SPSS... 2 2. Transferring Files to N:\drive or your computer... 3 3. Importing Data from Another File Format...

More information

Innovative Techniques and Tools to Detect Data Quality Problems

Innovative Techniques and Tools to Detect Data Quality Problems Paper DM05 Innovative Techniques and Tools to Detect Data Quality Problems Hong Qi and Allan Glaser Merck & Co., Inc., Upper Gwynnedd, PA ABSTRACT High quality data are essential for accurate and meaningful

More information

Performing Queries Using PROC SQL (1)

Performing Queries Using PROC SQL (1) SAS SQL Contents Performing queries using PROC SQL Performing advanced queries using PROC SQL Combining tables horizontally using PROC SQL Combining tables vertically using PROC SQL 2 Performing Queries

More information

Using Names To Check Accuracy of Race and Gender Coding in NAEP

Using Names To Check Accuracy of Race and Gender Coding in NAEP Using Names To Check Accuracy of Race and Gender Coding in NAEP Jennifer Czuprynski Kali, James Bethel, John Burke, David Morganstein, and Sharon Hirabayashi Westat Keywords: Data quality, Coding errors,

More information

Comparing 2010 SIPP and 2013 CPS Content Test Health Insurance Offer and Take-Up Rates 1. Hubert Janicki U.S Census Bureau, Washington D.

Comparing 2010 SIPP and 2013 CPS Content Test Health Insurance Offer and Take-Up Rates 1. Hubert Janicki U.S Census Bureau, Washington D. Comparing 2010 SIPP and 2013 CPS Content Test Health Insurance Offer and Take-Up Rates 1 Hubert Janicki U.S Census Bureau, Washington D.C Abstract This brief compares employment-based health insurance

More information

watch Introduction January 2012 No. 83

watch Introduction January 2012 No. 83 AND health COLORADO DEPARTMENT OF PUBLIC HEALTH AND ENVIRONMENT watch January 2012 No. 83 U N I O N CONSTITUTION Public versus Private Health Insurance in Colorado at a Glance: A Glimpse at Health Care

More information

Data The estimates presented in the tables originate from the 2013 SCS to the NCVS. The SCS collects information about student and school

Data The estimates presented in the tables originate from the 2013 SCS to the NCVS. The SCS collects information about student and school This document reports data from the 2013 School Crime Supplement (SCS) of the National Crime Victimization Survey (NCVS). 1 The Web Tables show the extent to which with different personal characteristics

More information

Figure 1.1. Percentage of persons of all ages without health insurance coverage: United States, 1997 2005

Figure 1.1. Percentage of persons of all ages without health insurance coverage: United States, 1997 2005 Figure 1.1. Percentage of persons of all ages without health insurance coverage: United States, 1997 2005 Percent 20 95% confidence interval 15 10 5 0 1997 1998 1999 2000 2001 2002 2003 2004 2005 (Jan.

More information

Federal Employee Viewpoint Survey Online Reporting and Analysis Tool

Federal Employee Viewpoint Survey Online Reporting and Analysis Tool Federal Employee Viewpoint Survey Online Reporting and Analysis Tool Tutorial January 2013 NOTE: If you have any questions about the FEVS Online Reporting and Analysis Tool, please contact your OPM point

More information

Part 3. Comparing Groups. Chapter 7 Comparing Paired Groups 189. Chapter 8 Comparing Two Independent Groups 217

Part 3. Comparing Groups. Chapter 7 Comparing Paired Groups 189. Chapter 8 Comparing Two Independent Groups 217 Part 3 Comparing Groups Chapter 7 Comparing Paired Groups 189 Chapter 8 Comparing Two Independent Groups 217 Chapter 9 Comparing More Than Two Groups 257 188 Elementary Statistics Using SAS Chapter 7 Comparing

More information

SPSS (Statistical Package for the Social Sciences)

SPSS (Statistical Package for the Social Sciences) SPSS (Statistical Package for the Social Sciences) What is SPSS? SPSS stands for Statistical Package for the Social Sciences The SPSS home-page is: www.spss.com 2 What can you do with SPSS? Run Frequencies

More information

Intro to Longitudinal Data: A Grad Student How-To Paper Elisa L. Priest 1,2, Ashley W. Collinsworth 1,3 1

Intro to Longitudinal Data: A Grad Student How-To Paper Elisa L. Priest 1,2, Ashley W. Collinsworth 1,3 1 Intro to Longitudinal Data: A Grad Student How-To Paper Elisa L. Priest 1,2, Ashley W. Collinsworth 1,3 1 Institute for Health Care Research and Improvement, Baylor Health Care System 2 University of North

More information

Tutorial Segmentation and Classification

Tutorial Segmentation and Classification MARKETING ENGINEERING FOR EXCEL TUTORIAL VERSION 1.0.8 Tutorial Segmentation and Classification Marketing Engineering for Excel is a Microsoft Excel add-in. The software runs from within Microsoft Excel

More information

Abbas S. Tavakoli, DrPH, MPH, ME 1 ; Nikki R. Wooten, PhD, LISW-CP 2,3, Jordan Brittingham, MSPH 4

Abbas S. Tavakoli, DrPH, MPH, ME 1 ; Nikki R. Wooten, PhD, LISW-CP 2,3, Jordan Brittingham, MSPH 4 1 Paper 1680-2016 Using GENMOD to Analyze Correlated Data on Military System Beneficiaries Receiving Inpatient Behavioral Care in South Carolina Care Systems Abbas S. Tavakoli, DrPH, MPH, ME 1 ; Nikki

More information

Best Practice in SAS programs validation. A Case Study

Best Practice in SAS programs validation. A Case Study Best Practice in SAS programs validation. A Case Study CROS NT srl Contract Research Organisation Clinical Data Management Statistics Dr. Paolo Morelli, CEO Dr. Luca Girardello, SAS programmer AGENDA Introduction

More information

January 26, 2009 The Faculty Center for Teaching and Learning

January 26, 2009 The Faculty Center for Teaching and Learning THE BASICS OF DATA MANAGEMENT AND ANALYSIS A USER GUIDE January 26, 2009 The Faculty Center for Teaching and Learning THE BASICS OF DATA MANAGEMENT AND ANALYSIS Table of Contents Table of Contents... i

More information

Failure to take the sampling scheme into account can lead to inaccurate point estimates and/or flawed estimates of the standard errors.

Failure to take the sampling scheme into account can lead to inaccurate point estimates and/or flawed estimates of the standard errors. Analyzing Complex Survey Data: Some key issues to be aware of Richard Williams, University of Notre Dame, http://www3.nd.edu/~rwilliam/ Last revised January 24, 2015 Rather than repeat material that is

More information

THE ASSOCIATED PRESS-CNBC INVESTORS SURVEY CONDUCTED BY KNOWLEDGE NETWORKS

THE ASSOCIATED PRESS-CNBC INVESTORS SURVEY CONDUCTED BY KNOWLEDGE NETWORKS 1350 Willow Rd, Suite 102 Menlo Park, CA 94025 www.knowledgenetworks.com Interview dates: August 26 September 08, 2010 Interviews: 1,035 adults who own stocks, bonds or mutual funds Sampling margin of

More information

CCF Guide to the ACS Health Insurance Coverage Data

CCF Guide to the ACS Health Insurance Coverage Data CCF Guide to the ACS Health Insurance Coverage Data 2013 ACS Timeline o Thursday, September 19: The Census Bureau will release the 2012 one-year estimates o Week of November 4: Advocates that would like

More information

Racial and Ethnic Differences in Health Insurance Coverage Among Adult Workers in Florida. Jacky LaGrace Mentor: Dr. Allyson Hall

Racial and Ethnic Differences in Health Insurance Coverage Among Adult Workers in Florida. Jacky LaGrace Mentor: Dr. Allyson Hall Racial and Ethnic Differences in Health Insurance Coverage Among Adult Workers in Florida Jacky LaGrace Mentor: Dr. Allyson Hall Overview Background Study objective Methods Results Conclusion Limitations/Future

More information

Health Insurance Coverage: Estimates from the National Health Interview Survey, 2005

Health Insurance Coverage: Estimates from the National Health Interview Survey, 2005 Health Insurance Coverage: Estimates from the National Health Interview Survey, 2005 by Robin A. Cohen, Ph.D., and Michael E. Martinez, M.P.H., Division of Health Interview Statistics, National Center

More information

Health Insurance Coverage: Estimates from the National Health Interview Survey, 2004

Health Insurance Coverage: Estimates from the National Health Interview Survey, 2004 Health Insurance Coverage: Estimates from the National Health Interview Survey, 2004 by Robin A. Cohen, Ph.D., and Michael E. Martinez, M.P.H., Division of Health Interview Statistics, National Center

More information

This book serves as a guide for those interested in using IBM

This book serves as a guide for those interested in using IBM 1 Overview This book serves as a guide for those interested in using IBM SPSS/PASW Statistics software to aid in statistical data analysis whether as a companion to a statistics or research methods course

More information

Experiences in Using Academic Data for BI Dashboard Development

Experiences in Using Academic Data for BI Dashboard Development Paper RIV09 Experiences in Using Academic Data for BI Dashboard Development Evangeline Collado, University of Central Florida; Michelle Parente, University of Central Florida ABSTRACT Business Intelligence

More information

Summary of R software commands used to generate bootstrap and permutation test output and figures in Chapter 16

Summary of R software commands used to generate bootstrap and permutation test output and figures in Chapter 16 Summary of R software commands used to generate bootstrap and permutation test output and figures in Chapter 16 Since R is command line driven and the primary software of Chapter 16, this document details

More information

Beyond the Basics: Advanced REPORT Procedure Tips and Tricks Updated for SAS 9.2 Allison McMahill Booth, SAS Institute Inc.

Beyond the Basics: Advanced REPORT Procedure Tips and Tricks Updated for SAS 9.2 Allison McMahill Booth, SAS Institute Inc. ABSTRACT PharmaSUG 2011 - Paper SAS-AD02 Beyond the Basics: Advanced REPORT Procedure Tips and Tricks Updated for SAS 9.2 Allison McMahill Booth, SAS Institute Inc., Cary, NC, USA This paper is an update

More information

5 Point Choice ( 五 分 選 擇 題 ): Allow a single rating of between 1 and 5 for the question at hand. Date ( 日 期 ): Enter a date Eg: What is your birthdate

5 Point Choice ( 五 分 選 擇 題 ): Allow a single rating of between 1 and 5 for the question at hand. Date ( 日 期 ): Enter a date Eg: What is your birthdate 5 Point Choice ( 五 分 選 擇 題 ): Allow a single rating of between 1 and 5 for the question at hand. Date ( 日 期 ): Enter a date Eg: What is your birthdate Gender ( 性 別 ): Offers participants a pre-defined

More information

Generating Randomization Schedules Using SAS Programming Chunqin Deng and Julia Graz, PPD, Inc., Research Triangle Park, North Carolina

Generating Randomization Schedules Using SAS Programming Chunqin Deng and Julia Graz, PPD, Inc., Research Triangle Park, North Carolina Paper 267-27 Generating Randomization Schedules Using SAS Programming Chunqin Deng and Julia Graz, PPD, Inc., Research Triangle Park, North Carolina ABSTRACT Randomization as a method of experimental control

More information

Evaluating the results of a car crash study using Statistical Analysis System. Kennesaw State University

Evaluating the results of a car crash study using Statistical Analysis System. Kennesaw State University Running head: EVALUATING THE RESULTS OF A CAR CRASH STUDY USING SAS 1 Evaluating the results of a car crash study using Statistical Analysis System Kennesaw State University 2 Abstract Part 1. The study

More information

Importing and Exporting With SPSS for Windows 17 TUT 117

Importing and Exporting With SPSS for Windows 17 TUT 117 Information Systems Services Importing and Exporting With TUT 117 Version 2.0 (Nov 2009) Contents 1. Introduction... 3 1.1 Aim of this Document... 3 2. Importing Data from Other Sources... 3 2.1 Reading

More information