Secondary Data Analysis

Similar documents
SAMPLE DESIGN RESEARCH FOR THE NATIONAL NURSING HOME SURVEY

ANALYTIC AND REPORTING GUIDELINES

Survey Data Analysis in Stata

Data The estimates presented in the tables originate from the 2013 SCS to the NCVS. The SCS collects information about student and school

Guidelines for Analyzing Add Health Data. Ping Chen Kim Chantala

Texas Diabetes Fact Sheet

CDC Secondary Database Sources

Workshop on Using the National Survey of Children s s Health Dataset: Practical Applications

Introduction to Sampling. Dr. Safaa R. Amer. Overview. for Non-Statisticians. Part II. Part I. Sample Size. Introduction.

Use and Integration of Freely Available U.S. Public Use Files to Answer Pharmacoeconomic Questions: Deciphering the Alphabet Soup

Kaiser Family Foundation/New York Times Survey of Chicago Residents

Application of Information Systems and Secondary Data. Lynda Burton, ScD Johns Hopkins University

Basic Overview of The National Ambulatory Medical Care Survey (NAMCS) and The National Hospital Ambulatory Medical Care Survey (NHAMCS)

Sampling Design for the National Hospital Ambulatory Medical Care Survey

In 2013, U.S. residents age 12 or older experienced

Sampling Error Estimation in Design-Based Analysis of the PSID Data

Variance Estimation Guidance, NHIS (Adapted from the NHIS Survey Description Documents) Introduction

Negative Binomials Regression Model in Analysis of Wait Time at Hospital Emergency Department

The SURVEYFREQ Procedure in SAS 9.2: Avoiding FREQuent Mistakes When Analyzing Survey Data ABSTRACT INTRODUCTION SURVEY DESIGN 101 WHY STRATIFY?

Crime in America

Maryland Population POLICY ACADEMY STATE PROFILE. Maryland MARYLAND POPULATION (IN 1,000S) BY AGE GROUP

Best Practices in Using Large, Complex Samples: The Importance of Using Appropriate Weights and Design Effect Compensation

Enrollment under the Medicaid Expansion and Health Insurance Exchanges. A Focus on Those with Behavioral Health Conditions in Washington

Problems Paying Medical Bills Among Persons Under Age 65: Early Release of Estimates From the National Health Interview Survey, 2011 June 2015

DATA EDITING AT THE NATIONAL CENTER FOR HEALTH STATISTICS

HowHow to Identify the Best Stock Broker For You

Sacramento County 2010

Descriptive Methods Ch. 6 and 7

San Diego County 2010

Using complete administration data for nonresponse analysis: The PASS survey of low-income households in Germany

Methodology. Report 1. Design and Methods of the Medical Expenditure Panel Survey Household Component

Educational Attainment

Why Sample? Why not study everyone? Debate about Census vs. sampling

Mode and Patient-mix Adjustment of the CAHPS Hospital Survey (HCAHPS)

Massachusetts Population

New Jersey Population

A User s Guide to the PSU-Census Bureau Research Data Center. Mark Roberts & Jennifer Van Hook

Enrollment under the Medicaid Expansion and Health Insurance Exchanges. A Focus on Those with Behavioral Health Conditions in Maine

2014 National Study of Long-Term Care Providers (NSLTCP) Adult Day Services Center Survey. Restricted Data File. July 2015

The California Census Research Data Center. Vocational Education Cluster

Health Insurance Coverage: Estimates from the National Health Interview Survey, 2004

Florida Population POLICY ACADEMY STATE PROFILE. Florida FLORIDA POPULATION (IN 1,000S) AGE GROUP

Results from the National Survey of Ambulatory Surgery (NSAS) Karen A. Cullen, PhD, MPH

Enrollment under the Medicaid Expansion and Health Insurance Exchanges. A Focus on Those with Behavioral Health Conditions in Indiana

PROFILE OF ADOLESCENT DISCHARGES FROM SUBSTANCE ABUSE TREATMENT

Mode and respondent effects in a dual-mode survey of physicians:

Enrollment under the Medicaid Expansion and Health Insurance Exchanges. A Focus on Those with Behavioral Health Conditions in Florida

Chapter 11 Introduction to Survey Sampling and Analysis Procedures

Center for Behavioral Health Statistics and Quality CBHSQ DATA REVIEW

Enrollment under the Medicaid Expansion and Health Insurance Exchanges. A Focus on Those with Behavioral Health Conditions in Georgia

Enrollment under the Medicaid Expansion and Health Insurance Exchanges. A Focus on Those with Behavioral Health Conditions in Idaho

Facts about Diabetes in Massachusetts

Health Insurance Coverage: Estimates from the National Health Interview Survey, 2005

Enrollment under the Medicaid Expansion and Health Insurance Exchanges. A Focus on Those with Behavioral Health Conditions in New Hampshire


In 2014, U.S. residents age 12 or older experienced

Complex Survey Design Using Stata

Comparison of Variance Estimates in a National Health Survey

3 Sources of Information about Crime:

Physician Assistant and Advance Practice Nurse Care in Hospital Outpatient Departments: United States,

Deja-vu all over again, or is it? : nursing home use in the 1990 s

Wages, Fringe Benefits, and Turnover for Direct Care Workers Working for Long-Term Care Providers in Oregon

Youth Risk Behavior Survey (YRBS) Software for Analysis of YRBS Data

Americans and text messaging

CHAPTER 6: Substance Abuse and Mental Health A Comparison of Appalachian Coal Mining Areas to Other Areas within the Appalachian Region

Research Opportunities at the Triangle Census Research Data Center

An Introduction to Secondary Data Analysis

CHILDHOOD SEXUAL ABUSE FACT SHEET

E-reader Ownership Doubles in Six Months

Behavioral Health Barometer. United States, 2013

Macomb County Office of Substance Abuse MCOSA. Executive Summary

Texas Census Research Data Center

Multimode Strategies for Designing Establishment Surveys

Power Calculation Using the Online Variance Almanac (Web VA): A User s Guide

National Longitudinal Study of Adolescent Health. Strategies to Perform a Design-Based Analysis Using the Add Health Data

California Marijuana Arrests

Oklahoma county. Community Health Status Assessment

For the 10-year aggregate period , domestic violence

Workplace Violence Against Government Employees,

SECOND NATIONAL JUVENILE ONLINE VICTIMIZATION INCIDENCE STUDY (NJOV-2)

71% of online adults now use video sharing sites

NQS Priority #1: Making Care Safer by Reducing the Harm Caused in the Delivery of Care

Contacts between Police and the Public, 2008

Community Tracking Study. Comparison of Selected Statistical Software Packages for Variance Estimation in the CTS Surveys

A comparison of mail and face-to-face responses in a dualmode survey of physicians

OREGON PROPERLY VERIFIED CORRECTION OF DEFICIENCIES IDENTIFIED DURING SURVEYS OF NURSING HOMES PARTICIPATING IN MEDICARE AND MEDICAID

2015 Michigan Department of Health and Human Services Adult Medicaid Health Plan CAHPS Report

Characteristics and Outcomes Site Profiles Report Guide

Economic Impact of the Queen of Peace Hospital and Related Health Sectors of Scott County

Treatment. Race. Adults. Ethnicity. Services. Racial/Ethnic Differences in Mental Health Service Use among Adults. Inpatient Services.

Public Health Improvement Plan

24% of internet users have made phone calls online

APPENDIX V METHODOLOGY OF 2011 MONTANA HEALTH INSURANCE SURVEYS

Software for Analysis of YRBS Data

The 2014 Update of the Rural-Urban Chartbook

HOUSEHOLD TELEPHONE SERVICE AND USAGE PATTERNS IN THE U.S. IN 2004

A Preliminary Assessment of Risk and Recidivism of Illinois Prison Releasees

Prisoner Recidivism Analysis Tool (PRAT ) User s Guide

49. INFANT MORTALITY RATE. Infant mortality rate is defined as the death of an infant before his or her first birthday.

Transcription:

Slide 1 Secondary Data Analysis Young I. Cho University of Illinois Slide 2 What is secondary data? Data collected by a person or organization other than the users of the data 2 of 20 Slide 3 Advantages of Secondary Data Unobtrusive Fast & inexpensive Avoid data collection problems Provide bases for comparison 3 of 20

Slide 4 Disadvantages of Secondary Data Data availability Level of observation Quality of documentation Data quality control Outdated data 4 of 20 Slide 5 Data Sources Inter-university Consortium for Political and Social Research (ICPSR) http://www.icpsr.umich.edu/index-medium.html National Center for Health Statistics (NCHS) http://www.cdc.gov/nchs/default.htm Center for Medicare and Medicaid Services (CMS) http://cms.hhs.gov/researchers/ US Census Bureau http://www.census.gov/main/www/access.html 5 of 20 Slide 6 Data Sources (cont( cont.) Examples of Directly Downloadable Data from NCHS: National Health and Nutrition Examination Survey (NHANES) National Ambulatory Medical Care Survey (NAMCS) National Hospital Ambulatory Medical Care Survey (NHAMCS) National Hospital Discharge Survey (NHDS) National Home and Hospice Care Survey (NHHCS) National Nursing Home Survey (NNHS) National Survey of Ambulatory Surgery (NSAS) National Employer Health Insurance Survey (NEHIS) National Vital Statistics System (NVSS) National Health Interview Survey (NHIS) 6 of 20

Slide 7 Data Sources (cont( cont.) Data Available for Use with Survey Documentation and Analysis (SDA): Aging Data Longitudinal Study of Aging, 70 Years and Older, 1984-1990 National Survey of Self-Care and Aging: Follow-Up, 1994 National Health and Nutrition Examination Survey II: Mortality Study, 1992 National Hospital Discharge Survey, 1994-1997 National Health Interview Survey, 1994, Second Supplement on Aging Criminal Justice Data International Crime Data Homicide Data National Crime Victimization Survey Data Corrections Data 7 of 20 Slide 8 Data Sources (cont( cont.) Data Available for Use with Survey Documentation and Analysis (continued): Substance Abuse Data Drug Abuse Warning Network Monitoring the Future National Household Survey on Drug Abuse National Pregnancy and Health Survey National Treatment Improvement Evaluation Study Treatment Episode Data Set Uniform Facility Data Set Washington, DC Metropolitan Area Drug Study (DC*MADS) 8 of 20 Slide 9 Evaluation of Data Sources Purpose of the study Sponsor/collector of the data Mode of data collection Sampling procedures Consistency of data with other sources 9 of 20

Slide 10 Evaluation of Data Sources (cont.) Documentation Number of observations Number of variables Coding scheme Summary statistics 10 of 20 Slide 11 Types of Survey Sample Design Simple Random Sampling Systematic Sampling Complex sample designs? stratified designs? cluster designs? mixed mode designs 11 of 20 Slide 12 Why complex survey design? Increased efficiency Decreased costs 12 of 20

Slide 13 Complex Survey Design Complex designs with clustering and unequal selection probabilities generally increase the sampling variance. Not accounting for the impact of complex sample design can lead to Type I error. 13 of 20 Slide 14 Sample Weights Used to adjust for differing probabilities of selection In theory, simple random samples are self-weighted 14 of 20 Slide 15 Sample Weights (cont.) Used to adjust for differing probabilities of selection In theory, simple random samples are self-weighted In practice, simple random samples are likely to also require adjustments for non-response 15 of 20

Slide 16 Types of Sample Weights Post-stratification weights: designed to bring the sample proportions in demographic subgroups into agreement with the population proportion in the subgroups. 16 of 20 Slide 17 Types of Sample Weights (cont.) Non-response weights: designed to inflate the weights of survey respondents to compensate for nonrespondents with similar characteristics. 17 of 20 Slide 18 Types of Sample Weights (cont.) Blow-up (expansion) weights: provide estimates for the total population of interest 18 of 20

Slide 19 Syntax Examples of Design-based Analysis in STATA, SUDAAN & SAS STATA svyset strata strata svyset psu psu svyset pweight finalwt svyreg fatitk age male black hispanic SUDAAN proc regress data= c:\nhanes.sav filetype=spss desgn=wr; nest strata psu; weight finalwt subpgroup sex race; levels 2 3; model fatintk = age sex race; 19 of 20 Slide 20 Syntax Examples of Design-based Analysis in STATA, SUDAAN & SAS SAS proc surveyreg data=nhanes; strata strata; cluster psu; class sex race; model fatintk = age sex race; weight finalwt 20 of 20