A User's Guide to the PSU-Census Bureau Research Data Center. Mark Roberts & Jennifer Van Hook





Outline
- Introduction to RDCs
- Data Resources in Health and Demography
- Application Process for NCHS
- Data Resources in Economics
- Application Process for Census Bureau Data
- Using the RDC
- Special Sworn Status
- Conducting Research
- Resources at Penn State
- Synthetic Data Sets

Introduction to RDCs

Social Science & Data
- Many social scientists rely on large national-level data sets
- Everyone can access the same public-use data (e.g., ACS, NHANES, SIPP)
- Innovation is fueled by how we use data:
  - New statistical and measurement methods
  - Creatively combining data sets
  - Access to restricted-use data

Limitations of Public-use Data
To protect confidentiality, public-use data:
- Are completely de-identified
- Have limited geography (e.g., county, state, country of birth)
- Group response categories (e.g., income)
- Exclude sensitive personal characteristics (e.g., weight)
- Exclude sensitive economic & social data (e.g., linked tax and Social Security data)
- Are perturbed (e.g., age)
Data availability has become more limited over the past decade
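To make "grouped response categories" and coarsened values concrete, here is a minimal Python sketch of two common kinds of coarsening: binning income into brackets and top-coding age. The cut points and the age cap are hypothetical values chosen for illustration, not the rules any agency applies, and top-coding is only one simple way a public-use value can differ from the restricted-use original.

```python
import bisect

# Hypothetical bracket cut points (illustration only, not an agency rule).
BRACKET_EDGES = (25_000, 50_000, 100_000)


def income_bracket(income, edges=BRACKET_EDGES):
    """Map an exact income to a grouped category label."""
    i = bisect.bisect_right(edges, income)
    if i == 0:
        return f"under {edges[0]}"
    if i == len(edges):
        return f"{edges[-1]} or more"
    return f"{edges[i - 1]} to {edges[i] - 1}"


def top_code(value, cap):
    """Replace any value above the cap with the cap itself."""
    return min(value, cap)


# Exact values as they might appear in a restricted-use file (made up).
exact_income = 37_250
exact_age = 92

print(income_bracket(exact_income))  # 25000 to 49999
print(top_code(exact_age, cap=85))   # 85 (hypothetical cap)
```

In the coarsened version, a respondent earning 37,250 cannot be told apart from one earning 49,000, which is exactly the detail a restricted-use file preserves.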

Census Research Data Centers
- Established in the 1990s by the Center for Economic Studies (CES) at the U.S. Census Bureau
- Network of highly secure computer labs
- Permits researchers to access restricted-use Census and NCHS data without travel
- Major Research-I universities have a Census RDC: Michigan, Minnesota, UCLA, UNC, Berkeley, Stanford, Cornell, Chicago, Columbia, Boston area, Texas, Penn State

Penn State's Research Data Center
- Opening in March 2014
- Location: 203E Pattee
- Supported by:
  - NSF
  - Office of the President
  - SSRI
  - PRI
  - University Libraries
  - Colleges of the Liberal Arts, Health and Human Development, Agriculture, and Science (Eberly)

Restricted Data Can Answer Questions Like These:
- How much has income inequality changed over time?
- What are the characteristics of firms that provide pension benefits versus those that do not?
- How does natural gas development affect population distribution and health?
- ...
Working papers and bibliographies available here:
http://ideas.repec.org/s/cen/wpaper.html
http://www.cdc.gov/rdc/b6pubeyond/pub611.htm

RDC Data Resources in Health and Demography

Primary Data Producers
- U.S. Census Bureau
- National Center for Health Statistics
- National Institute of Justice
- Agency for Healthcare Research and Quality
- NOT NCES or Add Health (we have other arrangements for these data sets)

Added Value
- Detailed geographic identifiers
- Unperturbed data (e.g., age)
- More detail on characteristics & events:
  - Place of birth
  - Date of birth
  - Occupation
  - Income
- Administrative record linkages

Demographic Data
- American Community Survey (1996-2009)
- American Housing Survey (1984-2009)
- Decennial Census (1970-2000)
- Current Population Survey March Supplements (1967-2005)
- National Longitudinal Survey (original cohorts)
- Survey of Income and Program Participation
Linkage is possible at detailed levels of geography
Complete list: http://www.census.gov/ces/dataproducts/demographicdata.html

NCHS Data (unlinked)
- National Health and Nutrition Examination Survey I, II, and III
- National Ambulatory Medical Care Survey
- National Hospital Ambulatory Medical Care Survey
- National Survey of Ambulatory Surgery
- National Hospital Discharge Survey
- National Nursing Home Survey
- National Home and Hospice Care Survey
- National Employer Health Insurance Survey
- National Health Provider Inventory
- National Health Interview Survey
- National Immunization Survey
- Longitudinal Study on Aging
- National Survey of Family Growth
- State and Local Area Integrated Telephone Survey:
  - Health
  - Child Well-Being and Welfare, 1997
  - National Survey of Early Childhood Health
  - National Survey of Children with Special Health Care Needs
  - National Survey of Children's Health
  - National Asthma Survey
- Vital Statistics (Birth, Mortality, Marriages and Divorces, Fetal Death, National Death Index)
http://www.cdc.gov/nchs/r&d/rdc.htm

NCHS Data (linked)
- National Health Interview Survey with:
  - Mortality Data, 1986-2000
  - Medicare Enrollment and Claims Data
  - Social Security Administration Retirement, Survivors, and Disability Insurance Data, 1962-2003
  - Social Security Administration Supplemental Security Income Data, 1974-2003
- National Health and Nutrition Examination Survey I Epidemiologic Follow-up Study with:
  - Mortality Data, 1971-2000
  - Medicare Enrollment and Claims Data, 1991-2000
- National Health and Nutrition Examination Survey I with:
  - Social Security Administration Retirement, Survivors, and Disability Insurance Data, 1962-2003
  - Social Security Administration Supplemental Security Income Data, 1974-2003
- National Health and Nutrition Examination Survey II with:
  - Mortality Data, 1976-2000
  - Medicare Utilization and Expenditure Data, 1991-2000
- National Health and Nutrition Examination Survey III with:
  - Mortality Data, 1988-2000
  - Medicare Enrollment and Claims Data (CMS, 1991-2000)
http://www.cdc.gov/nchs/r&d/rdc.htm

AHRQ Data
Medical Expenditure Panel Survey Datasets
- Household Component-Insurance Component linked file (1996-1999, 2001)
- Nursing Home Component (1996)
- Medical Provider Component (except directly identifiable data)
- Two-Year, Two-Panel Files
- Area Resource File (county-level data that can be linked to MEPS-HC)
- MEPS-HC Public Use Files
AHRQ will create a custom extract for each project.
http://www.meps.ahrq.gov/mepsweb/data_stats/onsite_datacenter.jsp

Application Process for NCHS Data
Step 1: Determine a need for restricted data
Step 2: Determine the best mode of access (remote, NCHS, Census)
Step 3: Develop your research proposal
Step 4: Submit your proposal for review, emailed as one document to Peter Meyer, RDC Director, at rdca@cdc.gov
Step 5: Wait for comments from the review committee and respond quickly to expedite review
Step 6: Update your proposal when there are changes
http://www.cdc.gov/rdc/b3prosal/pp300.htm

Application Process for NCHS Data: Proposal Outline
A. Abstract
B. Research Question
C. Background
D. Public Health Benefit
E. Data Requirements:
  - Survey, Years, Files
  - Restricted Variables
  - Non-NCHS Data
  - Merge Variables
F. Methodology
G. Output:
  - Overview
  - Examples/Table Shells
  - Presentation of Results
H. Data Dictionary
I. References
J. Other Authors
K. Resumes/C.V.
Examples:
- NCHS: http://www.cdc.gov/rdc/data/b3/sampleproposal.pdf
- Van Hook et al.: see handout

Application Process for NCHS Data: Tips
- Consult with Penn State's RDC administrator and NCHS RDC staff
- Smaller-scope projects are easier to get approved
- Proposals can be slim on theory and literature review
- Lots of detail about the data and file construction is required:
  - What variables will you use and/or construct?
  - What files do the variables come from?
  - How will the data files be constructed?
  - What files are merged, by what identifiers, in what order?
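One way to keep track of the file-construction detail a proposal asks for is to write the merge plan down as structured data before drafting the text. The Python sketch below is an illustration only: the file names, identifiers, and variables are hypothetical, and NCHS does not prescribe this format.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MergeStep:
    order: int        # position in the merge sequence
    left: str         # file already in hand (hypothetical name)
    right: str        # file being merged in (hypothetical name)
    keys: tuple       # identifiers used to link the two files
    variables: tuple  # variables taken from the right-hand file


def describe_plan(steps):
    """Return one plain-text line per merge step, sorted by merge order."""
    lines = []
    for step in sorted(steps, key=lambda s: s.order):
        lines.append(
            f"Step {step.order}: merge {step.right} onto {step.left} "
            f"by {', '.join(step.keys)}; keep {', '.join(step.variables)}"
        )
    return lines


# Hypothetical plan: every name below is made up for illustration.
plan = [
    MergeStep(2, "person_file", "county_file",
              ("county_fips",), ("unemployment_rate",)),
    MergeStep(1, "person_file", "mortality_file",
              ("person_id",), ("death_year",)),
]

for line in describe_plan(plan):
    print(line)
```

Each printed line answers the questions in the tips above: which files are merged, by what identifiers, and in what order.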

Application Process for Census Data
- Preliminary Proposal approved by local RDC Director or Administrator
- Final Proposal. For Census data this includes a Predominant Purpose Statement.
- Examples
- Must identify all data sets you will need for the project.

NCHS versus Census

| | NCHS | Census |
|---|---|---|
| Information about data in proposal | Lots of detail about data and variables required | Less detail required |
| Benefit to statistical agency | Not required | Required |
| Time to gain approval | Usually 3-4 months | At least 6 months. More time for projects requesting IRS data |
| Who merges the data files? | NCHS programmers | Researchers |
| Data access | Only the variables specified in proposal | All variables in requested data files |
| Disclosure review | 1-2 days | 3-4 weeks |