Automated Analysis of Survey Data using STATA. Vinay A. Patel Deputy Manager National Dairy Development Board



Similar documents
Survey Data Analysis in Stata

Survey Data Analysis in Stata

Youth Risk Behavior Survey (YRBS) Software for Analysis of YRBS Data

Software for Analysis of YRBS Data

Complex Survey Design Using Stata

Failure to take the sampling scheme into account can lead to inaccurate point estimates and/or flawed estimates of the standard errors.

<next> Italian Stata Users Group meeting. Tutorial: Output processing and automatic reporting with Stata. Ben Jann, ETH Zurich,

Exploratory Data Analysis with One and Two Variables

Sampling Error Estimation in Design-Based Analysis of the PSID Data

2. Look for a link to ODESI data portal. In the Key Links section in the left side and double click ODESI Data Retrieval.

How to Download Census Data from American Factfinder and Display it in ArcMap

Introduction to Sampling. Dr. Safaa R. Amer. Overview. for Non-Statisticians. Part II. Part I. Sample Size. Introduction.

Workshop on Using the National Survey of Children s s Health Dataset: Practical Applications

Chapter XXI Sampling error estimation for survey data* Donna Brogan Emory University Atlanta, Georgia United States of America.

>

Getting Started in Frequencies, Crosstab, Factor and Regression Analysis (ver. 2.0 beta, draft)

Module 14: Missing Data Stata Practical

INTRODUCTION TO SURVEY DATA ANALYSIS THROUGH STATISTICAL PACKAGES

The Research Data Centres Information and Technical Bulletin

Personal Geodatabase 101

STATA SURVEY DATA REFERENCE MANUAL

Syntax Menu Description Options Remarks and examples Stored results References Also see

Title. Syntax. stata.com. odbc Load, write, or view data from ODBC sources. List ODBC sources to which Stata can connect odbc list

Working with Temporal Data

Data exploration with Microsoft Excel: analysing more than one variable

Quick Start Guide. Microsoft Access 2013 looks different from previous versions, so we created this guide to help you minimize the learning curve.

The SURVEYFREQ Procedure in SAS 9.2: Avoiding FREQuent Mistakes When Analyzing Survey Data ABSTRACT INTRODUCTION SURVEY DESIGN 101 WHY STRATIFY?

Introduction to Exploratory Data Analysis

Trade Flows and Trade Policy Analysis. October 2013 Dhaka, Bangladesh

Comparison of Programs for Fixed Kernel Home Range Analysis

Data Analysis. Using Excel. Jeffrey L. Rummel. BBA Seminar. Data in Excel. Excel Calculations of Descriptive Statistics. Single Variable Graphs

Statement of Work. Shin Woong Sung

Ian Watson

An Introduction to Point Pattern Analysis using CrimeStat

From the help desk: hurdle models

Appendix III: SPSS Preliminary

1. Examples using data from the Survey of industrial and service firms

Monte Carlo Simulation in Stata

Getting Started in Data Analysis using Stata

Chemotaxis and Migration Tool 2.0

New SAS Procedures for Analysis of Sample Survey Data

Dairy Information Services Kiosk and Dairy Portal 1

Survey, Statistics and Psychometrics Core Research Facility University of Nebraska-Lincoln. Log-Rank Test for More Than Two Groups

National Longitudinal Study of Adolescent Health. Strategies to Perform a Design-Based Analysis Using the Add Health Data

Additional sources Compilation of sources:

Computing Poverty measures with R vs. Stata. Rosendo Ramirez and Darryl McLeod. Professor Vinod R-Group presentation, May 1, 2014

Market Pricing Override

MicroStrategy Desktop

GUIDE TO REDCAP EXPORTED FILES

Introduction to RStudio

Syntax Menu Description Options Remarks and examples Stored results Methods and formulas References Also see

Integrating Spreadsheet Templates and Data Analysis into Fluid Power Instruction

TABLE OF CONTENTS. 7. shapefile..59. shapefile shapefile

Microsoft Courses. Microsoft Office 2007

Data Visualisation and Its Application in Official Statistics. Olivia Or Census and Statistics Department, Hong Kong, China

Importing Data into R

ECONOMICS 351* -- Stata 10 Tutorial 2. Stata 10 Tutorial 2

A Short Guide to Stata 13

The Survey of Income and Program Participation (SIPP) * Useful Tools for Data Analysis using the SIPP

Quick Start Guide. Microsoft Access 2013 looks different from previous versions, so we created this guide to help you minimize the learning curve.

<next> <PDF version> Tutorial: Automated table generation and reporting with Stata. Ben Jann, ETH Zurich,

GeoManitoba Spatial Data Infrastructure Update. Presented by: Jim Aberdeen Shawn Cruise

PSU Winter Training Roadmap

Stata 12 Merging Guide. Nathan Favero Texas A&M University October 19, 2012

Data Management for Multi-Environment Trials in Excel

Microsoft Access 2010 Advanced Queries

Computer Skills: Levels of Proficiency

Variance Estimation Guidance, NHIS (Adapted from the NHIS Survey Description Documents) Introduction

Microsoft Office Word 2010: Level 1

Optimization of sampling strata with the SamplingStrata package

OPTIMAl PREMIUM CONTROl IN A NON-liFE INSURANCE BUSINESS

Creating a File Geodatabase

Data Visualization. Brief Overview of ArcMap

Introduction to STATA 11 for Windows

Cookbook 23 September 2013 GIS Analysis Part 1 - A GIS is NOT a Map!

Using Excel for Statistics Tips and Warnings

Effective Big Data Visualization

Data Visualization in Official Statistics

Business Intelligence System for Monitoring, Analysis and Forecasting of Socioeconomic Development of Russian Territories

Chapter 11 Introduction to Survey Sampling and Analysis Procedures

Introduction to IBM SPSS Statistics

Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not.

Quick and Easy Web Maps with Google Fusion Tables. SCO Technical Paper

Health Care Payments and Poverty

ABSTRACT INTRODUCTION CLINICAL PROJECT TRACKER OF SAS TASKS. Paper PH

The Spatiotemporal Visualization of Historical Earthquake Data in Yellowstone National Park Using ArcGIS

Intro to GIS Winter Data Visualization Part I

Charting Your Course: Charts and Graphs for IT Projects

How To Use Statgraphics Centurion Xvii (Version 17) On A Computer Or A Computer (For Free)

Section 1 Spreadsheet Design

UCINET Visualization and Quantitative Analysis Tutorial

Exploratory data analysis (Chapter 2) Fall 2011

Microsoft Access Glossary of Terms

NHA. User Guide, Version 1.0. Production Tool

Algorithm & Flowchart & Pseudo code. Staff Incharge: S.Sasirekha

Descriptive Statistics

Transcription:

Automated Analysis of Survey Data using STATA Vinay A. Patel Deputy Manager National Dairy Development Board

1. Background 2. Prior to using STATA 3. Survey Data Characteristics 4. Estimation of Important Variables 5. Automation of Survey data analysis 6. Important Commands-need to know 7. Interface between GIS and STATA

Background Types of Activities Various Surveys Repetitive Surveys Survey procedures

Prior to using STATA Time required @ every stage Data Collection Data Analysis in EXCEL Hypothesis Testing Report Preparation

Single-Stage Design: Survey Data Characteristics svyset [ psu ] [ weight ] [, strata (varname) fpc (varname) ] Primary Sampling Unit (psu) Sampling weights-pweight Strata Finite Population Correction (fpc) svyset VILLAGE [pweight=invweight], strata(quadstrata) fpc(fpc) poststrata(pstrata) postweight(thlacthh) vce(linearized)

Multi-Stage Design: svyset psu [weight] [, strata (varname) fpc (varname) ] [ ssu [, strata (varname) fpc (varname) ] Stages are delimited by SSU secondary / subsequent sampling units

Estimation of Important Variables Important Variables Social Category Economic Category Village Surplus Milch Animals Producer Surplus Milk Production Size of Family Milch Animal Households Milch Animals Yield

Problem: % Distribution of Households by Social Category Through Stata command Through Pivot table in excel tabulate Tehsil CASTE if CASTE > 0, nofreq row svy linearized : proportion CASTE,over(PSTRATA) cformat(%9.2f)

Problem: Composition of Milch Animals (%) Through Stata command Through Pivot table in excel tabstat lcper cbper bfper totaniper, by(tehsil) columns(variables) format(%9.0f) svy linearized : total LCIM CBIM BFIM inmilk lc cb bf MilchAnimal, over (PSTRATA) cformat(%9.2f)

Problem: Estimated Milk Production Species-wise Through Stata command Through Pivot table in excel tabstat lcshare cbshare bfshare allshare, by(tehsil) columns(variables) format(%9.0f) svy linearized : total LCPROD CBPROD BFPROD TOTPROD, over(pstrata) cformat(%9.0f)

Problem: Milk Procured by different Agencies Through Stata command Through Pivot table in excel tabstat sldcsper sllocalper sldudhiaper sloutvillper slpvtdairyper totsaleper, by(tehsil) columns(variables) format(%9.2f) svy linearized : total SLDCS SLLOCAL SLDUDHIA SLOUTVILL SLPVTDAIRY totalsale, over(pstrata) cformat(%9.2f)

Automation of Survey data analysis Create a Do-file Convert into ado-file Use same do or ado-file for repetitive use Vinay A. Patel N.D.D.B

Important Commands-need to know For graphs tableplot vioplot tabplot catplot For Export of tables tabout xml_tab outreg2 collapse

Interface between GIS and STATA shp2dta - Converts shape boundary files to Stata datasets shp2dta using shpfilename, database(filename) coordinates(filename) [options] spmap - Visualization of spatial data spmap DCSSALE using thlscoor.dta, id(_id) title("sold to DCS as a % of Total Sale in Tehsil") fcolor(white gs15 ltkhaki ltblue ) point( xcoord( x_c) ycoord( y_c) proportional( DCSSALE) size(*2) fcolor(eltgreen) ocolor(white)) label( xcoord( x_c) ycoord( y_c) label( teh_dcs) size(*.5))

THANK YOU