IPUMS Int'l Online Data Analysis




Minnesota Population Center
Training and Development

IPUMS Int'l Online Data Analysis
Exercise 1

OBJECTIVE: Gain an understanding of how the IPUMS dataset is structured and how it can be leveraged to explore your research interests. This exercise will use the IPUMS dataset to explore cell phone ownership, fertility, and migration patterns.

10/24/2012

Page 1

Research Questions
  What are the patterns of cell phone ownership in South Africa?
  What is the trend in fertility in China?
  What is the relationship between income and migration in India?

Objectives
  Select datasets and variables of interest
  Analyze the data using sample code
  Validate data analysis work using answer key

IPUMS-I Variables
  CELL: Cellular phone availability
  URBAN: Urban/rural status
  CHBORN: Children ever born to the individual
  CHBORNF/CHBORNM: Children ever born to the individual, female/male
  MGCAUSE: Cause of migration
  INCWAGE: Wage and salary income per week

SDA Code to Review
  Field              Purpose
  Row                Represents the primary variable of interest
  Column             Divides the analysis of the variable of interest into categories
  Control            Creates a separate chart for each category of the control
  Selection Filter   Allows you to select cases; ex: year(2000-*) -> all years 2000-onward

Review Answer Key (page 6)

Common Mistakes to Avoid
  1 Choosing numerical instead of categorical variables for the Frequencies/Cross Tabulation Program. For these, use the Comparison of Means Program instead.
  2 Forgetting to specify the years of interest.
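The selection filter notation in the table can be made concrete with a small sketch. The Python below is not SDA's own implementation; it is a hypothetical helper (`parse_filter` and `keep` are names invented here) that mimics only the range forms used in this exercise, such as `year(2000-*)`, `chborn(*-30)`, and `sex(2)`, treating `*` as an open bound.

```python
import re

def parse_filter(expr):
    """Parse an SDA-style filter such as 'year(2000-*)' into (name, low, high).

    '*' marks an open bound and is returned as None.
    Only the simple forms seen in this exercise are handled (an assumption).
    """
    match = re.fullmatch(r"\s*(\w+)\((\*|-?\d+)(?:-(\*|-?\d+))?\)\s*", expr)
    if match is None:
        raise ValueError(f"unrecognized filter: {expr!r}")
    name, low, high = match.groups()
    if high is None:  # single value, e.g. sex(2)
        high = low

    def to_bound(text):
        return None if text == "*" else int(text)

    return name, to_bound(low), to_bound(high)

def keep(record, filters):
    """Return True if the record satisfies every parsed filter."""
    for name, low, high in filters:
        value = record[name]
        if low is not None and value < low:
            return False
        if high is not None and value > high:
            return False
    return True

# Several filters are separated by commas, as in the exercise.
filters = [parse_filter(part) for part in "cell(1-2), urban(1-2)".split(",")]
print(keep({"cell": 1, "urban": 2}, filters))  # a record inside both ranges
print(keep({"cell": 9, "urban": 1}, filters))  # cell=9 falls outside 1-2
```

Read this way, `cell(1-2)` keeps only records coded 1 or 2 and drops everything else, which is how the filters in Part I exclude the NIU and Unknown codes.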

Page 2

Step 1 Select a Sample

Getting Started
  Go to https://international.ipums.org/international/, and on the left-hand side of the page, select "Data Online".
  You will have the option to analyze single-year or multi-year samples of one country, or datasets of all countries within a region for all available years.
  Choose the multi-year sample for South Africa.
  The default analysis is frequencies and cross-tabulation.

Step 2 Research Variables of Interest
  Search on the main IPUMS-I site for variable descriptions, codes, universe, etc., or browse under Household and Person variable categories.
  Row and Column are the variables of interest that you will perform the cross-tabulation on.
  Filters select only specified cases.
  A Control creates multiple tables for row and column variables, separated by a third categorical variable. For example, if you include the variable SEX as a control, you will get two frequency tables.
  The Weight default is person weight (wtper), which extrapolates the sample to represent the entire population.
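To illustrate how Row, Column, and Weight combine in a cross-tabulation, here is a minimal Python sketch of weighted column percentages. It is not how SDA computes its tables; the function name `weighted_column_percents` is hypothetical and the household records are made-up toy values, not IPUMS data.

```python
from collections import defaultdict

def weighted_column_percents(records, row, column, weight):
    """Return {(row_code, column_code): percent of the column's weighted total}."""
    cells = defaultdict(float)   # weighted count per (row, column) cell
    totals = defaultdict(float)  # weighted count per column
    for rec in records:
        cells[(rec[row], rec[column])] += rec[weight]
        totals[rec[column]] += rec[weight]
    return {key: 100.0 * value / totals[key[1]] for key, value in cells.items()}

# Toy records (assumed values). Check the variable descriptions for what
# each code means before interpreting a real table.
households = [
    {"cell": 1, "urban": 1, "wthh": 10.0},
    {"cell": 2, "urban": 1, "wthh": 30.0},
    {"cell": 1, "urban": 2, "wthh": 30.0},
    {"cell": 2, "urban": 2, "wthh": 20.0},
]

table = weighted_column_percents(households, "cell", "urban", "wthh")
for (cell_code, urban_code), pct in sorted(table.items()):
    print(f"cell={cell_code} urban={urban_code}: {pct:.1f}%")
```

Each record contributes its weight rather than a count of one, so a household with weight 30 stands in for 30 households in the population. Adding a Control would amount to running the same function once per category of the control variable.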

Page 3

Part I - South Africa Cell Phone Ownership

A) What are the codes and universe for CELL? Is this a person variable or a household variable?

B) What are the respective shares of households for which a cell phone was available?
   i. Rural households across all samples?
   ii. Urban households?

   Row: cell
   Column: urban
   Weight: wthh
   Selection Filters: cell(1-2), urban(1-2)

C) Compare the share of rural or urban households where a cell phone was available in 1996 to 2007. Comment on the change over time.

   Row: cell
   Column: urban
   Weight: wthh
   Control: year
   Selection Filters: cell(1-2), urban(1-2)

Page 4

Part II - Fertility in China

Choose the multi-year sample for China. In the upper-left corner, hover over Analysis and then select Comparison of Means.

A) What is the universe for CHBORN? Why would a frequency table by CHBORN and SEX not be informative?

B) What is the code in CHBORN for NIU (Not in Universe)? Should they be excluded from a comparison of means?

C) Compare the means of CHBORN between 1982 and 1990.

   Dependent: chborn
   Row: year
   Selection filter: sex(2), chborn(*-30)

D) What was the ratio of male children ever born to female children ever born in 1990? Either return to the main SDA page and choose the China 1990 sample, or create a year (1990) filter.

   Dependent: chbornf, chbornm
   Row: year
   Filter: chbornf(*-30), chbornm(*-30), sex(2)

Page 5

Part III - Migration and Income in India

Choose the sample for India 1983. Use the Frequencies and Crosstabulations first.

A) What are the reasons for migration in India in 1983? Most common reason? Least common reason?

   Row: mgcause
   Selection filter: mgcause(10-69)

B) Now change to Comparison of Means. What was the average income for migrants who moved due to:
   i. Natural disaster?
   ii. Social and political problems?
   iii. Job relocation?
   Why might this be?

   Dependent: incwage
   Row: mgcause
   Selection filter: mgcause(10-69), incwage(*-9999997)

Complete! Validate Your Answers

Page 6

ANSWERS: Part I - South Africa Cell Phone Ownership

A) What are the codes and universe for CELL? Is this a person variable or a household variable?

   Codes: 0 NIU (not in universe); 1 Yes; 2 No; 9 Unknown
   Universe: South Africa 1996: Private households; South Africa 2001: Non-homeless households; South Africa 2007: Private households
   CELL is a household variable.

B) What are the respective shares of households for which a cell phone was available?
   i. Rural households across all samples? Yes: 33%, No: 67%
   ii. Urban households? Yes: 56.7%, No: 43.3%

   Row: cell
   Column: urban
   Weight: wthh
   Selection Filters: cell(1-2), urban(1-2)

C) Compare the share of rural or urban households where a cell phone was available in 1996 to 2007. Comment on the change over time.

   1996: Yes: 3.9%, No: 96.1%
   2007: Yes: 71.6%, No: 28.4%
   The availability of cell phones to rural households in South Africa is converging on the availability of cell phones in urban households.

   Row: cell
   Column: urban
   Weight: wthh
   Control: year
   Selection Filters: cell(1-2), urban(1-2)

Page 7

ANSWERS: Part II - Fertility in China

Choose the multi-year sample for China. In the upper-left corner, hover over Analysis and then select Comparison of Means.

A) What is the universe for CHBORN? Why would a frequency table by CHBORN and SEX not be informative?

   Females ages 15 to 64. There are no men in the universe of this question.

B) What is the code in CHBORN for NIU (Not in Universe)? Should they be excluded from a comparison of means?

   The code for NIU is 99, and 98 is Unknown. If these aren't excluded, they will be included in the means of children ever born and bias the estimates upwards.

C) Compare the means of CHBORN between 1982 and 1990.

   1982: Average of 2.61 children ever born
   1990: Average of 2.1 children ever born

   Dependent: chborn
   Row: year
   Selection filter: sex(2), chborn(*-30)

D) What was the ratio of male children ever born to female children ever born in 1990?

   The mean number of male children ever born was 1.09, and the mean number of female children ever born was 1.01, for a ratio of 1.09/1.01 = 1.079 boys to each girl.
   Either return to the main SDA page and choose the China 1990 sample, or create a year (1990) filter.

   Dependent: chbornf, chbornm
   Row: year
   Filter: chbornf(*-30), chbornm(*-30), sex(2)
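The point in answer B, that leaving codes 98 and 99 in the data biases the mean upwards, and the ratio in answer D can both be checked with a few lines of Python. The `chborn` list below is invented toy data, not the China sample, and `mean_excluding` is a hypothetical helper that plays the role of the `chborn(*-30)` filter.

```python
def mean_excluding(values, max_valid):
    """Mean of the values at or below max_valid, like a (*-max_valid) filter."""
    valid = [v for v in values if v <= max_valid]
    return sum(valid) / len(valid)

# Toy CHBORN values: five real responses plus one Unknown (98) and one NIU (99).
chborn = [0, 1, 2, 3, 2, 98, 99]

naive_mean = sum(chborn) / len(chborn)      # special codes treated as real counts
filtered_mean = mean_excluding(chborn, 30)  # special codes dropped

print(f"naive mean:    {naive_mean:.2f}")
print(f"filtered mean: {filtered_mean:.2f}")

# Ratio of mean male to mean female children ever born, using the
# figures reported in answer D.
ratio = 1.09 / 1.01
print(f"boys per girl: {ratio:.3f}")
```

Two special codes among seven records are enough to push the naive mean far above any plausible number of children, which is why the exercise filters on `chborn(*-30)` before comparing means.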

Page 8

ANSWERS: Part III - Migration and Income in India

Choose the sample for India 1983. Use the Frequencies and Crosstabulations first.

A) What are the reasons for migration in India in 1983?

   Most common reason? Marriage or union
   Least common reason? Natural disaster

   Row: mgcause
   Selection filter: mgcause(10-69)

B) Now change to Comparison of Means. What was the average income for migrants who moved due to:
   i. Natural disaster? 14.86 rupees
   ii. Social and political problems? 23.38
   iii. Job relocation? 166.78

   Why might this be?
   Migrants who moved due to natural disasters likely earned much less than average in the past week because they may have lost everything or lacked the capital to rebuild and stay in the same place. Families that are slightly wealthier than average might be able to afford to move for security reasons. Job relocators are paid the most because the pay must be high enough to exceed the costs of moving.

   Dependent: incwage
   Row: mgcause
   Selection filter: mgcause(10-69), incwage(*-9999997)
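A Comparison of Means with a Row variable amounts to averaging the dependent variable within each row category after the filters are applied. The Python sketch below shows that idea on made-up records. The `mgcause` codes 10 and 20 and the value 9999998 are placeholders chosen for illustration, not documented IPUMS codes, and `group_means` is a hypothetical name.

```python
from collections import defaultdict

def group_means(records, dependent, row):
    """Return {row_code: unweighted mean of the dependent variable}."""
    sums = defaultdict(float)
    counts = defaultdict(int)
    for rec in records:
        sums[rec[row]] += rec[dependent]
        counts[rec[row]] += 1
    return {code: sums[code] / counts[code] for code in sums}

# Toy migrant records (assumed values, not the India 1983 sample).
migrants = [
    {"mgcause": 10, "incwage": 100},
    {"mgcause": 10, "incwage": 200},
    {"mgcause": 20, "incwage": 30},
    {"mgcause": 20, "incwage": 9999998},  # placeholder for a special income code
]

# Equivalent of the selection filter mgcause(10-69), incwage(*-9999997).
valid = [
    rec for rec in migrants
    if 10 <= rec["mgcause"] <= 69 and rec["incwage"] <= 9999997
]

means = group_means(valid, "incwage", "mgcause")
print(means)
```

Without the `incwage(*-9999997)` filter, the single out-of-range value would dominate its group's mean, the same bias described for CHBORN in Part II. This sketch is unweighted; a weighted version would sum income times weight and divide by the summed weights.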