HOW TO CREATE AND MERGE DATASETS IN SPSS



Similar documents
What is a Mail Merge?

SPSS: Getting Started. For Windows

Create Mailing Labels Using Excel Data (Mail Merge)

Personalizing your Access Database with a Switchboard

How To Read Data Files With Spss For Free On Windows (Spss)

Create a PDF File. Tip. In this lesson, you will learn how to:

Affiliation Security

Basics Working with Filters

Managing Contacts in Outlook

The LSUHSC N.O. Archive

Human Computer Interaction Final Project Tutorial. Hardware Inventory Management System (HIMS) By M. Michael Nourai

Access 2010: The Navigation Pane

Access Queries (Office 2003)

Division of Student Affairs Quota Practices / Guidelines

Logging Service and Log Viewer for CPC Monitoring

USING STUFFIT DELUXE THE STUFFIT START PAGE CREATING ARCHIVES (COMPRESSED FILES)

Directions for Frequency Tables, Histograms, and Frequency Bar Charts

Introduction to SPSS 16.0

Microsoft Excel 2013: Using a Data Entry Form

Using the GroupWise Client

Getting Started With SPSS

Using Ad-Hoc Reporting

Excel for Data Cleaning and Management

Follow these procedures for QuickBooks Direct or File Integration: Section 1: Direct QuickBooks Integration [Export, Import or Both]

Mail Merge: Create Mailing Labels Using Excel Data and Filtering the Contents in the Data

This exhibit describes how to upload project information from Estimator (PC) to Trns.port PES (server). Figure 1 summarizes this process.

Lesson 07: MS ACCESS - Handout. Introduction to database (30 mins)

Creating a Marketing Campaign

Organizing and Managing

Help Desk Web User Guide

Unit 10: Microsoft Access Queries

You must have at least Editor access to your own mail database to run archiving.

Colligo Manager 5.1. User Guide

Word 2007: Mail Merge Learning Guide

ECDL. European Computer Driving Licence. Database Software BCS ITQ Level 1. Syllabus Version 1.0

Tutorial 3. Maintaining and Querying a Database

Microsoft Office 2010

How To Use Spss

PROJECT ON MICROSOFT ACCESS (HOME TAB AND EXTERNAL DATA TAB) SUBMITTED BY: SUBMITTED TO: NAME: ROLL NO: REGN NO: BATCH:

Pharmacy Affairs Branch. Website Database Downloads PUBLIC ACCESS GUIDE

Note: With v3.2, the DocuSign Fetch application was renamed DocuSign Retrieve.

TABLE OF CONTENTS BACKGROUND: HIGH IMPACT 4.0 PROFESSIONAL AND ACT!. 3 SELECT MAIL MERGE OPTION ON THE MAIN SCREEN.0 TEMPLATE.

MEDILINK ESI (R2) How To: Use the Medilink Document Management System. Casey Pittman Developer - APS Medilink 2011/08/12

Access II 2007 Workshop

Enterprise Reporting Advanced Web Intelligence Training. Enterprise Reporting Services

Microsoft Access 2010 Overview of Basics

MS Access. Microsoft Access is a relational database management system for windows. Using this package, following tasks can be performed.

OUTLOOK 2007 USER GUIDE

Step One. Step Two. Step Three USING EXPORTED DATA IN MICROSOFT ACCESS (LAST REVISED: 12/10/2013)

Producing Listings and Reports Using SAS and Crystal Reports Krishna (Balakrishna) Dandamudi, PharmaNet - SPS, Kennett Square, PA

CHAPTER 6: SEARCHING AN ONLINE DATABASE

How To Insert Hyperlinks In Powerpoint Powerpoint

Reduced Quality Sample

CHAPTER 11: DOCUMENT ARCHIVING

Importing Contacts to Outlook

Introduction to Microsoft Access 2003

Call Recorder Quick CD Access System

Merging Labels, Letters, and Envelopes Word 2013

Section Documents. Contents

7. Data Packager: Sharing and Merging Data

PC Agent Quick Start. Open the Agent. Autonomy Connected Backup. Version 8.8. Revision 0

M4 Systems. Batch & Document Management (BDM) Brochure

Introduction. Inserting Hyperlinks. PowerPoint 2010 Hyperlinks and Action Buttons. About Hyperlinks. Page 1

Netmail Search for Outlook 2010

Introduction to Microsoft Access 2007

Mail Merge in Word. Workbook

Chapter 2: Clients, charts of accounts, and bank accounts

TheEducationEdge. Export Guide

Simply Accounting Intelligence Tips and Tricks Booklet Vol. 1

SPSS Workbook 1 Data Entry : Questionnaire Data

Mac Outlook Calendar/Scheduler and Tasks

Introduction to Using SPSS Command Files

Making Visio Diagrams Come Alive with Data

MAS 500 Intelligence Tips and Tricks Booklet Vol. 1

Microsoft Outlook Reference Guide for Lotus Notes Users

Analyzing Excel Data Using Pivot Tables

Sales Person Commission

Excel 2010 Sorting and Filtering

SPSS (Statistical Package for the Social Sciences)

IT HELP Desk Dashboard ManageEngine Service Desk Plus User Guide

Exporting s from Outlook Version 1.00

PortfolioCenter Export Wizard in Practice: Evaluating IRA Account Holder Ages and Calculating Required Minimum Distribution (RMD) Amounts

Database Program Instructions

MS OUTLOOK

RightFax FaxUtil. Quick reference guide to getting started. Note: This document applies to OpenText RightFax version 10.0

How to create pop-up menus

Information Exchange Network (IEN) System Operator Training Day 3

Introduction to Microsoft Access 2010

Task Force on Technology / EXCEL

Outlook 2007: Managing your mailbox

AdventNet ManageEngine SupportCenter Plus :: User Guide. Table Of Contents INTRODUCTION... 3 REQUEST Creating a New Request...

Microsoft Office Access 2007 Basics

1. Click the Site Actions dropdown arrow and select Show Page Editing Toolbar. 2. Click Edit Page to begin changing the page layout

Using Big Datasets in Stata

Transcription:

HOW TO CREATE AND MERGE DATASETS IN SPSS If the original datasets to be merged are large, the process may be slow and unwieldy. Therefore the preferred method for working on multiple sweeps of data is to create bespoke datasets with the necessary variables (using DROP or KEEP commands) and then merge these datasets together. For this workshop it is the number of cases in the Sweep 1 sample which has been reduced (to approximately 30% of the original sample), and those cases only have been selected (when applicable) in the subsequent datasets for cross-sweep comparison. CREATING BESPOKE DATASETS USING THE DROP AND KEEP COMMANDS The KEEP command allows you to open a large data file specifying which of the variables from that file you wish to INCLUDE in your working data file. The KEEP command can be appended to either the GET FILE or SAVE OUTFILE commands Both individual variables and ranges of variables can be specified The case unique identifier Idnumber - will usually have to be included (to permit later merging with other datasets) Examples: GET FILE= C:\temp\GUSSW3B_30.sav' /Keep = idnumber, dcwinc01, dchgmag2 to ddmedu02. SAVE OUTFILE= C:\temp\Keep Save As Test.sav' /Keep = idnumber, dcwinc01, dchgmag2. The DROP command allows you to open a large data file specifying which of the variables from that file you wish to REMOVE from your working data file. The DROP command can be appended to either the GET FILE or SAVE OUTFILE commands Both individual variables and ranges of variables can be specified Again, the case unique identifier Idnumber - will usually have to be included (to permit later merging with other datasets Examples: GET FILE= C:\temp\GUSSW3B_30.sav' /Drop = samptype to dcwtchd2. SAVE OUTFILE= C:\temp\Drop Save As Test.sav' /Drop = dcurind1, dcurind2.

MERGING DATASETS Datasets can be merged using the unique case ID stored in the variable IDnumber. Whenever you are first merging files, it is easier to use the SPSS menus and then paste the syntax (automatically generated and recorded in the output) rather than using the syntax from scratch as it can be quite tricky depending on how large each of your datasets are and how many identical variables are in each already. The datasets to be merged must always be sorted on the same variable before merging otherwise the matching will not proceed. 1) Open the dataset you want to merge data into: in the example below it is the Sweep 1 birth cohort dataset 2) Sort this dataset on the key variable IDnumber in ascending order via the menu: go to Data\Sort Cases: o Select the variable IDnumber on the left part of the screen o And move it to the right part of the screen using the arrow the default option is Ascending order o Click OK 3) Repeat the same process 1) and 2) above with the dataset you want to extract the data from: the Sweep 2 birth cohort in the example below, to be added to the 1 st dataset = Sweep 1 birth cohort 4) On the menu of the 1st dataset go to: Data\Merge files\add variables

5) In the dialogue box, unless the dataset from which you want to merge is already open, select the button for An external SPSS data file and click Browse. If the dataset is open then select it in the open dataset box (as below). 6) If not already open, browse to the dataset of interest and double-click on it 7) Click Continue 8) The following dialog box will come up; in this example you can see that there is a big list of Excluded Variables on the left, which are the variables shared by both datasets, instead of just the expected variable IDnumber. This is due to the feed forward process: the archived datasets from Sweep 2 include some of the previous sweep variables since original information is only updated when applicable and we want the full information for all cases at each sweep, including those with no changes. To get the full information for this type of variable you need to incorporate the successive sweeps variables.

9) In this Add variables dialogue box, click the box Match cases on key variables in sorted files, and browse to and highlight the variable IDnumber : 10) Click on the arrow next to the Key variables box. IDnumber should now appear in the Key variables box.

The steps you take next will depend on what dataset you re already working on: 11) Under Match cases on key if you select o Both files provide cases (default option): All cases from the merged dataset will be transferred into the working dataset. If you are working on a later dataset and merging in data from an earlier dataset, choosing this option means that additional cases from the earlier dataset will be merged along with the variables. These cases will have missing data for the variables at the later sweep because they were not achieved at that sweep. o Non-active dataset is keyed table : Only merged data for those cases already in the working dataset will be transferred. This avoids the above issue if you are working on a later dataset and merging in a variable from an earlier sweep. Only information from those cases in the working (later) dataset will be merged so you won t generate entire cases with missing data which would need to be deleted or filtered out later on. o Active dataset is keyed table : All cases from the merged dataset will be transferred into the working dataset. This produces the same result as the first scenario. 12) In this example you need to select Option 2 Non-active dataset is keyed table 13) Click OK and again OK in the warning message re cases needing to be sorted before merging