Cohort Analysis for Genetic Epidemiology (C. A.G. E.) User Reference Manual

Size: px
Start display at page:

Download "Cohort Analysis for Genetic Epidemiology (C. A.G. E.) User Reference Manual"

Transcription

1 Cohort Analysis for Genetic Epidemiology (C. A.G. E.) User Reference Manual CAGE is a UNIX based program, which calculates the standardized cancer incidence ratios (Observed / Expected) with 95% confidence intervals assuming that the observed number of malignancies follow a Poisson distribution. Some of its application examples include familial aggregation using pedigree data, second onset cancer within cohort of cancer patients and so on. The CAGE program requires three data input files: 1) Registry data file: Connecticut or SEER data in R1 format Note: This file is provided to the user 2) Cohort data file: Your input data file which contains the members of the population (.dat file) 3) Data Input file: A Cohort Analysis Language (CAL) program file telling CAGE how to manipulate the data and what analysis to perform (.inp file) Note: You need to create this file to run your analysis To run CAGE, use the following command line in UNIX: % cage registry data file name data input file name > output file name % cage conn9.r1 test.inp > test.out Registry Data File The newly updated Connecticut Tumor Registry data file (conn9.r1_ctr) and SEER data files (conn9.r1_seerwhite and conn9.r1_seerblack) are both ICD-9 R1 format Connecticut Tumor Registry file, where the conn9.r1_seerwhite data is for SEER white population and the conn9.r1_seerblack data is for SEER black population. Cohort Data File The cohort data file is an ASCII file of rows and columns of data. Each row is made up of columns which represent parameters that describe or apply to one member of the population being studied. Data columns may be of type string (alphanumeric), floating point number or integer. Columns can be delimited by spaces or tabs. Missing value is coded as -1. Certain information is required to be included for each member. These must-have columns are: 1) Sex 2) Age 1

2 3) Birth year 4) Affection status Do not label the column in the first row of the file. Each column must be numeric type. CAGE will give an error message Bus Error (core dumped) if the cohort data file exceeds rows (observation). So if you have a large dataset, please cut your data to two or more files and make sure each file contains less than rows (observations). The commands you can use in Unix to cut your data are: % head [-counts] [file name] > [ new file name] % tail [-counts] [file name] > [new file name] For If you have a large cohort data BT.dat which contains observations, you should cut this data into 2 files: % head BT.dat > first10kbt.dat where first10kbt.dat contains the first obs of BT.dat % tail 5678 BT.dat > last5678bt.dat where last5678bt.dat contains the last 5678 obs of BT.dat You need to run CAGE for each of the two files separately, and then run a S-plus program to combine the two results. Please consult Carol J. Etzel or Mei Liu about how to run the S-plus program. 2

3 Data Input File The input file which uses the CAL describes and manipulates the cohort data and performs analyses of that data. The input file includes five parts and here is an data file is test.dat Part I: Specify the cohort data file labels (REL, BYR, SEX, PBrace, PBsex, PBYOB, cancer, AGE) ; Part II: Label variables sex is col SEX where (male=1, female=2) ; age is col AGE; birth year is col BYR ; affected_status is col cancer where (affected == 1, unaffected == 2) ; Part III: Identify the required fields: Sex, Age, Birth year, Affection status columns group FDR by REL where value == 1 REL where value == 2 REL where value == 3 REL where value == 12; group Fathers by REL where value ==2; group Mothers by REL where value ==3; group Brothers by REL where value ==1 && SEX where value ==1; group Sisters by REL where value ==1 && SEX where value ==2; Part IV: Define analysis groups using group-by-statement analyze FDR for site (MLG) ; analyze Fathers for site (MLG) ; analyze Mothers for site (MLG) ; Part V: Analyze defined groups using analyze-statement analyze Brothers for site (MLG) ; analyze Sisters for site (MLG) ; 3

4 Part I: Specify the cohort data file Tells CAGE what cohort data file to be analyzed data file is filename data file is test.dat Part II: Label variables Tells CAGE what variables in the cohort data file labels(variable1, variable2, Sex, varilble3, Age, variable4, Birth, variable5, variable6, Affected, variable7, variable8, ) * Where variable1, variable2, variable3 are the other variables except age, sex, birth year, affected status in your data file. labels (Familyid, Sex, Age, Affected, Relationship, Birth) * The different variables must be separated by comma * The variable order should be exact the same as they appear in the dataset * Be aware the lower case sex, age, affected status and birth year are keywords for CAL, so they are not allowed using in the label statement. Using upper case Sex, Age, etc, because CAL is case-sensitive. Part III: Identify Sex, Age, Birth year, Affection status columns Tells CAGE where to find the age, sex, birth year, and affected status information age is col column name sex is col column name where(male = value, female = value) birth year is col column name affected_status is col column name where(affected relational-operator value, unaffected relational-operator value) * where relational-operator includes any of == < > <= >= * Be aware = but not == is set for the sex variable age is col Age sex is col Sex where(male =1, female = 2) birth year is col Birth affected_status is col Affected where (affected == 1, unaffected == 0) 4

5 Part IV: Define analysis groups using group-by-statement group-by-statement tells CAGE to define a subgroup with the stated logical commonalties. If the subgroup is defined by only one column variable: group identifier by column name where value relational-op constant * where relational-operator includes any of == < > <= >= group females by Sex where value == 2; group whites by Race where value == 1; If the subgroup is defined by more than one column variable: group identifier by column where value relational-op constant relational-op column where value relational-op constant relational-op column where value relational-op constant *where relational-op includes: (logical or) and &&(logical and) group PBMale by REL where value == 1 REL where value == 2 REL where value == 3 REL where value == 12 && PBsex where value == 1; Part IV: Analyze defined groups using analyze-statement analyze-statement performs observed/expected analysis using the Connecticut Tumor Registry data or SEER data. This analysis is performed on the defined group. analyze group name for site (site list) print summary reset variables * site list is a comma delimited list of the one or more sites taken from the PREFERENT CAUSE LIST * print summary tells CAGE to show the detailed analysis results * reset variables tells CAGE to start a new grouping context analysis analyze females for site(brf,lip) print summary reset variables 5

6 analyze PBAge20 for site (BLA) print summary reset variables Analysis Output File: CAGE gives you the analysis results for each group you define in the input file. The output file contains the input file information, the missing value information, and the analysis results. Here is the output from the analysis for the given input file CONNECTICUT TUMOR FILE IS conn9.r1_ctr CAL FILE IS test.inp POPULATION DATA FILE IS test.dat Input file information Missing essential parameter on member 680 of group FDR: age -1, sex 2, birthyear 1917 Missing essential parameter on member 1170 of group FDR: age -1, sex 2, birthyear 1903 Missing essential parameter on member 1296 of group FDR: age -1, sex 1, birthyear 1933 Missing essential parameter on member 2254 of group FDR: age -1, sex 2, birthyear 1985 Missing essential parameter on member 2358 of group FDR: age -1, sex 2, birthyear 1957 Missing essential parameter on member 2359 of group FDR: age -1, sex 1, birthyear 1958 Missing essential parameter on member 2944 of group FDR: age -1, sex 1, birthyear 1951 Missing essential parameter on member 3122 of group FDR: age 39, sex -1, birthyear 1952 Missing essential parameter on member 3288 of group FDR: age 70, sex -1, birthyear 1921 Missing essential parameter on member 3328 of group FDR: age -1, sex 2, birthyear 1939 Missing essential parameter on member 3329 of group FDR: age -1, sex 1, birthyear 1910 Observations with missing value which are not analyzed by CAGE Group Site Total Pop Tot Affected Sum of Exp O/E L/E U/E Person Years FDR MLG, TOTALS: FDR Missing essential parameter on member 475 of group Fathers: age -1, sex 1, birthyear 1951 Missing essential parameter on member 529 of group Fathers: age 70, sex -1, birthyear 1921 Missing essential parameter on member 534 of group Fathers: age -1, sex 1, birthyear 1910 Analysis results Fathers MLG, TOTALS: Fathers Missing essential parameter on member 104 of group Mothers: age -1, sex 2, birthyear 1917 Missing essential parameter on member 184 of group Mothers: age -1, sex 2, birthyear 1903 Mothers MLG, TOTALS: Mothers Missing essential parameter on member 270 of group Brothers: age -1, sex 1, birthyear 1933 Brothers MLG, TOTALS:

7 Brothers Missing essential parameter on member 638 of group Sisters: age -1, sex 2, birthyear 1939 Sisters MLG, TOTALS: Sisters The analysis results contain group name, site name, total population in your cohort dataset, total affected cases in your cohort dataset, expected number of cases which is obtained by multiplying the age- and gender-specific cancer incidence rates in Connecticut or SEER database by corresponding person-years of your cohort data, the ratio of observed to expected numbers of cases (SIR) with likelihood-based 95% confidence intervals (CI) from Poisson models and person-years. Calculation of Person-years in CAGE CAGE calculates the person- years for each group by adding the person-years for that specific group with the person-years for the preceding analysis groups. Therefore, in order to get the person-years for only the specific group, you need to subtract the personyears by the previous person-years. Below is an Person-years are from the output file example and the actual person-years were recalculated for each analysis group. Group Person-years from CAGE Actual Person-years FDR Father Mother Brother Sister Interpretation of the Results: Using the above output file as an Significantly increased SIRs were observed for MLG for the Sister group (SIR= 1.47, 95% CI =( )). Decreased SIRs were observed for the FDR, Father, Mother, Brother groups, however, the results are insignificant because all the 95% CIs include 1. How to analyze second cancer within cohort of cancer patients? To calculate the risk of second cancer within the cohort of cancer patients, you need to know the age onset for both of the first and second cancers. You will also need an accompanying SPLUS/R function to calculate the final SIRs. This function is available 7

8 upon request. The following are the steps you need to complete to obtain an SIR for a second cancer: First, run CAGE to get the expected number of cancers (E1) and total number of person years (PY1) from the birth to the age onset of the first cancer within the cohort. Second, run CAGE to get the expected number of cancers (E2) total number of person years (PY2) from the birth to the age onset of the second cancer within the cohort Third, Calculate the corrected expected number of cancers (EC) from the age onset of the first cancer to the second cancer: EC=E2-E1 Calculate the correct person years (PYC) from the age onset of the first cancer to the second cancer: PYC=PY2-PY1 Fourth, run the Splus program using EC, PYC and the observed number of second cancers within the cohort to get the SIR and 95% confidence interval for the second cancer. EC=E2-E1 The adjusted person-years for the second onset can be obtained by subtracting the person-years Birth of the second cancer by the person-years First of the first cancer. Second Cancer Cancer E1 E2 8

Time Clock Import Setup & Use

Time Clock Import Setup & Use Time Clock Import Setup & Use Document # Product Module Category CenterPoint Payroll Processes (How To) This document outlines how to setup and use of the Time Clock Import within CenterPoint Payroll.

More information

Using the American Community Survey Data

Using the American Community Survey Data Using the American Community Survey Data The ACS website is accessible from www.census.gov/acs. In the middle of the screen choose In-depth Data Go directly to the ACS Datasets Tab The ACS Datasets Tab

More information

Figure 1.1 Percentage of persons without health insurance coverage: all ages, United States, 1997-2001

Figure 1.1 Percentage of persons without health insurance coverage: all ages, United States, 1997-2001 Figure 1.1 Percentage of persons without health insurance coverage: all ages, United States, 1997-2001 DATA SOURCE: Family Core component of the 1997-2001 National Health Interview Surveys. The estimate

More information

How to set the main menu of STATA to default factory settings standards

How to set the main menu of STATA to default factory settings standards University of Pretoria Data analysis for evaluation studies Examples in STATA version 11 List of data sets b1.dta (To be created by students in class) fp1.xls (To be provided to students) fp1.txt (To be

More information

Adverse Impact Ratio for Females (0/ 1) = 0 (5/ 17) = 0.2941 Adverse impact as defined by the 4/5ths rule was not found in the above data.

Adverse Impact Ratio for Females (0/ 1) = 0 (5/ 17) = 0.2941 Adverse impact as defined by the 4/5ths rule was not found in the above data. 1 of 9 12/8/2014 12:57 PM (an On-Line Internet based application) Instructions: Please fill out the information into the form below. Once you have entered your data below, you may select the types of analysis

More information

Odds ratio, Odds ratio test for independence, chi-squared statistic.

Odds ratio, Odds ratio test for independence, chi-squared statistic. Odds ratio, Odds ratio test for independence, chi-squared statistic. Announcements: Assignment 5 is live on webpage. Due Wed Aug 1 at 4:30pm. (9 days, 1 hour, 58.5 minutes ) Final exam is Aug 9. Review

More information

Using Stata for Categorical Data Analysis

Using Stata for Categorical Data Analysis Using Stata for Categorical Data Analysis NOTE: These problems make extensive use of Nick Cox s tab_chi, which is actually a collection of routines, and Adrian Mander s ipf command. From within Stata,

More information

Running Descriptive Statistics: Sample and Population Values

Running Descriptive Statistics: Sample and Population Values Running Descriptive Statistics: Sample and Population Values Goal This exercise is an introduction to a few of the variables in the household- and person-level LIS data sets. The exercise concentrates

More information

RATIOS, PROPORTIONS, PERCENTAGES, AND RATES

RATIOS, PROPORTIONS, PERCENTAGES, AND RATES RATIOS, PROPORTIOS, PERCETAGES, AD RATES 1. Ratios: ratios are one number expressed in relation to another by dividing the one number by the other. For example, the sex ratio of Delaware in 1990 was: 343,200

More information

Summary Measures (Ratio, Proportion, Rate) Marie Diener-West, PhD Johns Hopkins University

Summary Measures (Ratio, Proportion, Rate) Marie Diener-West, PhD Johns Hopkins University This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this

More information

Death Data: CDC Wonder, Texas Health Data, and VitalWeb

Death Data: CDC Wonder, Texas Health Data, and VitalWeb Death Data: CDC Wonder, Texas Health Data, and VitalWeb Evidence-Based Public Health Practice Step 2: Quantify the Issue This handout demonstrates how to access CDC Wonder, Texas Health Data, and VitalWeb

More information

Using SPSS, Chapter 2: Descriptive Statistics

Using SPSS, Chapter 2: Descriptive Statistics 1 Using SPSS, Chapter 2: Descriptive Statistics Chapters 2.1 & 2.2 Descriptive Statistics 2 Mean, Standard Deviation, Variance, Range, Minimum, Maximum 2 Mean, Median, Mode, Standard Deviation, Variance,

More information

Two Related Samples t Test

Two Related Samples t Test Two Related Samples t Test In this example 1 students saw five pictures of attractive people and five pictures of unattractive people. For each picture, the students rated the friendliness of the person

More information

EXST SAS Lab Lab #4: Data input and dataset modifications

EXST SAS Lab Lab #4: Data input and dataset modifications EXST SAS Lab Lab #4: Data input and dataset modifications Objectives 1. Import an EXCEL dataset. 2. Infile an external dataset (CSV file) 3. Concatenate two datasets into one 4. The PLOT statement will

More information

Introduction to STATA 11 for Windows

Introduction to STATA 11 for Windows 1/27/2012 Introduction to STATA 11 for Windows Stata Sizes...3 Documentation...3 Availability...3 STATA User Interface...4 Stata Language Syntax...5 Entering and Editing Stata Commands...6 Stata Online

More information

Summary of R software commands used to generate bootstrap and permutation test output and figures in Chapter 16

Summary of R software commands used to generate bootstrap and permutation test output and figures in Chapter 16 Summary of R software commands used to generate bootstrap and permutation test output and figures in Chapter 16 Since R is command line driven and the primary software of Chapter 16, this document details

More information

Constructing a Table of Survey Data with Percent and Confidence Intervals in every Direction

Constructing a Table of Survey Data with Percent and Confidence Intervals in every Direction Constructing a Table of Survey Data with Percent and Confidence Intervals in every Direction David Izrael, Abt Associates Sarah W. Ball, Abt Associates Sara M.A. Donahue, Abt Associates ABSTRACT We examined

More information

T-SQL STANDARD ELEMENTS

T-SQL STANDARD ELEMENTS T-SQL STANDARD ELEMENTS SLIDE Overview Types of commands and statement elements Basic SELECT statements Categories of T-SQL statements Data Manipulation Language (DML*) Statements for querying and modifying

More information

Federal Employee Viewpoint Survey Online Reporting and Analysis Tool

Federal Employee Viewpoint Survey Online Reporting and Analysis Tool Federal Employee Viewpoint Survey Online Reporting and Analysis Tool Tutorial January 2013 NOTE: If you have any questions about the FEVS Online Reporting and Analysis Tool, please contact your OPM point

More information

New Hampshire Childhood Cancer

New Hampshire Childhood Cancer Introduction: New Hampshire Childhood Cancer New Hampshire, Childhood Cancer, January 2009 Issue Brief Cancer in children is relatively uncommon, impacting fewer than twenty two of every 100,000 children

More information

Independent t- Test (Comparing Two Means)

Independent t- Test (Comparing Two Means) Independent t- Test (Comparing Two Means) The objectives of this lesson are to learn: the definition/purpose of independent t-test when to use the independent t-test the use of SPSS to complete an independent

More information

Is it statistically significant? The chi-square test

Is it statistically significant? The chi-square test UAS Conference Series 2013/14 Is it statistically significant? The chi-square test Dr Gosia Turner Student Data Management and Analysis 14 September 2010 Page 1 Why chi-square? Tests whether two categorical

More information

CHILDHOOD CANCER SURVIVOR STUDY Analysis Concept Proposal

CHILDHOOD CANCER SURVIVOR STUDY Analysis Concept Proposal CHILDHOOD CANCER SURVIVOR STUDY Analysis Concept Proposal 1. STUDY TITLE: Longitudinal Assessment of Chronic Health Conditions: The Aging of Childhood Cancer Survivors 2. WORKING GROUP AND INVESTIGATORS:

More information

Creating Basic Excel Formulas

Creating Basic Excel Formulas Creating Basic Excel Formulas Formulas are equations that perform calculations on values in your worksheet. Depending on how you build a formula in Excel will determine if the answer to your formula automatically

More information

3.4 Statistical inference for 2 populations based on two samples

3.4 Statistical inference for 2 populations based on two samples 3.4 Statistical inference for 2 populations based on two samples Tests for a difference between two population means The first sample will be denoted as X 1, X 2,..., X m. The second sample will be denoted

More information

Embedded Systems. Review of ANSI C Topics. A Review of ANSI C and Considerations for Embedded C Programming. Basic features of C

Embedded Systems. Review of ANSI C Topics. A Review of ANSI C and Considerations for Embedded C Programming. Basic features of C Embedded Systems A Review of ANSI C and Considerations for Embedded C Programming Dr. Jeff Jackson Lecture 2-1 Review of ANSI C Topics Basic features of C C fundamentals Basic data types Expressions Selection

More information

WHO STEPS Surveillance Support Materials. STEPS Epi Info Training Guide

WHO STEPS Surveillance Support Materials. STEPS Epi Info Training Guide STEPS Epi Info Training Guide Department of Chronic Diseases and Health Promotion World Health Organization 20 Avenue Appia, 1211 Geneva 27, Switzerland For further information: www.who.int/chp/steps WHO

More information

Appendix III: SPSS Preliminary

Appendix III: SPSS Preliminary Appendix III: SPSS Preliminary SPSS is a statistical software package that provides a number of tools needed for the analytical process planning, data collection, data access and management, analysis,

More information

Microsoft Excel 2010 Part 3: Advanced Excel

Microsoft Excel 2010 Part 3: Advanced Excel CALIFORNIA STATE UNIVERSITY, LOS ANGELES INFORMATION TECHNOLOGY SERVICES Microsoft Excel 2010 Part 3: Advanced Excel Winter 2015, Version 1.0 Table of Contents Introduction...2 Sorting Data...2 Sorting

More information

Incorrect Analyses of Radiation and Mesothelioma in the U.S. Transuranium and Uranium Registries Joey Zhou, Ph.D.

Incorrect Analyses of Radiation and Mesothelioma in the U.S. Transuranium and Uranium Registries Joey Zhou, Ph.D. Incorrect Analyses of Radiation and Mesothelioma in the U.S. Transuranium and Uranium Registries Joey Zhou, Ph.D. At the Annual Meeting of the Health Physics Society July 15, 2014 in Baltimore A recently

More information

Chapter 2 Probability Topics SPSS T tests

Chapter 2 Probability Topics SPSS T tests Chapter 2 Probability Topics SPSS T tests Data file used: gss.sav In the lecture about chapter 2, only the One-Sample T test has been explained. In this handout, we also give the SPSS methods to perform

More information

Guidelines for Data Collection & Data Entry

Guidelines for Data Collection & Data Entry Guidelines for Data Collection & Data Entry Theresa A Scott, MS Vanderbilt University Department of Biostatistics [email protected] http://biostat.mc.vanderbilt.edu/theresascott Theresa A Scott,

More information

Relational Database: Additional Operations on Relations; SQL

Relational Database: Additional Operations on Relations; SQL Relational Database: Additional Operations on Relations; SQL Greg Plaxton Theory in Programming Practice, Fall 2005 Department of Computer Science University of Texas at Austin Overview The course packet

More information

EXCEL Tutorial: How to use EXCEL for Graphs and Calculations.

EXCEL Tutorial: How to use EXCEL for Graphs and Calculations. EXCEL Tutorial: How to use EXCEL for Graphs and Calculations. Excel is powerful tool and can make your life easier if you are proficient in using it. You will need to use Excel to complete most of your

More information

Getting started with the Stata

Getting started with the Stata Getting started with the Stata 1. Begin by going to a Columbia Computer Labs. 2. Getting started Your first Stata session. Begin by starting Stata on your computer. Using a PC: 1. Click on start menu 2.

More information

Ad Hoc Advanced Table of Contents

Ad Hoc Advanced Table of Contents Ad Hoc Advanced Table of Contents Functions... 1 Adding a Function to the Adhoc Query:... 1 Constant... 2 Coalesce... 4 Concatenate... 6 Add/Subtract... 7 Logical Expressions... 8 Creating a Logical Expression:...

More information

Advanced Statistical Analysis of Mortality. Rhodes, Thomas E. and Freitas, Stephen A. MIB, Inc. 160 University Avenue. Westwood, MA 02090

Advanced Statistical Analysis of Mortality. Rhodes, Thomas E. and Freitas, Stephen A. MIB, Inc. 160 University Avenue. Westwood, MA 02090 Advanced Statistical Analysis of Mortality Rhodes, Thomas E. and Freitas, Stephen A. MIB, Inc 160 University Avenue Westwood, MA 02090 001-(781)-751-6356 fax 001-(781)-329-3379 [email protected] Abstract

More information

Optimization of sampling strata with the SamplingStrata package

Optimization of sampling strata with the SamplingStrata package Optimization of sampling strata with the SamplingStrata package Package version 1.1 Giulio Barcaroli January 12, 2016 Abstract In stratified random sampling the problem of determining the optimal size

More information

Instructions for applying data validation(s) to data fields in Microsoft Excel

Instructions for applying data validation(s) to data fields in Microsoft Excel 1 of 10 Instructions for applying data validation(s) to data fields in Microsoft Excel According to Microsoft Excel, a data validation is used to control the type of data or the values that users enter

More information

Tutorial Segmentation and Classification

Tutorial Segmentation and Classification MARKETING ENGINEERING FOR EXCEL TUTORIAL VERSION 1.0.8 Tutorial Segmentation and Classification Marketing Engineering for Excel is a Microsoft Excel add-in. The software runs from within Microsoft Excel

More information

IBM SPSS Statistics for Beginners for Windows

IBM SPSS Statistics for Beginners for Windows ISS, NEWCASTLE UNIVERSITY IBM SPSS Statistics for Beginners for Windows A Training Manual for Beginners Dr. S. T. Kometa A Training Manual for Beginners Contents 1 Aims and Objectives... 3 1.1 Learning

More information

Company Setup 401k Tab

Company Setup 401k Tab Reference Sheet Company Setup 401k Tab Use this page to define company level 401(k) information, including employee status codes, 401(k) sources, and 401(k) funds. The definitions you create here become

More information

Cancer Cluster Investigation French Limited Superfund Site, Harris County, Texas

Cancer Cluster Investigation French Limited Superfund Site, Harris County, Texas Cancer Cluster Investigation French Limited Superfund Site, Harris County, Texas Time Period: 1995-2011 Prepared by the Texas Department of State Health Services Summary Some residents living in the vicinity

More information

A Guide to Stat/Transfer File Transfer Utility, Version 10

A Guide to Stat/Transfer File Transfer Utility, Version 10 A Guide to Stat/Transfer File Transfer Utility, Version 10 Table of Contents 1.) What is Stat/Transfer, and when can it be used? 2 2.) What files does Stat/Transfer Version 10 support?...2 3.) Doing a

More information

ee-quipment.com ee203 RTCM USB Quick-Start Guide

ee-quipment.com ee203 RTCM USB Quick-Start Guide ee-quipment.com ee203 RTCM USB Quick-Start Guide The ee203 USB interface consists of a flash drive and two virtual serial ports. The required drivers are built into Windows, but the serial ports require

More information

Appendix G STATISTICAL METHODS INFECTIOUS METHODS STATISTICAL ROADMAP. Prepared in Support of: CDC/NCEH Cross Sectional Assessment Study.

Appendix G STATISTICAL METHODS INFECTIOUS METHODS STATISTICAL ROADMAP. Prepared in Support of: CDC/NCEH Cross Sectional Assessment Study. Appendix G STATISTICAL METHODS INFECTIOUS METHODS STATISTICAL ROADMAP Prepared in Support of: CDC/NCEH Cross Sectional Assessment Study Prepared by: Centers for Disease Control and Prevention National

More information

Conditionals (with solutions)

Conditionals (with solutions) Conditionals (with solutions) For exercises 1 to 27, indicate the output that will be produced. Assume the following declarations: final int MAX = 25, LIMIT = 100; int num1 = 12, num2 = 25, num3 = 87;

More information

SPSS: Getting Started. For Windows

SPSS: Getting Started. For Windows For Windows Updated: August 2012 Table of Contents Section 1: Overview... 3 1.1 Introduction to SPSS Tutorials... 3 1.2 Introduction to SPSS... 3 1.3 Overview of SPSS for Windows... 3 Section 2: Entering

More information

1-3 id id no. of respondents 101-300 4 respon 1 responsible for maintenance? 1 = no, 2 = yes, 9 = blank

1-3 id id no. of respondents 101-300 4 respon 1 responsible for maintenance? 1 = no, 2 = yes, 9 = blank Basic Data Analysis Graziadio School of Business and Management Data Preparation & Entry Editing: Inspection & Correction Field Edit: Immediate follow-up (complete? legible? comprehensible? consistent?

More information

Importing Data from a Dat or Text File into SPSS

Importing Data from a Dat or Text File into SPSS Importing Data from a Dat or Text File into SPSS 1. Select File Open Data (Using Text Wizard) 2. Under Files of type, choose Text (*.txt,*.dat) 3. Select the file you want to import. The dat or text file

More information

SAS Analyst for Windows Tutorial

SAS Analyst for Windows Tutorial Updated: August 2012 Table of Contents Section 1: Introduction... 3 1.1 About this Document... 3 1.2 Introduction to Version 8 of SAS... 3 Section 2: An Overview of SAS V.8 for Windows... 3 2.1 Navigating

More information

Guido s Guide to PROC FREQ A Tutorial for Beginners Using the SAS System Joseph J. Guido, University of Rochester Medical Center, Rochester, NY

Guido s Guide to PROC FREQ A Tutorial for Beginners Using the SAS System Joseph J. Guido, University of Rochester Medical Center, Rochester, NY Guido s Guide to PROC FREQ A Tutorial for Beginners Using the SAS System Joseph J. Guido, University of Rochester Medical Center, Rochester, NY ABSTRACT PROC FREQ is an essential procedure within BASE

More information

NCSS Statistical Software

NCSS Statistical Software Chapter 115 Introduction NCSS can import from a wide variety of spreadsheets, databases, and statistical systems. When you import a file, the entire dataset is replaced with the imported data, so make

More information

Client Marketing: Sets

Client Marketing: Sets Client Marketing Client Marketing: Sets Purpose Client Marketing Sets are used for selecting clients from the client records based on certain criteria you designate. Once the clients are selected, you

More information

TUTORIAL: RETRIEVING AND IMPORTING CHCS DATA INTO MICROSOFT EXCEL Purpose: The purpose of this tutorial is to provide step by step instructions on how to retrieve data from CHCS(Composite Healthcare System)

More information

Two Correlated Proportions (McNemar Test)

Two Correlated Proportions (McNemar Test) Chapter 50 Two Correlated Proportions (Mcemar Test) Introduction This procedure computes confidence intervals and hypothesis tests for the comparison of the marginal frequencies of two factors (each with

More information

Lifetime Likelihood of Going to State or Federal Prison

Lifetime Likelihood of Going to State or Federal Prison U.S. Department of Justice Office of Justice Programs Bureau of Justice Statistics Special Report March 1997, NCJ-160092 Lifetime Likelihood of Going to State or Federal Prison By Thomas P. Bonczar and

More information

Analysis of Population Cancer Risk Factors in National Information System SVOD

Analysis of Population Cancer Risk Factors in National Information System SVOD Analysis of Population Cancer Risk Factors in National Information System SVOD Mužík J. 1, Dušek L. 1,2, Pavliš P. 1, Koptíková J. 1, Žaloudík J. 3, Vyzula R. 3 Abstract Human risk assessment requires

More information

Simply Accounting Intelligence Tips and Tricks Booklet Vol. 1

Simply Accounting Intelligence Tips and Tricks Booklet Vol. 1 Simply Accounting Intelligence Tips and Tricks Booklet Vol. 1 1 Contents Accessing the SAI reports... 3 Running, Copying and Pasting reports... 4 Creating and linking a report... 5 Auto e-mailing reports...

More information

Methodologies for Converting Microsoft Excel Spreadsheets to SAS datasets

Methodologies for Converting Microsoft Excel Spreadsheets to SAS datasets Methodologies for Converting Microsoft Excel Spreadsheets to SAS datasets Karin LaPann ViroPharma Incorporated ABSTRACT Much functionality has been added to the SAS to Excel procedures in SAS version 9.

More information

The Little Man Computer

The Little Man Computer The Little Man Computer The Little Man Computer - an instructional model of von Neuman computer architecture John von Neuman (1903-1957) and Alan Turing (1912-1954) each independently laid foundation for

More information

Lesson 14 14 Outline Outline

Lesson 14 14 Outline Outline Lesson 14 Confidence Intervals of Odds Ratio and Relative Risk Lesson 14 Outline Lesson 14 covers Confidence Interval of an Odds Ratio Review of Odds Ratio Sampling distribution of OR on natural log scale

More information

SPSS Workbook 1 Data Entry : Questionnaire Data

SPSS Workbook 1 Data Entry : Questionnaire Data TEESSIDE UNIVERSITY SCHOOL OF HEALTH & SOCIAL CARE SPSS Workbook 1 Data Entry : Questionnaire Data Prepared by: Sylvia Storey [email protected] SPSS data entry 1 This workbook is designed to introduce

More information

Lecture 2 ESTIMATING THE SURVIVAL FUNCTION. One-sample nonparametric methods

Lecture 2 ESTIMATING THE SURVIVAL FUNCTION. One-sample nonparametric methods Lecture 2 ESTIMATING THE SURVIVAL FUNCTION One-sample nonparametric methods There are commonly three methods for estimating a survivorship function S(t) = P (T > t) without resorting to parametric models:

More information

MAS 500 Intelligence Tips and Tricks Booklet Vol. 1

MAS 500 Intelligence Tips and Tricks Booklet Vol. 1 MAS 500 Intelligence Tips and Tricks Booklet Vol. 1 1 Contents Accessing the Sage MAS Intelligence Reports... 3 Copying, Pasting and Renaming Reports... 4 To create a new report from an existing report...

More information

Supplementary online appendix

Supplementary online appendix Supplementary online appendix 1 Table A1: Five-state sample: Data summary Year AZ CA MD NJ NY Total 1991 0 1,430 0 0 0 1,430 1992 0 1,428 0 0 0 1,428 1993 0 1,346 0 0 0 1,346 1994 0 1,410 0 0 0 1,410 1995

More information

Microsoft Access Glossary of Terms

Microsoft Access Glossary of Terms Microsoft Access Glossary of Terms A Free Document From www.chimpytech.com COPYRIGHT NOTICE This document is copyright chimpytech.com. Please feel free to distribute and give away this document to your

More information

Using Formulas, Functions, and Data Analysis Tools Excel 2010 Tutorial

Using Formulas, Functions, and Data Analysis Tools Excel 2010 Tutorial Using Formulas, Functions, and Data Analysis Tools Excel 2010 Tutorial Excel file for use with this tutorial Tutor1Data.xlsx File Location http://faculty.ung.edu/kmelton/data/tutor1data.xlsx Introduction:

More information

The Center for Teaching, Learning, & Technology

The Center for Teaching, Learning, & Technology The Center for Teaching, Learning, & Technology Instructional Technology Workshops Microsoft Excel 2010 Formulas and Charts Albert Robinson / Delwar Sayeed Faculty and Staff Development Programs Colston

More information

Clever SFTP Instructions

Clever SFTP Instructions Clever SFTP Instructions November 10, 2015 Contents 1 Introduction 2 2 General SFTP Setup 2 3 Preparing CSV Files 3 3.1 Preparing schools.csv............................... 4 3.2 Preparing students.csv...............................

More information

Part A. EpiData Entry

Part A. EpiData Entry Part A. EpiData Entry Part A: Quality-assured data capture with EpiData Manager and EpiData EntryClient Exercise 1 A data documentation sheet for a simple questionnaire Exercise 2 Create a basic data entry

More information

Calculating Survival Probabilities Accepted for Publication in Journal of Legal Economics, 2009, Vol. 16(1), pp. 111-126.

Calculating Survival Probabilities Accepted for Publication in Journal of Legal Economics, 2009, Vol. 16(1), pp. 111-126. Calculating Survival Probabilities Accepted for Publication in Journal of Legal Economics, 2009, Vol. 16(1), pp. 111-126. David G. Tucek Value Economics, LLC 13024 Vinson Court St. Louis, MO 63043 Tel:

More information

Once saved, if the file was zipped you will need to unzip it. For the files that I will be posting you need to change the preferences.

Once saved, if the file was zipped you will need to unzip it. For the files that I will be posting you need to change the preferences. 1 Commands in JMP and Statcrunch Below are a set of commands in JMP and Statcrunch which facilitate a basic statistical analysis. The first part concerns commands in JMP, the second part is for analysis

More information

Linear Models in STATA and ANOVA

Linear Models in STATA and ANOVA Session 4 Linear Models in STATA and ANOVA Page Strengths of Linear Relationships 4-2 A Note on Non-Linear Relationships 4-4 Multiple Linear Regression 4-5 Removal of Variables 4-8 Independent Samples

More information

Statistical Analysis for Genetic Epidemiology (S.A.G.E.) Version 6.2 Graphical User Interface (GUI) Manual

Statistical Analysis for Genetic Epidemiology (S.A.G.E.) Version 6.2 Graphical User Interface (GUI) Manual Statistical Analysis for Genetic Epidemiology (S.A.G.E.) Version 6.2 Graphical User Interface (GUI) Manual Department of Epidemiology and Biostatistics Wolstein Research Building 2103 Cornell Rd Case Western

More information

Chapter 4 Displaying and Describing Categorical Data

Chapter 4 Displaying and Describing Categorical Data Chapter 4 Displaying and Describing Categorical Data Chapter Goals Learning Objectives This chapter presents three basic techniques for summarizing categorical data. After completing this chapter you should

More information

PROC SUMMARY Options Beyond the Basics Susmita Pattnaik, PPD Inc, Morrisville, NC

PROC SUMMARY Options Beyond the Basics Susmita Pattnaik, PPD Inc, Morrisville, NC Paper BB-12 PROC SUMMARY Options Beyond the Basics Susmita Pattnaik, PPD Inc, Morrisville, NC ABSTRACT PROC SUMMARY is used for summarizing the data across all observations and is familiar to most SAS

More information

Help File. Version 1.1.4.0 February, 2010. MetaDigger for PC

Help File. Version 1.1.4.0 February, 2010. MetaDigger for PC Help File Version 1.1.4.0 February, 2010 MetaDigger for PC How to Use the Sound Ideas MetaDigger for PC Program: The Sound Ideas MetaDigger for PC program will help you find and work with digital sound

More information

Basic Statistical and Modeling Procedures Using SAS

Basic Statistical and Modeling Procedures Using SAS Basic Statistical and Modeling Procedures Using SAS One-Sample Tests The statistical procedures illustrated in this handout use two datasets. The first, Pulse, has information collected in a classroom

More information

Using Microsoft Access

Using Microsoft Access Using Microsoft Access USING MICROSOFT ACCESS 1 Queries 2 Exercise 1. Setting up a Query 3 Exercise 2. Selecting Fields for Query Output 4 Exercise 3. Saving a Query 5 Query Criteria 6 Exercise 4. Adding

More information

In the general population of 0 to 4-year-olds, the annual incidence of asthma is 1.4%

In the general population of 0 to 4-year-olds, the annual incidence of asthma is 1.4% Hypothesis Testing for a Proportion Example: We are interested in the probability of developing asthma over a given one-year period for children 0 to 4 years of age whose mothers smoke in the home In the

More information

Cal Answers Analysis Training Part I. Creating Analyses in OBIEE

Cal Answers Analysis Training Part I. Creating Analyses in OBIEE Cal Answers Analysis Training Part I Creating Analyses in OBIEE University of California, Berkeley March 2012 Table of Contents Table of Contents... 1 Overview... 2 Getting Around OBIEE... 2 Cal Answers

More information

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as... HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1 PREVIOUSLY used confidence intervals to answer questions such as... You know that 0.25% of women have red/green color blindness. You conduct a study of men

More information

Fewer people with coronary heart disease are being diagnosed as compared to the expected figures.

Fewer people with coronary heart disease are being diagnosed as compared to the expected figures. JSNA Coronary heart disease 1) Key points 2) Introduction 3) National picture 4) Local picture of CHD prevalence 5) Mortality from coronary heart disease in Suffolk County 6) Trends in mortality rates

More information

Travel Distance to Healthcare Centers is Associated with Advanced Colon Cancer at Presentation

Travel Distance to Healthcare Centers is Associated with Advanced Colon Cancer at Presentation Travel Distance to Healthcare Centers is Associated with Advanced Colon Cancer at Presentation Yan Xing, MD, PhD, Ryaz B. Chagpar, MD, MS, Y Nancy You MD, MHSc, Yi Ju Chiang, MSPH, Barry W. Feig, MD, George

More information

Infinite Campus Ad Hoc Reporting Basics

Infinite Campus Ad Hoc Reporting Basics Infinite Campus Ad Hoc Reporting Basics May, 2012 1 Overview The Ad hoc Reporting module allows a user to create reports and run queries for various types of data in Campus. Ad hoc queries may be used

More information

VisionMate Flat Bed Scanner 2D Tube Barcode Reader

VisionMate Flat Bed Scanner 2D Tube Barcode Reader VisionMate Flat Bed Scanner 2D Tube Barcode Reader User s Manual Page 1 Catalog #3111 MAN-21256 Rev G Contact Information North America: Tel: 800.345.0206 email: [email protected] Europe: Tel:

More information

How to use the UNIX commands for incident handling. June 12, 2013 Koichiro (Sparky) Komiyama Sam Sasaki JPCERT Coordination Center, Japan

How to use the UNIX commands for incident handling. June 12, 2013 Koichiro (Sparky) Komiyama Sam Sasaki JPCERT Coordination Center, Japan How to use the UNIX commands for incident handling June 12, 2013 Koichiro (Sparky) Komiyama Sam Sasaki JPCERT Coordination Center, Japan Agenda Training Environment Commands for incident handling network

More information

HOW TO COLLECT AND USE DATA IN EXCEL. Brendon Riggs Texas Juvenile Probation Commission Data Coordinators Conference 2008

HOW TO COLLECT AND USE DATA IN EXCEL. Brendon Riggs Texas Juvenile Probation Commission Data Coordinators Conference 2008 HOW TO COLLECT AND USE DATA IN EXCEL Brendon Riggs Texas Juvenile Probation Commission Data Coordinators Conference 2008 Goals To be able to gather and organize information in Excel To be able to perform

More information

MULTIPLE REGRESSION EXAMPLE

MULTIPLE REGRESSION EXAMPLE MULTIPLE REGRESSION EXAMPLE For a sample of n = 166 college students, the following variables were measured: Y = height X 1 = mother s height ( momheight ) X 2 = father s height ( dadheight ) X 3 = 1 if

More information

Scatter Plots with Error Bars

Scatter Plots with Error Bars Chapter 165 Scatter Plots with Error Bars Introduction The procedure extends the capability of the basic scatter plot by allowing you to plot the variability in Y and X corresponding to each point. Each

More information

Example of a Java program

Example of a Java program Example of a Java program class SomeNumbers static int square (int x) return x*x; public static void main (String[] args) int n=20; if (args.length > 0) // change default n = Integer.parseInt(args[0]);

More information

Moving from CS 61A Scheme to CS 61B Java

Moving from CS 61A Scheme to CS 61B Java Moving from CS 61A Scheme to CS 61B Java Introduction Java is an object-oriented language. This document describes some of the differences between object-oriented programming in Scheme (which we hope you

More information

How to Download Census Data from American Factfinder and Display it in ArcMap

How to Download Census Data from American Factfinder and Display it in ArcMap How to Download Census Data from American Factfinder and Display it in ArcMap Factfinder provides census and ACS (American Community Survey) data that can be downloaded in a tabular format and joined with

More information

Data Management and Analysis for Successful Clinical Research. Lily Wang, PhD Department of Biostatistics Vanderbilt University

Data Management and Analysis for Successful Clinical Research. Lily Wang, PhD Department of Biostatistics Vanderbilt University Data Management and Analysis for Successful Clinical Research Lily Wang, PhD Department of Biostatistics Vanderbilt University Goals of This Presentation Provide an overview on data management and analysis

More information