Cohort Analysis for Genetic Epidemiology (C. A.G. E.) User Reference Manual
|
|
|
- Maria Walton
- 9 years ago
- Views:
Transcription
1 Cohort Analysis for Genetic Epidemiology (C. A.G. E.) User Reference Manual CAGE is a UNIX based program, which calculates the standardized cancer incidence ratios (Observed / Expected) with 95% confidence intervals assuming that the observed number of malignancies follow a Poisson distribution. Some of its application examples include familial aggregation using pedigree data, second onset cancer within cohort of cancer patients and so on. The CAGE program requires three data input files: 1) Registry data file: Connecticut or SEER data in R1 format Note: This file is provided to the user 2) Cohort data file: Your input data file which contains the members of the population (.dat file) 3) Data Input file: A Cohort Analysis Language (CAL) program file telling CAGE how to manipulate the data and what analysis to perform (.inp file) Note: You need to create this file to run your analysis To run CAGE, use the following command line in UNIX: % cage registry data file name data input file name > output file name % cage conn9.r1 test.inp > test.out Registry Data File The newly updated Connecticut Tumor Registry data file (conn9.r1_ctr) and SEER data files (conn9.r1_seerwhite and conn9.r1_seerblack) are both ICD-9 R1 format Connecticut Tumor Registry file, where the conn9.r1_seerwhite data is for SEER white population and the conn9.r1_seerblack data is for SEER black population. Cohort Data File The cohort data file is an ASCII file of rows and columns of data. Each row is made up of columns which represent parameters that describe or apply to one member of the population being studied. Data columns may be of type string (alphanumeric), floating point number or integer. Columns can be delimited by spaces or tabs. Missing value is coded as -1. Certain information is required to be included for each member. These must-have columns are: 1) Sex 2) Age 1
2 3) Birth year 4) Affection status Do not label the column in the first row of the file. Each column must be numeric type. CAGE will give an error message Bus Error (core dumped) if the cohort data file exceeds rows (observation). So if you have a large dataset, please cut your data to two or more files and make sure each file contains less than rows (observations). The commands you can use in Unix to cut your data are: % head [-counts] [file name] > [ new file name] % tail [-counts] [file name] > [new file name] For If you have a large cohort data BT.dat which contains observations, you should cut this data into 2 files: % head BT.dat > first10kbt.dat where first10kbt.dat contains the first obs of BT.dat % tail 5678 BT.dat > last5678bt.dat where last5678bt.dat contains the last 5678 obs of BT.dat You need to run CAGE for each of the two files separately, and then run a S-plus program to combine the two results. Please consult Carol J. Etzel or Mei Liu about how to run the S-plus program. 2
3 Data Input File The input file which uses the CAL describes and manipulates the cohort data and performs analyses of that data. The input file includes five parts and here is an data file is test.dat Part I: Specify the cohort data file labels (REL, BYR, SEX, PBrace, PBsex, PBYOB, cancer, AGE) ; Part II: Label variables sex is col SEX where (male=1, female=2) ; age is col AGE; birth year is col BYR ; affected_status is col cancer where (affected == 1, unaffected == 2) ; Part III: Identify the required fields: Sex, Age, Birth year, Affection status columns group FDR by REL where value == 1 REL where value == 2 REL where value == 3 REL where value == 12; group Fathers by REL where value ==2; group Mothers by REL where value ==3; group Brothers by REL where value ==1 && SEX where value ==1; group Sisters by REL where value ==1 && SEX where value ==2; Part IV: Define analysis groups using group-by-statement analyze FDR for site (MLG) ; analyze Fathers for site (MLG) ; analyze Mothers for site (MLG) ; Part V: Analyze defined groups using analyze-statement analyze Brothers for site (MLG) ; analyze Sisters for site (MLG) ; 3
4 Part I: Specify the cohort data file Tells CAGE what cohort data file to be analyzed data file is filename data file is test.dat Part II: Label variables Tells CAGE what variables in the cohort data file labels(variable1, variable2, Sex, varilble3, Age, variable4, Birth, variable5, variable6, Affected, variable7, variable8, ) * Where variable1, variable2, variable3 are the other variables except age, sex, birth year, affected status in your data file. labels (Familyid, Sex, Age, Affected, Relationship, Birth) * The different variables must be separated by comma * The variable order should be exact the same as they appear in the dataset * Be aware the lower case sex, age, affected status and birth year are keywords for CAL, so they are not allowed using in the label statement. Using upper case Sex, Age, etc, because CAL is case-sensitive. Part III: Identify Sex, Age, Birth year, Affection status columns Tells CAGE where to find the age, sex, birth year, and affected status information age is col column name sex is col column name where(male = value, female = value) birth year is col column name affected_status is col column name where(affected relational-operator value, unaffected relational-operator value) * where relational-operator includes any of == < > <= >= * Be aware = but not == is set for the sex variable age is col Age sex is col Sex where(male =1, female = 2) birth year is col Birth affected_status is col Affected where (affected == 1, unaffected == 0) 4
5 Part IV: Define analysis groups using group-by-statement group-by-statement tells CAGE to define a subgroup with the stated logical commonalties. If the subgroup is defined by only one column variable: group identifier by column name where value relational-op constant * where relational-operator includes any of == < > <= >= group females by Sex where value == 2; group whites by Race where value == 1; If the subgroup is defined by more than one column variable: group identifier by column where value relational-op constant relational-op column where value relational-op constant relational-op column where value relational-op constant *where relational-op includes: (logical or) and &&(logical and) group PBMale by REL where value == 1 REL where value == 2 REL where value == 3 REL where value == 12 && PBsex where value == 1; Part IV: Analyze defined groups using analyze-statement analyze-statement performs observed/expected analysis using the Connecticut Tumor Registry data or SEER data. This analysis is performed on the defined group. analyze group name for site (site list) print summary reset variables * site list is a comma delimited list of the one or more sites taken from the PREFERENT CAUSE LIST * print summary tells CAGE to show the detailed analysis results * reset variables tells CAGE to start a new grouping context analysis analyze females for site(brf,lip) print summary reset variables 5
6 analyze PBAge20 for site (BLA) print summary reset variables Analysis Output File: CAGE gives you the analysis results for each group you define in the input file. The output file contains the input file information, the missing value information, and the analysis results. Here is the output from the analysis for the given input file CONNECTICUT TUMOR FILE IS conn9.r1_ctr CAL FILE IS test.inp POPULATION DATA FILE IS test.dat Input file information Missing essential parameter on member 680 of group FDR: age -1, sex 2, birthyear 1917 Missing essential parameter on member 1170 of group FDR: age -1, sex 2, birthyear 1903 Missing essential parameter on member 1296 of group FDR: age -1, sex 1, birthyear 1933 Missing essential parameter on member 2254 of group FDR: age -1, sex 2, birthyear 1985 Missing essential parameter on member 2358 of group FDR: age -1, sex 2, birthyear 1957 Missing essential parameter on member 2359 of group FDR: age -1, sex 1, birthyear 1958 Missing essential parameter on member 2944 of group FDR: age -1, sex 1, birthyear 1951 Missing essential parameter on member 3122 of group FDR: age 39, sex -1, birthyear 1952 Missing essential parameter on member 3288 of group FDR: age 70, sex -1, birthyear 1921 Missing essential parameter on member 3328 of group FDR: age -1, sex 2, birthyear 1939 Missing essential parameter on member 3329 of group FDR: age -1, sex 1, birthyear 1910 Observations with missing value which are not analyzed by CAGE Group Site Total Pop Tot Affected Sum of Exp O/E L/E U/E Person Years FDR MLG, TOTALS: FDR Missing essential parameter on member 475 of group Fathers: age -1, sex 1, birthyear 1951 Missing essential parameter on member 529 of group Fathers: age 70, sex -1, birthyear 1921 Missing essential parameter on member 534 of group Fathers: age -1, sex 1, birthyear 1910 Analysis results Fathers MLG, TOTALS: Fathers Missing essential parameter on member 104 of group Mothers: age -1, sex 2, birthyear 1917 Missing essential parameter on member 184 of group Mothers: age -1, sex 2, birthyear 1903 Mothers MLG, TOTALS: Mothers Missing essential parameter on member 270 of group Brothers: age -1, sex 1, birthyear 1933 Brothers MLG, TOTALS:
7 Brothers Missing essential parameter on member 638 of group Sisters: age -1, sex 2, birthyear 1939 Sisters MLG, TOTALS: Sisters The analysis results contain group name, site name, total population in your cohort dataset, total affected cases in your cohort dataset, expected number of cases which is obtained by multiplying the age- and gender-specific cancer incidence rates in Connecticut or SEER database by corresponding person-years of your cohort data, the ratio of observed to expected numbers of cases (SIR) with likelihood-based 95% confidence intervals (CI) from Poisson models and person-years. Calculation of Person-years in CAGE CAGE calculates the person- years for each group by adding the person-years for that specific group with the person-years for the preceding analysis groups. Therefore, in order to get the person-years for only the specific group, you need to subtract the personyears by the previous person-years. Below is an Person-years are from the output file example and the actual person-years were recalculated for each analysis group. Group Person-years from CAGE Actual Person-years FDR Father Mother Brother Sister Interpretation of the Results: Using the above output file as an Significantly increased SIRs were observed for MLG for the Sister group (SIR= 1.47, 95% CI =( )). Decreased SIRs were observed for the FDR, Father, Mother, Brother groups, however, the results are insignificant because all the 95% CIs include 1. How to analyze second cancer within cohort of cancer patients? To calculate the risk of second cancer within the cohort of cancer patients, you need to know the age onset for both of the first and second cancers. You will also need an accompanying SPLUS/R function to calculate the final SIRs. This function is available 7
8 upon request. The following are the steps you need to complete to obtain an SIR for a second cancer: First, run CAGE to get the expected number of cancers (E1) and total number of person years (PY1) from the birth to the age onset of the first cancer within the cohort. Second, run CAGE to get the expected number of cancers (E2) total number of person years (PY2) from the birth to the age onset of the second cancer within the cohort Third, Calculate the corrected expected number of cancers (EC) from the age onset of the first cancer to the second cancer: EC=E2-E1 Calculate the correct person years (PYC) from the age onset of the first cancer to the second cancer: PYC=PY2-PY1 Fourth, run the Splus program using EC, PYC and the observed number of second cancers within the cohort to get the SIR and 95% confidence interval for the second cancer. EC=E2-E1 The adjusted person-years for the second onset can be obtained by subtracting the person-years Birth of the second cancer by the person-years First of the first cancer. Second Cancer Cancer E1 E2 8
Time Clock Import Setup & Use
Time Clock Import Setup & Use Document # Product Module Category CenterPoint Payroll Processes (How To) This document outlines how to setup and use of the Time Clock Import within CenterPoint Payroll.
Using the American Community Survey Data
Using the American Community Survey Data The ACS website is accessible from www.census.gov/acs. In the middle of the screen choose In-depth Data Go directly to the ACS Datasets Tab The ACS Datasets Tab
Figure 1.1 Percentage of persons without health insurance coverage: all ages, United States, 1997-2001
Figure 1.1 Percentage of persons without health insurance coverage: all ages, United States, 1997-2001 DATA SOURCE: Family Core component of the 1997-2001 National Health Interview Surveys. The estimate
How to set the main menu of STATA to default factory settings standards
University of Pretoria Data analysis for evaluation studies Examples in STATA version 11 List of data sets b1.dta (To be created by students in class) fp1.xls (To be provided to students) fp1.txt (To be
Adverse Impact Ratio for Females (0/ 1) = 0 (5/ 17) = 0.2941 Adverse impact as defined by the 4/5ths rule was not found in the above data.
1 of 9 12/8/2014 12:57 PM (an On-Line Internet based application) Instructions: Please fill out the information into the form below. Once you have entered your data below, you may select the types of analysis
Odds ratio, Odds ratio test for independence, chi-squared statistic.
Odds ratio, Odds ratio test for independence, chi-squared statistic. Announcements: Assignment 5 is live on webpage. Due Wed Aug 1 at 4:30pm. (9 days, 1 hour, 58.5 minutes ) Final exam is Aug 9. Review
Using Stata for Categorical Data Analysis
Using Stata for Categorical Data Analysis NOTE: These problems make extensive use of Nick Cox s tab_chi, which is actually a collection of routines, and Adrian Mander s ipf command. From within Stata,
Running Descriptive Statistics: Sample and Population Values
Running Descriptive Statistics: Sample and Population Values Goal This exercise is an introduction to a few of the variables in the household- and person-level LIS data sets. The exercise concentrates
RATIOS, PROPORTIONS, PERCENTAGES, AND RATES
RATIOS, PROPORTIOS, PERCETAGES, AD RATES 1. Ratios: ratios are one number expressed in relation to another by dividing the one number by the other. For example, the sex ratio of Delaware in 1990 was: 343,200
Summary Measures (Ratio, Proportion, Rate) Marie Diener-West, PhD Johns Hopkins University
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this
Death Data: CDC Wonder, Texas Health Data, and VitalWeb
Death Data: CDC Wonder, Texas Health Data, and VitalWeb Evidence-Based Public Health Practice Step 2: Quantify the Issue This handout demonstrates how to access CDC Wonder, Texas Health Data, and VitalWeb
Using SPSS, Chapter 2: Descriptive Statistics
1 Using SPSS, Chapter 2: Descriptive Statistics Chapters 2.1 & 2.2 Descriptive Statistics 2 Mean, Standard Deviation, Variance, Range, Minimum, Maximum 2 Mean, Median, Mode, Standard Deviation, Variance,
Two Related Samples t Test
Two Related Samples t Test In this example 1 students saw five pictures of attractive people and five pictures of unattractive people. For each picture, the students rated the friendliness of the person
EXST SAS Lab Lab #4: Data input and dataset modifications
EXST SAS Lab Lab #4: Data input and dataset modifications Objectives 1. Import an EXCEL dataset. 2. Infile an external dataset (CSV file) 3. Concatenate two datasets into one 4. The PLOT statement will
Introduction to STATA 11 for Windows
1/27/2012 Introduction to STATA 11 for Windows Stata Sizes...3 Documentation...3 Availability...3 STATA User Interface...4 Stata Language Syntax...5 Entering and Editing Stata Commands...6 Stata Online
Summary of R software commands used to generate bootstrap and permutation test output and figures in Chapter 16
Summary of R software commands used to generate bootstrap and permutation test output and figures in Chapter 16 Since R is command line driven and the primary software of Chapter 16, this document details
Constructing a Table of Survey Data with Percent and Confidence Intervals in every Direction
Constructing a Table of Survey Data with Percent and Confidence Intervals in every Direction David Izrael, Abt Associates Sarah W. Ball, Abt Associates Sara M.A. Donahue, Abt Associates ABSTRACT We examined
T-SQL STANDARD ELEMENTS
T-SQL STANDARD ELEMENTS SLIDE Overview Types of commands and statement elements Basic SELECT statements Categories of T-SQL statements Data Manipulation Language (DML*) Statements for querying and modifying
Federal Employee Viewpoint Survey Online Reporting and Analysis Tool
Federal Employee Viewpoint Survey Online Reporting and Analysis Tool Tutorial January 2013 NOTE: If you have any questions about the FEVS Online Reporting and Analysis Tool, please contact your OPM point
New Hampshire Childhood Cancer
Introduction: New Hampshire Childhood Cancer New Hampshire, Childhood Cancer, January 2009 Issue Brief Cancer in children is relatively uncommon, impacting fewer than twenty two of every 100,000 children
Independent t- Test (Comparing Two Means)
Independent t- Test (Comparing Two Means) The objectives of this lesson are to learn: the definition/purpose of independent t-test when to use the independent t-test the use of SPSS to complete an independent
Is it statistically significant? The chi-square test
UAS Conference Series 2013/14 Is it statistically significant? The chi-square test Dr Gosia Turner Student Data Management and Analysis 14 September 2010 Page 1 Why chi-square? Tests whether two categorical
CHILDHOOD CANCER SURVIVOR STUDY Analysis Concept Proposal
CHILDHOOD CANCER SURVIVOR STUDY Analysis Concept Proposal 1. STUDY TITLE: Longitudinal Assessment of Chronic Health Conditions: The Aging of Childhood Cancer Survivors 2. WORKING GROUP AND INVESTIGATORS:
Creating Basic Excel Formulas
Creating Basic Excel Formulas Formulas are equations that perform calculations on values in your worksheet. Depending on how you build a formula in Excel will determine if the answer to your formula automatically
3.4 Statistical inference for 2 populations based on two samples
3.4 Statistical inference for 2 populations based on two samples Tests for a difference between two population means The first sample will be denoted as X 1, X 2,..., X m. The second sample will be denoted
Embedded Systems. Review of ANSI C Topics. A Review of ANSI C and Considerations for Embedded C Programming. Basic features of C
Embedded Systems A Review of ANSI C and Considerations for Embedded C Programming Dr. Jeff Jackson Lecture 2-1 Review of ANSI C Topics Basic features of C C fundamentals Basic data types Expressions Selection
WHO STEPS Surveillance Support Materials. STEPS Epi Info Training Guide
STEPS Epi Info Training Guide Department of Chronic Diseases and Health Promotion World Health Organization 20 Avenue Appia, 1211 Geneva 27, Switzerland For further information: www.who.int/chp/steps WHO
Appendix III: SPSS Preliminary
Appendix III: SPSS Preliminary SPSS is a statistical software package that provides a number of tools needed for the analytical process planning, data collection, data access and management, analysis,
Microsoft Excel 2010 Part 3: Advanced Excel
CALIFORNIA STATE UNIVERSITY, LOS ANGELES INFORMATION TECHNOLOGY SERVICES Microsoft Excel 2010 Part 3: Advanced Excel Winter 2015, Version 1.0 Table of Contents Introduction...2 Sorting Data...2 Sorting
Incorrect Analyses of Radiation and Mesothelioma in the U.S. Transuranium and Uranium Registries Joey Zhou, Ph.D.
Incorrect Analyses of Radiation and Mesothelioma in the U.S. Transuranium and Uranium Registries Joey Zhou, Ph.D. At the Annual Meeting of the Health Physics Society July 15, 2014 in Baltimore A recently
Chapter 2 Probability Topics SPSS T tests
Chapter 2 Probability Topics SPSS T tests Data file used: gss.sav In the lecture about chapter 2, only the One-Sample T test has been explained. In this handout, we also give the SPSS methods to perform
Guidelines for Data Collection & Data Entry
Guidelines for Data Collection & Data Entry Theresa A Scott, MS Vanderbilt University Department of Biostatistics [email protected] http://biostat.mc.vanderbilt.edu/theresascott Theresa A Scott,
Relational Database: Additional Operations on Relations; SQL
Relational Database: Additional Operations on Relations; SQL Greg Plaxton Theory in Programming Practice, Fall 2005 Department of Computer Science University of Texas at Austin Overview The course packet
EXCEL Tutorial: How to use EXCEL for Graphs and Calculations.
EXCEL Tutorial: How to use EXCEL for Graphs and Calculations. Excel is powerful tool and can make your life easier if you are proficient in using it. You will need to use Excel to complete most of your
Getting started with the Stata
Getting started with the Stata 1. Begin by going to a Columbia Computer Labs. 2. Getting started Your first Stata session. Begin by starting Stata on your computer. Using a PC: 1. Click on start menu 2.
Ad Hoc Advanced Table of Contents
Ad Hoc Advanced Table of Contents Functions... 1 Adding a Function to the Adhoc Query:... 1 Constant... 2 Coalesce... 4 Concatenate... 6 Add/Subtract... 7 Logical Expressions... 8 Creating a Logical Expression:...
Advanced Statistical Analysis of Mortality. Rhodes, Thomas E. and Freitas, Stephen A. MIB, Inc. 160 University Avenue. Westwood, MA 02090
Advanced Statistical Analysis of Mortality Rhodes, Thomas E. and Freitas, Stephen A. MIB, Inc 160 University Avenue Westwood, MA 02090 001-(781)-751-6356 fax 001-(781)-329-3379 [email protected] Abstract
Optimization of sampling strata with the SamplingStrata package
Optimization of sampling strata with the SamplingStrata package Package version 1.1 Giulio Barcaroli January 12, 2016 Abstract In stratified random sampling the problem of determining the optimal size
Instructions for applying data validation(s) to data fields in Microsoft Excel
1 of 10 Instructions for applying data validation(s) to data fields in Microsoft Excel According to Microsoft Excel, a data validation is used to control the type of data or the values that users enter
Tutorial Segmentation and Classification
MARKETING ENGINEERING FOR EXCEL TUTORIAL VERSION 1.0.8 Tutorial Segmentation and Classification Marketing Engineering for Excel is a Microsoft Excel add-in. The software runs from within Microsoft Excel
IBM SPSS Statistics for Beginners for Windows
ISS, NEWCASTLE UNIVERSITY IBM SPSS Statistics for Beginners for Windows A Training Manual for Beginners Dr. S. T. Kometa A Training Manual for Beginners Contents 1 Aims and Objectives... 3 1.1 Learning
Company Setup 401k Tab
Reference Sheet Company Setup 401k Tab Use this page to define company level 401(k) information, including employee status codes, 401(k) sources, and 401(k) funds. The definitions you create here become
Cancer Cluster Investigation French Limited Superfund Site, Harris County, Texas
Cancer Cluster Investigation French Limited Superfund Site, Harris County, Texas Time Period: 1995-2011 Prepared by the Texas Department of State Health Services Summary Some residents living in the vicinity
A Guide to Stat/Transfer File Transfer Utility, Version 10
A Guide to Stat/Transfer File Transfer Utility, Version 10 Table of Contents 1.) What is Stat/Transfer, and when can it be used? 2 2.) What files does Stat/Transfer Version 10 support?...2 3.) Doing a
ee-quipment.com ee203 RTCM USB Quick-Start Guide
ee-quipment.com ee203 RTCM USB Quick-Start Guide The ee203 USB interface consists of a flash drive and two virtual serial ports. The required drivers are built into Windows, but the serial ports require
Appendix G STATISTICAL METHODS INFECTIOUS METHODS STATISTICAL ROADMAP. Prepared in Support of: CDC/NCEH Cross Sectional Assessment Study.
Appendix G STATISTICAL METHODS INFECTIOUS METHODS STATISTICAL ROADMAP Prepared in Support of: CDC/NCEH Cross Sectional Assessment Study Prepared by: Centers for Disease Control and Prevention National
Conditionals (with solutions)
Conditionals (with solutions) For exercises 1 to 27, indicate the output that will be produced. Assume the following declarations: final int MAX = 25, LIMIT = 100; int num1 = 12, num2 = 25, num3 = 87;
SPSS: Getting Started. For Windows
For Windows Updated: August 2012 Table of Contents Section 1: Overview... 3 1.1 Introduction to SPSS Tutorials... 3 1.2 Introduction to SPSS... 3 1.3 Overview of SPSS for Windows... 3 Section 2: Entering
1-3 id id no. of respondents 101-300 4 respon 1 responsible for maintenance? 1 = no, 2 = yes, 9 = blank
Basic Data Analysis Graziadio School of Business and Management Data Preparation & Entry Editing: Inspection & Correction Field Edit: Immediate follow-up (complete? legible? comprehensible? consistent?
Importing Data from a Dat or Text File into SPSS
Importing Data from a Dat or Text File into SPSS 1. Select File Open Data (Using Text Wizard) 2. Under Files of type, choose Text (*.txt,*.dat) 3. Select the file you want to import. The dat or text file
SAS Analyst for Windows Tutorial
Updated: August 2012 Table of Contents Section 1: Introduction... 3 1.1 About this Document... 3 1.2 Introduction to Version 8 of SAS... 3 Section 2: An Overview of SAS V.8 for Windows... 3 2.1 Navigating
Guido s Guide to PROC FREQ A Tutorial for Beginners Using the SAS System Joseph J. Guido, University of Rochester Medical Center, Rochester, NY
Guido s Guide to PROC FREQ A Tutorial for Beginners Using the SAS System Joseph J. Guido, University of Rochester Medical Center, Rochester, NY ABSTRACT PROC FREQ is an essential procedure within BASE
NCSS Statistical Software
Chapter 115 Introduction NCSS can import from a wide variety of spreadsheets, databases, and statistical systems. When you import a file, the entire dataset is replaced with the imported data, so make
Client Marketing: Sets
Client Marketing Client Marketing: Sets Purpose Client Marketing Sets are used for selecting clients from the client records based on certain criteria you designate. Once the clients are selected, you
TUTORIAL: RETRIEVING AND IMPORTING CHCS DATA INTO MICROSOFT EXCEL Purpose: The purpose of this tutorial is to provide step by step instructions on how to retrieve data from CHCS(Composite Healthcare System)
Two Correlated Proportions (McNemar Test)
Chapter 50 Two Correlated Proportions (Mcemar Test) Introduction This procedure computes confidence intervals and hypothesis tests for the comparison of the marginal frequencies of two factors (each with
Lifetime Likelihood of Going to State or Federal Prison
U.S. Department of Justice Office of Justice Programs Bureau of Justice Statistics Special Report March 1997, NCJ-160092 Lifetime Likelihood of Going to State or Federal Prison By Thomas P. Bonczar and
Analysis of Population Cancer Risk Factors in National Information System SVOD
Analysis of Population Cancer Risk Factors in National Information System SVOD Mužík J. 1, Dušek L. 1,2, Pavliš P. 1, Koptíková J. 1, Žaloudík J. 3, Vyzula R. 3 Abstract Human risk assessment requires
Simply Accounting Intelligence Tips and Tricks Booklet Vol. 1
Simply Accounting Intelligence Tips and Tricks Booklet Vol. 1 1 Contents Accessing the SAI reports... 3 Running, Copying and Pasting reports... 4 Creating and linking a report... 5 Auto e-mailing reports...
Methodologies for Converting Microsoft Excel Spreadsheets to SAS datasets
Methodologies for Converting Microsoft Excel Spreadsheets to SAS datasets Karin LaPann ViroPharma Incorporated ABSTRACT Much functionality has been added to the SAS to Excel procedures in SAS version 9.
The Little Man Computer
The Little Man Computer The Little Man Computer - an instructional model of von Neuman computer architecture John von Neuman (1903-1957) and Alan Turing (1912-1954) each independently laid foundation for
Lesson 14 14 Outline Outline
Lesson 14 Confidence Intervals of Odds Ratio and Relative Risk Lesson 14 Outline Lesson 14 covers Confidence Interval of an Odds Ratio Review of Odds Ratio Sampling distribution of OR on natural log scale
SPSS Workbook 1 Data Entry : Questionnaire Data
TEESSIDE UNIVERSITY SCHOOL OF HEALTH & SOCIAL CARE SPSS Workbook 1 Data Entry : Questionnaire Data Prepared by: Sylvia Storey [email protected] SPSS data entry 1 This workbook is designed to introduce
Lecture 2 ESTIMATING THE SURVIVAL FUNCTION. One-sample nonparametric methods
Lecture 2 ESTIMATING THE SURVIVAL FUNCTION One-sample nonparametric methods There are commonly three methods for estimating a survivorship function S(t) = P (T > t) without resorting to parametric models:
MAS 500 Intelligence Tips and Tricks Booklet Vol. 1
MAS 500 Intelligence Tips and Tricks Booklet Vol. 1 1 Contents Accessing the Sage MAS Intelligence Reports... 3 Copying, Pasting and Renaming Reports... 4 To create a new report from an existing report...
Supplementary online appendix
Supplementary online appendix 1 Table A1: Five-state sample: Data summary Year AZ CA MD NJ NY Total 1991 0 1,430 0 0 0 1,430 1992 0 1,428 0 0 0 1,428 1993 0 1,346 0 0 0 1,346 1994 0 1,410 0 0 0 1,410 1995
Microsoft Access Glossary of Terms
Microsoft Access Glossary of Terms A Free Document From www.chimpytech.com COPYRIGHT NOTICE This document is copyright chimpytech.com. Please feel free to distribute and give away this document to your
Using Formulas, Functions, and Data Analysis Tools Excel 2010 Tutorial
Using Formulas, Functions, and Data Analysis Tools Excel 2010 Tutorial Excel file for use with this tutorial Tutor1Data.xlsx File Location http://faculty.ung.edu/kmelton/data/tutor1data.xlsx Introduction:
The Center for Teaching, Learning, & Technology
The Center for Teaching, Learning, & Technology Instructional Technology Workshops Microsoft Excel 2010 Formulas and Charts Albert Robinson / Delwar Sayeed Faculty and Staff Development Programs Colston
Clever SFTP Instructions
Clever SFTP Instructions November 10, 2015 Contents 1 Introduction 2 2 General SFTP Setup 2 3 Preparing CSV Files 3 3.1 Preparing schools.csv............................... 4 3.2 Preparing students.csv...............................
Part A. EpiData Entry
Part A. EpiData Entry Part A: Quality-assured data capture with EpiData Manager and EpiData EntryClient Exercise 1 A data documentation sheet for a simple questionnaire Exercise 2 Create a basic data entry
Calculating Survival Probabilities Accepted for Publication in Journal of Legal Economics, 2009, Vol. 16(1), pp. 111-126.
Calculating Survival Probabilities Accepted for Publication in Journal of Legal Economics, 2009, Vol. 16(1), pp. 111-126. David G. Tucek Value Economics, LLC 13024 Vinson Court St. Louis, MO 63043 Tel:
Once saved, if the file was zipped you will need to unzip it. For the files that I will be posting you need to change the preferences.
1 Commands in JMP and Statcrunch Below are a set of commands in JMP and Statcrunch which facilitate a basic statistical analysis. The first part concerns commands in JMP, the second part is for analysis
Linear Models in STATA and ANOVA
Session 4 Linear Models in STATA and ANOVA Page Strengths of Linear Relationships 4-2 A Note on Non-Linear Relationships 4-4 Multiple Linear Regression 4-5 Removal of Variables 4-8 Independent Samples
Statistical Analysis for Genetic Epidemiology (S.A.G.E.) Version 6.2 Graphical User Interface (GUI) Manual
Statistical Analysis for Genetic Epidemiology (S.A.G.E.) Version 6.2 Graphical User Interface (GUI) Manual Department of Epidemiology and Biostatistics Wolstein Research Building 2103 Cornell Rd Case Western
Chapter 4 Displaying and Describing Categorical Data
Chapter 4 Displaying and Describing Categorical Data Chapter Goals Learning Objectives This chapter presents three basic techniques for summarizing categorical data. After completing this chapter you should
PROC SUMMARY Options Beyond the Basics Susmita Pattnaik, PPD Inc, Morrisville, NC
Paper BB-12 PROC SUMMARY Options Beyond the Basics Susmita Pattnaik, PPD Inc, Morrisville, NC ABSTRACT PROC SUMMARY is used for summarizing the data across all observations and is familiar to most SAS
Help File. Version 1.1.4.0 February, 2010. MetaDigger for PC
Help File Version 1.1.4.0 February, 2010 MetaDigger for PC How to Use the Sound Ideas MetaDigger for PC Program: The Sound Ideas MetaDigger for PC program will help you find and work with digital sound
Basic Statistical and Modeling Procedures Using SAS
Basic Statistical and Modeling Procedures Using SAS One-Sample Tests The statistical procedures illustrated in this handout use two datasets. The first, Pulse, has information collected in a classroom
Using Microsoft Access
Using Microsoft Access USING MICROSOFT ACCESS 1 Queries 2 Exercise 1. Setting up a Query 3 Exercise 2. Selecting Fields for Query Output 4 Exercise 3. Saving a Query 5 Query Criteria 6 Exercise 4. Adding
In the general population of 0 to 4-year-olds, the annual incidence of asthma is 1.4%
Hypothesis Testing for a Proportion Example: We are interested in the probability of developing asthma over a given one-year period for children 0 to 4 years of age whose mothers smoke in the home In the
Cal Answers Analysis Training Part I. Creating Analyses in OBIEE
Cal Answers Analysis Training Part I Creating Analyses in OBIEE University of California, Berkeley March 2012 Table of Contents Table of Contents... 1 Overview... 2 Getting Around OBIEE... 2 Cal Answers
HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...
HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1 PREVIOUSLY used confidence intervals to answer questions such as... You know that 0.25% of women have red/green color blindness. You conduct a study of men
Fewer people with coronary heart disease are being diagnosed as compared to the expected figures.
JSNA Coronary heart disease 1) Key points 2) Introduction 3) National picture 4) Local picture of CHD prevalence 5) Mortality from coronary heart disease in Suffolk County 6) Trends in mortality rates
Travel Distance to Healthcare Centers is Associated with Advanced Colon Cancer at Presentation
Travel Distance to Healthcare Centers is Associated with Advanced Colon Cancer at Presentation Yan Xing, MD, PhD, Ryaz B. Chagpar, MD, MS, Y Nancy You MD, MHSc, Yi Ju Chiang, MSPH, Barry W. Feig, MD, George
Infinite Campus Ad Hoc Reporting Basics
Infinite Campus Ad Hoc Reporting Basics May, 2012 1 Overview The Ad hoc Reporting module allows a user to create reports and run queries for various types of data in Campus. Ad hoc queries may be used
VisionMate Flat Bed Scanner 2D Tube Barcode Reader
VisionMate Flat Bed Scanner 2D Tube Barcode Reader User s Manual Page 1 Catalog #3111 MAN-21256 Rev G Contact Information North America: Tel: 800.345.0206 email: [email protected] Europe: Tel:
How to use the UNIX commands for incident handling. June 12, 2013 Koichiro (Sparky) Komiyama Sam Sasaki JPCERT Coordination Center, Japan
How to use the UNIX commands for incident handling June 12, 2013 Koichiro (Sparky) Komiyama Sam Sasaki JPCERT Coordination Center, Japan Agenda Training Environment Commands for incident handling network
HOW TO COLLECT AND USE DATA IN EXCEL. Brendon Riggs Texas Juvenile Probation Commission Data Coordinators Conference 2008
HOW TO COLLECT AND USE DATA IN EXCEL Brendon Riggs Texas Juvenile Probation Commission Data Coordinators Conference 2008 Goals To be able to gather and organize information in Excel To be able to perform
MULTIPLE REGRESSION EXAMPLE
MULTIPLE REGRESSION EXAMPLE For a sample of n = 166 college students, the following variables were measured: Y = height X 1 = mother s height ( momheight ) X 2 = father s height ( dadheight ) X 3 = 1 if
Scatter Plots with Error Bars
Chapter 165 Scatter Plots with Error Bars Introduction The procedure extends the capability of the basic scatter plot by allowing you to plot the variability in Y and X corresponding to each point. Each
Example of a Java program
Example of a Java program class SomeNumbers static int square (int x) return x*x; public static void main (String[] args) int n=20; if (args.length > 0) // change default n = Integer.parseInt(args[0]);
Moving from CS 61A Scheme to CS 61B Java
Moving from CS 61A Scheme to CS 61B Java Introduction Java is an object-oriented language. This document describes some of the differences between object-oriented programming in Scheme (which we hope you
How to Download Census Data from American Factfinder and Display it in ArcMap
How to Download Census Data from American Factfinder and Display it in ArcMap Factfinder provides census and ACS (American Community Survey) data that can be downloaded in a tabular format and joined with
Data Management and Analysis for Successful Clinical Research. Lily Wang, PhD Department of Biostatistics Vanderbilt University
Data Management and Analysis for Successful Clinical Research Lily Wang, PhD Department of Biostatistics Vanderbilt University Goals of This Presentation Provide an overview on data management and analysis
