Applied Statistics: Lent Term
|
|
- Tamsyn Sarah Long
- 7 years ago
- Views:
Transcription
1 Applied Statistics: Lent Term Example: Survival data analysis Right Censored Survival Data The data are from a clinical trial investigating the effect of the drug 6-mercaptopurine (6-MP) on the remission times (in weeks) of leukaemia patients. There are two groups of patients in this study: those who were randomised to receive 6-MP and those who were randomised to receive the placebo. These data can be found in the file Leukaem, which is a comma separated value file. Also recorded in this data-set is information on the logarithm (base 10) of the white blood cell count (log(wbc)) and the gender of the 42 patients. In addition, the data-set contains a censoring indicator which identifies those patients who came out of remission (CENSOR=1) and those who were censored (CENSOR=0). Read into R the data-set and call it Leukaem. Examine the data-set. Do you observe any initial differences between the two therapy groups? Kaplan-Meier curves Attach the survival package: library(survival) To create a survival object that describes the right censored survival data to R, we issue the following command: Surv(time=Leukaem$RMISTIME,event=Leukaem$CENSOR) where the two arguments correspond to the observed survival time (time) and the event status indicator (event). Type?Surv to find out more about Surv. The overall survival curve with confidence band can be obtained as follows: plot(survfit(surv(rmistime,censor)~1,data=leukaem),conf.int=t,col=2) The tabular form of this curve can also be obtained: summary(survfit(surv(rmistime,censor)~1,data=leukaem)) The survival curves for the two therapy groups can be displayed as follows:
2 plot(survfit(surv(rmistime,censor)~drug,data=leukaem),col=2:3,lty=1:2,xlab="remission times in weeks",ylab="proportion in remission") legend(25,0.8,c("placebo","6-mp"), lty=1:2,col=2:3) title("kaplan-meier curves for the Leukaemia Patients") Plot now the survival curves by sex and then by log(wbc). Note that for the log(wbc) variable you need to decide how to categorise it. Comparing survival curves From looking at the survival curves for the two drug groups there appears to be a difference in the remission experiences of the 6-MP and Placebo groups, with patients in the 6-MP group remaining, overall, in remission longer than patients in the Placebo group. To confirm this we perform the log-rank test. survdiff(surv(rmistime,censor)~drug,data=leukaem) Perform now the log-rank test on the gender and the categorised log(wbc) variables. What are the results? Was it appropriate to use the log-rank test for these variables? Cox proportional hazards model Below are the commands for fitting a particular Cox proportional hazards model. Use these commands to find the "best" fitting model. leuk.ph <- coxph(surv(rmistime,censor)~drug,data=leukaem) summary(leuk.ph) Do you understand the output? Is there a drug*log(wbc) interaction? Interpret the results. Can you figure out how to plot the predicted survival curve for a person who is on 6-MP and has a log(wbc) of 4.00? (Hint:?survfit) Assessing the validity of the proportional hazards assumption We now assess whether the proportional hazards assumption is valid for the drug variable. First graphically, plot(survfit(surv(rmistime,censor)~drug,data=leukaem),col=2:3,lty=1:2,fun="cloglog", xlim=c(1,40), ylim= c(-3,3), xlab="time of remission (weeks)",ylab="log(-log(s(t)))") Then analytically, leuk.ph <- coxph(surv(rmistime,censor)~drug,data=leukaem) cox.zph(leuk.ph) # using the default K-M transformation of time Test whether the proportional hazards assumption is being satisfied for the sex and log(wbc) variables (and any other variable you create or use). If the PH assumption is not satisfied, re-fit the Cox model but now stratify by the variable not satisfying the
3 PH assumption. This is done by using the function strata("variable name") as an argument on the right hand side of the formula in coxph(.). That is, coxph(surv(rmistime,censor) ~ strata(.) +...,data=leukaem). Now find the most appropriate model and interpret your results. Other R commands which may be useful are step and basehaz. Below is the Leukaem data used for the above practical. RMISTIME DRUG CENSOR LOGWBC SEX
4 Male =1; Female =0; 6-MP =1 Placebo =0 and CENSOR=1 indicates not in remission anymore. Remission is a disease symptom-free state.
5 Right Censored and Left Truncated Survival Data The data-set used in this part of the practical is taken from Klein and Moeschberger (2003). It consists of the ages (in months) of 462 elderly individuals (97 males and 365 females) who were resident at the retirement centre, Channing House, during the period January 1964 to July 1975 when they died or left the centre, as well as the ages (in months) at which the individuals entered the retirement community. Interest lies in the life times of the individuals. However, in addition to right censoring, the life times in this data-set are subject to left truncation as an individual must survive to a sufficient age to enter the retirement community. Individuals who die at an early age are excluded from the study. Also, included in the data-set is the gender of the individuals (coded: Male=1 and Female=2). The data channing can be found in library(kmsurv). To get access to the data once the KMsurv package is loaded type data(channing) Familiarise yourself with the data-set. Do you notice anything peculiar? Modified Kaplan-Meier curves Before estimating the Kaplan-Meier estimator we need to tell R that this is a survival data-set (i.e. a survival object) with right censoring and left truncation. We do this through use of the function Surv(), once again. srvobj <- Surv(time = channing$ageentry, time2 = channing$age, event = channing$death, type="counting")) Can you tell why this does not work? To overcome this problem we create a new data-set called channing1. channing1 <- channing[(channing$time!=0),] Now repeat the earlier step using channing1 instead of channing to create the survival object. Below we have used the function with srvobj <- with(channing1, Surv(time = ageentry, time2 = age, event = death, type="counting")) srvobj The overall estimated K-M curve can now be calculated. summary(survfit(srvobj~1)) plot(survfit(srvobj~1), xlab ="Age at death (in months)", ylab = "Estimated Conditional Proportion Surviving") How do you interpret this estimator?
6 The estimated survival curves by gender can also be calculated. plot(survfit(srvobj~gender,data=channing1), lty=1:2, col=1:2, xlab = "Age at death (in months)", ylab = "Estimated Conditional Proportion Surviving") legend(200,0.9,c("male","female"),lty=1:2,col=1:2) Do you notice anything peculiar? Let us now look at the males in a bit more detail. summary(survfit(srvobj~1,data=channing1,subset=(gender==1))) Do you understand what has happened here? Does this estimator have any real meaning? Let us now estimate the conditional survival curves for males and females separately, given that they have survived up to age 68 years old (i.e. 816 months). channing2 <- channing1[(channing1$ageentry>=816),] srvobj2 <- Surv(time = channing2$ageentry, time2 = channing2$age, event = channing2$death, type="counting") srvobj2 plot(survfit(srvobj2~gender,data=channing2), lty=1:2, col=1:2, xlim = c(800,1250), xlab = "Age at death (in months)", ylab = "Estimated Conditional Proportion Surviving") legend(1100,0.9,c("male","female"),lty=1:2,col=1:2) Is there any evidence (by eye-balling) of a difference between the survival experiences of males and females? Can you figure out how to estimate the conditional survival probabilities for males and females separately, given that they have survived up to age 75 years old? Cox proportional hazards model Let us now investigate formally whether there is an effect of gender on the survival experiences of the individuals in Channing House. channing.cox <- coxph(srvobj~factor(gender),data=channing1) summary(channing.cox) Can you interpret the results obtained? What is the hazards ratio for males versus females? The R function relevel is a useful command when one would like to change the reference level of a factor variable. See if you can figure out how to apply it above. Let us end by checking the proportional hazards assumption analytically. cox.zph(channing.cox) Is there any evidence for non-proportionality of hazards?
Linda Staub & Alexandros Gekenidis
Seminar in Statistics: Survival Analysis Chapter 2 Kaplan-Meier Survival Curves and the Log- Rank Test Linda Staub & Alexandros Gekenidis March 7th, 2011 1 Review Outcome variable of interest: time until
More informationPackage smoothhr. November 9, 2015
Encoding UTF-8 Type Package Depends R (>= 2.12.0),survival,splines Package smoothhr November 9, 2015 Title Smooth Hazard Ratio Curves Taking a Reference Value Version 1.0.2 Date 2015-10-29 Author Artur
More informationThe Kaplan-Meier Plot. Olaf M. Glück
The Kaplan-Meier Plot 1 Introduction 2 The Kaplan-Meier-Estimator (product limit estimator) 3 The Kaplan-Meier Curve 4 From planning to the Kaplan-Meier Curve. An Example 5 Sources & References 1 Introduction
More informationVignette for survrm2 package: Comparing two survival curves using the restricted mean survival time
Vignette for survrm2 package: Comparing two survival curves using the restricted mean survival time Hajime Uno Dana-Farber Cancer Institute March 16, 2015 1 Introduction In a comparative, longitudinal
More informationSurvey, Statistics and Psychometrics Core Research Facility University of Nebraska-Lincoln. Log-Rank Test for More Than Two Groups
Survey, Statistics and Psychometrics Core Research Facility University of Nebraska-Lincoln Log-Rank Test for More Than Two Groups Prepared by Harlan Sayles (SRAM) Revised by Julia Soulakova (Statistics)
More informationCompeting-risks regression
Competing-risks regression Roberto G. Gutierrez Director of Statistics StataCorp LP Stata Conference Boston 2010 R. Gutierrez (StataCorp) Competing-risks regression July 15-16, 2010 1 / 26 Outline 1. Overview
More informationLecture 2 ESTIMATING THE SURVIVAL FUNCTION. One-sample nonparametric methods
Lecture 2 ESTIMATING THE SURVIVAL FUNCTION One-sample nonparametric methods There are commonly three methods for estimating a survivorship function S(t) = P (T > t) without resorting to parametric models:
More informationA Handbook of Statistical Analyses Using R. Brian S. Everitt and Torsten Hothorn
A Handbook of Statistical Analyses Using R Brian S. Everitt and Torsten Hothorn CHAPTER 6 Logistic Regression and Generalised Linear Models: Blood Screening, Women s Role in Society, and Colonic Polyps
More informationMeasures of Prognosis. Sukon Kanchanaraksa, PhD Johns Hopkins University
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this
More informationKaplan-Meier Survival Analysis 1
Version 4.0 Step-by-Step Examples Kaplan-Meier Survival Analysis 1 With some experiments, the outcome is a survival time, and you want to compare the survival of two or more groups. Survival curves show,
More informationSAS and R calculations for cause specific hazard ratios in a competing risks analysis with time dependent covariates
SAS and R calculations for cause specific hazard ratios in a competing risks analysis with time dependent covariates Martin Wolkewitz, Ralf Peter Vonberg, Hajo Grundmann, Jan Beyersmann, Petra Gastmeier,
More informationTips for surviving the analysis of survival data. Philip Twumasi-Ankrah, PhD
Tips for surviving the analysis of survival data Philip Twumasi-Ankrah, PhD Big picture In medical research and many other areas of research, we often confront continuous, ordinal or dichotomous outcomes
More informationA short course in Longitudinal Data Analysis ESRC Research Methods and Short Course Material for Practicals with the joiner package.
A short course in Longitudinal Data Analysis ESRC Research Methods and Short Course Material for Practicals with the joiner package. Lab 2 - June, 2008 1 jointdata objects To analyse longitudinal data
More informationComparison of Survival Curves
Comparison of Survival Curves We spent the last class looking at some nonparametric approaches for estimating the survival function, Ŝ(t), over time for a single sample of individuals. Now we want to compare
More information5 Correlation and Data Exploration
5 Correlation and Data Exploration Correlation In Unit 3, we did some correlation analyses of data from studies related to the acquisition order and acquisition difficulty of English morphemes by both
More informationSurvival Analysis of the Patients Diagnosed with Non-Small Cell Lung Cancer Using SAS Enterprise Miner 13.1
Paper 11682-2016 Survival Analysis of the Patients Diagnosed with Non-Small Cell Lung Cancer Using SAS Enterprise Miner 13.1 Raja Rajeswari Veggalam, Akansha Gupta; SAS and OSU Data Mining Certificate
More informationLife Tables. Marie Diener-West, PhD Sukon Kanchanaraksa, PhD
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this
More informationIntroduction. Survival Analysis. Censoring. Plan of Talk
Survival Analysis Mark Lunt Arthritis Research UK Centre for Excellence in Epidemiology University of Manchester 01/12/2015 Survival Analysis is concerned with the length of time before an event occurs.
More informationRandomized trials versus observational studies
Randomized trials versus observational studies The case of postmenopausal hormone therapy and heart disease Miguel Hernán Harvard School of Public Health www.hsph.harvard.edu/causal Joint work with James
More informationEfficacy analysis and graphical representation in Oncology trials - A case study
Efficacy analysis and graphical representation in Oncology trials - A case study Anindita Bhattacharjee Vijayalakshmi Indana Cytel, Pune The views expressed in this presentation are our own and do not
More informationTechnical Information
Technical Information Trials The questions for Progress Test in English (PTE) were developed by English subject experts at the National Foundation for Educational Research. For each test level of the paper
More informationAbstract. Introduction. System Requirement. GUI Design. Paper AD17-2011
Paper AD17-2011 Application for Survival Analysis through Microsoft Access GUI Zhong Yan, i3, Indianapolis, IN Jie Li, i3, Austin, Texas Jiazheng (Steven) He, i3, San Francisco, California Abstract This
More informationGraphics in R. Biostatistics 615/815
Graphics in R Biostatistics 615/815 Last Lecture Introduction to R Programming Controlling Loops Defining your own functions Today Introduction to Graphics in R Examples of commonly used graphics functions
More informationLOGISTIC REGRESSION ANALYSIS
LOGISTIC REGRESSION ANALYSIS C. Mitchell Dayton Department of Measurement, Statistics & Evaluation Room 1230D Benjamin Building University of Maryland September 1992 1. Introduction and Model Logistic
More informationUnit 12 Logistic Regression Supplementary Chapter 14 in IPS On CD (Chap 16, 5th ed.)
Unit 12 Logistic Regression Supplementary Chapter 14 in IPS On CD (Chap 16, 5th ed.) Logistic regression generalizes methods for 2-way tables Adds capability studying several predictors, but Limited to
More informationSUMAN DUVVURU STAT 567 PROJECT REPORT
SUMAN DUVVURU STAT 567 PROJECT REPORT SURVIVAL ANALYSIS OF HEROIN ADDICTS Background and introduction: Current illicit drug use among teens is continuing to increase in many countries around the world.
More informationBIOM611 Biological Data Analysis
BIOM611 Biological Data Analysis Spring, 2015 Tentative Syllabus Introduction BIOMED611 is a ½ unit course required for all 1 st year BGS students (except GCB students). It will provide an introduction
More informationTests for Two Survival Curves Using Cox s Proportional Hazards Model
Chapter 730 Tests for Two Survival Curves Using Cox s Proportional Hazards Model Introduction A clinical trial is often employed to test the equality of survival distributions of two treatment groups.
More informationJournal of Statistical Software
JSS Journal of Statistical Software January 2011, Volume 38, Issue 5. http://www.jstatsoft.org/ Lexis: An R Class for Epidemiological Studies with Long-Term Follow-Up Martyn Plummer International Agency
More informationParametric and non-parametric statistical methods for the life sciences - Session I
Why nonparametric methods What test to use? Rank Tests Parametric and non-parametric statistical methods for the life sciences - Session I Liesbeth Bruckers Geert Molenberghs Interuniversity Institute
More information2 Right Censoring and Kaplan-Meier Estimator
2 Right Censoring and Kaplan-Meier Estimator In biomedical applications, especially in clinical trials, two important issues arise when studying time to event data (we will assume the event to be death.
More informationLung Cancer. Public Outcomes Report. Submitted by Omar A. Majid, MD
Public Outcomes Report Lung Cancer Submitted by Omar A. Majid, MD Lung cancer is the most common cancer-related cause of death among men and women. It has been estimated that there will be 226,1 new cases
More information200609 - ATV - Lifetime Data Analysis
Coordinating unit: Teaching unit: Academic year: Degree: ECTS credits: 2015 200 - FME - School of Mathematics and Statistics 715 - EIO - Department of Statistics and Operations Research 1004 - UB - (ENG)Universitat
More informationIf several different trials are mentioned in one publication, the data of each should be extracted in a separate data extraction form.
General Remarks This template of a data extraction form is intended to help you to start developing your own data extraction form, it certainly has to be adapted to your specific question. Delete unnecessary
More informationPackage SurvCorr. February 26, 2015
Type Package Title Correlation of Bivariate Survival Times Version 1.0 Date 2015-02-25 Package SurvCorr February 26, 2015 Author Meinhard Ploner, Alexandra Kaider and Georg Heinze Maintainer Georg Heinze
More informationStage IV Renal Cell Carcinoma. Changing Management in A Comprehensive Community Cancer Center. Susquehanna Health Cancer Center
Stage IV Renal Cell Carcinoma Changing Management in A Comprehensive Community Cancer Center Susquehanna Health Cancer Center 2000 2009 Warren L. Robinson, MD, FACP January 27, 2014 Introduction 65,150
More informationPaper PO06. Randomization in Clinical Trial Studies
Paper PO06 Randomization in Clinical Trial Studies David Shen, WCI, Inc. Zaizai Lu, AstraZeneca Pharmaceuticals ABSTRACT Randomization is of central importance in clinical trials. It prevents selection
More informationBasic Statistics and Data Analysis for Health Researchers from Foreign Countries
Basic Statistics and Data Analysis for Health Researchers from Foreign Countries Volkert Siersma siersma@sund.ku.dk The Research Unit for General Practice in Copenhagen Dias 1 Content Quantifying association
More informationADVANCE: a factorial randomised trial of blood pressure lowering and intensive glucose control in 11,140 patients with type 2 diabetes
ADVANCE: a factorial randomised trial of blood pressure lowering and intensive glucose control in 11,140 patients with type 2 diabetes Effects of a fixed combination of the ACE inhibitor, perindopril,
More informationEach function call carries out a single task associated with drawing the graph.
Chapter 3 Graphics with R 3.1 Low-Level Graphics R has extensive facilities for producing graphs. There are both low- and high-level graphics facilities. The low-level graphics facilities provide basic
More informationSupplementary appendix
Supplementary appendix This appendix formed part of the original submission and has been peer reviewed. We post it as supplied by the authors. Supplement to: Gold R, Giovannoni G, Selmaj K, et al, for
More informationSPONTANEOUS MESOTHELIOMA DATA: AN INTERPRETATION. Robin Howie, Robin Howie Associates, Edinburgh.
SPONTANEOUS MESOTHELIOMA DATA: AN INTERPRETATION Robin Howie, Robin Howie Associates, Edinburgh. SPONTANEOUS MESOTHELIOMA MESOTHELIOMA DEATHS HSE (2003a) estimated there are about 26 spontaneous deaths
More informationII. DISTRIBUTIONS distribution normal distribution. standard scores
Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,
More informationSolutions to Homework 10 Statistics 302 Professor Larget
s to Homework 10 Statistics 302 Professor Larget Textbook Exercises 7.14 Rock-Paper-Scissors (Graded for Accurateness) In Data 6.1 on page 367 we see a table, reproduced in the table below that shows the
More informationProduct Development News
Article from: Product Development News May 2006 Issue 65 Features Comfort Food for an Actuary: Cognitive Testing in Underwriting the Elderly 1 by Eric D. Golus, Laura Vecchione and Thomas Ashley Eric D.
More informationPRACTICE PROBLEMS FOR BIOSTATISTICS
PRACTICE PROBLEMS FOR BIOSTATISTICS BIOSTATISTICS DESCRIBING DATA, THE NORMAL DISTRIBUTION 1. The duration of time from first exposure to HIV infection to AIDS diagnosis is called the incubation period.
More informationSimple Predictive Analytics Curtis Seare
Using Excel to Solve Business Problems: Simple Predictive Analytics Curtis Seare Copyright: Vault Analytics July 2010 Contents Section I: Background Information Why use Predictive Analytics? How to use
More informationSummary of R software commands used to generate bootstrap and permutation test output and figures in Chapter 16
Summary of R software commands used to generate bootstrap and permutation test output and figures in Chapter 16 Since R is command line driven and the primary software of Chapter 16, this document details
More informationTopic 3 - Survival Analysis
Topic 3 - Survival Analysis 1. Topics...2 2. Learning objectives...3 3. Grouped survival data - leukemia example...4 3.1 Cohort survival data schematic...5 3.2 Tabulation of events and time at risk...6
More informationNormal Distribution. Definition A continuous random variable has a normal distribution if its probability density. f ( y ) = 1.
Normal Distribution Definition A continuous random variable has a normal distribution if its probability density e -(y -µ Y ) 2 2 / 2 σ function can be written as for < y < as Y f ( y ) = 1 σ Y 2 π Notation:
More information# load in the files containing the methyaltion data and the source # code containing the SSRPMM functions
################ EXAMPLE ANALYSES TO ILLUSTRATE SS-RPMM ######################## # load in the files containing the methyaltion data and the source # code containing the SSRPMM functions # Note, the SSRPMM
More informationPharmacogenomic markers in EGFR-targeted therapy of lung cancer
Pharmacogenomic markers in EGFR-targeted therapy of lung cancer Rafal Dziadziuszko, MD, PhD University of Colorado Cancer Center, Aurora, CO, USA Medical University of Gdansk, Poland EMEA Workshop on Biomarkers,
More informationStudents' Opinion about Universities: The Faculty of Economics and Political Science (Case Study)
Cairo University Faculty of Economics and Political Science Statistics Department English Section Students' Opinion about Universities: The Faculty of Economics and Political Science (Case Study) Prepared
More informationEvaluation of Treatment Pathways in Oncology: Modeling Approaches. Feng Pan, PhD United BioSource Corporation Bethesda, MD
Evaluation of Treatment Pathways in Oncology: Modeling Approaches Feng Pan, PhD United BioSource Corporation Bethesda, MD 1 Objectives Rationale for modeling treatment pathways Treatment pathway simulation
More information2. Background This was the fourth submission for everolimus requesting listing for clear cell renal carcinoma.
PUBLIC SUMMARY DOCUMENT Product: Everolimus, tablets, 5 mg and 10 mg, Afinitor Sponsor: Novartis Pharmaceuticals Australia Pty Ltd Date of PBAC Consideration: November 2011 1. Purpose of Application To
More informationRegression Modeling Strategies
Frank E. Harrell, Jr. Regression Modeling Strategies With Applications to Linear Models, Logistic Regression, and Survival Analysis With 141 Figures Springer Contents Preface Typographical Conventions
More informationMissing Data in Survival Analysis and Results from the MESS Trial
Missing Data in Survival Analysis and Results from the MESS Trial J. K. Rogers J. L. Hutton K. Hemming Department of Statistics University of Warwick Research Students Conference, 2008 Outline Background
More informationAnalysis and Interpretation of Clinical Trials. How to conclude?
www.eurordis.org Analysis and Interpretation of Clinical Trials How to conclude? Statistical Issues Dr Ferran Torres Unitat de Suport en Estadística i Metodología - USEM Statistics and Methodology Support
More informationGetting started with qplot
Chapter 2 Getting started with qplot 2.1 Introduction In this chapter, you will learn to make a wide variety of plots with your first ggplot2 function, qplot(), short for quick plot. qplot makes it easy
More informationThe Cox Proportional Hazards Model
The Cox Proportional Hazards Model Mario Chen, PhD Advanced Biostatistics and RCT Workshop Office of AIDS Research, NIH ICSSC, FHI Goa, India, September 2009 1 The Model h i (t)=h 0 (t)exp(z i ), Z i =
More informationQ&A on methodology on HIV estimates
Q&A on methodology on HIV estimates 09 Understanding the latest estimates of the 2008 Report on the global AIDS epidemic Part one: The data 1. What data do UNAIDS and WHO base their HIV prevalence estimates
More informationCHILDHOOD CANCER SURVIVOR STUDY Analysis Concept Proposal
CHILDHOOD CANCER SURVIVOR STUDY Analysis Concept Proposal 1. STUDY TITLE: Longitudinal Assessment of Chronic Health Conditions: The Aging of Childhood Cancer Survivors 2. WORKING GROUP AND INVESTIGATORS:
More informationAdvanced Quantitative Methods for Health Care Professionals PUBH 742 Spring 2015
1 Advanced Quantitative Methods for Health Care Professionals PUBH 742 Spring 2015 Instructor: Joanne M. Garrett, PhD e-mail: joanne_garrett@med.unc.edu Class Notes: Copies of the class lecture slides
More informationIs it statistically significant? The chi-square test
UAS Conference Series 2013/14 Is it statistically significant? The chi-square test Dr Gosia Turner Student Data Management and Analysis 14 September 2010 Page 1 Why chi-square? Tests whether two categorical
More informationSeton Medical Center Hepatocellular Carcinoma Patterns of Care Study Rate of Treatment with Chemoembolization 2007 2012 N = 50
General Data Seton Medical Center Hepatocellular Carcinoma Patterns of Care Study Rate of Treatment with Chemoembolization 2007 2012 N = 50 The vast majority of the patients in this study were diagnosed
More informationDidacticiel Études de cas
1 Theme Data Mining with R The rattle package. R (http://www.r project.org/) is one of the most exciting free data mining software projects of these last years. Its popularity is completely justified (see
More informationExpectancy Effects of Performance Enhancing Supplements on Motivation to Exercise. Chris Dawson and Alfonso Ribero.
Running Head: EXPECTANCY EFFECTS ON MOTIVATION Expectancy Effects of Performance Enhancing Supplements on Motivation to Exercise Chris Dawson and Alfonso Ribero Hanover College Expectancy Effects 2 Abstract
More informationSome Statistical Applications In The Financial Services Industry
Some Statistical Applications In The Financial Services Industry Wenqing Lu May 30, 2008 1 Introduction Examples of consumer financial services credit card services mortgage loan services auto finance
More informationPrognostic impact of uric acid in patients with stable coronary artery disease
Prognostic impact of uric acid in patients with stable coronary artery disease Gjin Ndrepepa, Siegmund Braun, Martin Hadamitzky, Massimiliano Fusaro, Hans-Ullrich Haase, Kathrin A. Birkmeier, Albert Schomig,
More informationHow to get accurate sample size and power with nquery Advisor R
How to get accurate sample size and power with nquery Advisor R Brian Sullivan Statistical Solutions Ltd. ASA Meeting, Chicago, March 2007 Sample Size Two group t-test χ 2 -test Survival Analysis 2 2 Crossover
More informationsample median Sample quartiles sample deciles sample quantiles sample percentiles Exercise 1 five number summary # Create and view a sorted
Sample uartiles We have seen that the sample median of a data set {x 1, x, x,, x n }, sorted in increasing order, is a value that divides it in such a way, that exactly half (i.e., 50%) of the sample observations
More informationVariability in Life-Table Estimates Based on the National Health Interview Survey Linked Mortality Files
Variability in Life-Table Estimates Based on the National Health Interview Survey Linked Mortality Files Van Parsons 1, Nathaniel Schenker, Kimberly Lochner, Gloria Wheatcroft and Elsie Pamuk 1 National
More informationLife Insurance Modelling: Notes for teachers. Overview
Life Insurance Modelling: Notes for teachers In this mathematics activity, students model the following situation: An investment fund is set up Investors pay a set amount at the beginning of 20 years.
More informationWhat Cancer Patients Need To Know
Taking Part in Clinical Trials What Cancer Patients Need To Know NATIONAL INSTITUTES OF HEALTH National Cancer Institute Generous support for this publication was provided by Novartis Oncology. Taking
More informationThrombin generation and recurrent venous thromboembolism
Thrombin generation and recurrent venous thromboembolism T. Zhu, W. Ye, F. Dali Ali, L. Carcaillon, V. Remones, M. Alhenc Gelas, P. Gaussem, P.Y. Scarabin, J. Emmerich INSERM U765, University Paris Descartes,
More informationDevelopment of the nomolog program and its evolution
Development of the nomolog program and its evolution Towards the implementation of a nomogram generator for the Cox regression Alexander Zlotnik, Telecom.Eng. Víctor Abraira Santos, PhD Ramón y Cajal University
More informationSurvival Analysis of Left Truncated Income Protection Insurance Data. [March 29, 2012]
Survival Analysis of Left Truncated Income Protection Insurance Data [March 29, 2012] 1 Qing Liu 2 David Pitt 3 Yan Wang 4 Xueyuan Wu Abstract One of the main characteristics of Income Protection Insurance
More informationThe Procedures of Monte Carlo Simulation (and Resampling)
154 Resampling: The New Statistics CHAPTER 10 The Procedures of Monte Carlo Simulation (and Resampling) A Definition and General Procedure for Monte Carlo Simulation Summary Until now, the steps to follow
More informationMethodology Understanding the HIV estimates
UNAIDS July 2014 Methodology Understanding the HIV estimates Produced by the Strategic Information and Monitoring Division Notes on UNAIDS methodology Unless otherwise stated, findings in this report are
More informationAn Application of Weibull Analysis to Determine Failure Rates in Automotive Components
An Application of Weibull Analysis to Determine Failure Rates in Automotive Components Jingshu Wu, PhD, PE, Stephen McHenry, Jeffrey Quandt National Highway Traffic Safety Administration (NHTSA) U.S. Department
More informationBig data size isn t enough! Irene Petersen, PhD Primary Care & Population Health
Big data size isn t enough! Irene Petersen, PhD Primary Care & Population Health Introduction Reader (Statistics and Epidemiology) Research team epidemiologists/statisticians/phd students Primary care
More informationTutorial 3: Graphics and Exploratory Data Analysis in R Jason Pienaar and Tom Miller
Tutorial 3: Graphics and Exploratory Data Analysis in R Jason Pienaar and Tom Miller Getting to know the data An important first step before performing any kind of statistical analysis is to familiarize
More informationLife Settlements and Viaticals
Life Settlements and Viaticals Paul Wilmott, Wilmott Associates, paul@wilmottcom This is taken from the Second Edition of Paul Wilmott On Quantitative Finance, published by John Wiley & Sons 26 In this
More informationBeyond Age and Sex: Enhancing Annuity Pricing
Beyond Age and Sex: Enhancing Annuity Pricing Joelle HY. Fong The Wharton School (IRM Dept), and CEPAR CEPAR Demography & Longevity Workshop University of New South Wales 26 July 2011 Motivation Current
More informationStatistical estimation using confidence intervals
0894PP_ch06 15/3/02 11:02 am Page 135 6 Statistical estimation using confidence intervals In Chapter 2, the concept of the central nature and variability of data and the methods by which these two phenomena
More informationAdvanced Statistical Analysis of Mortality. Rhodes, Thomas E. and Freitas, Stephen A. MIB, Inc. 160 University Avenue. Westwood, MA 02090
Advanced Statistical Analysis of Mortality Rhodes, Thomas E. and Freitas, Stephen A. MIB, Inc 160 University Avenue Westwood, MA 02090 001-(781)-751-6356 fax 001-(781)-329-3379 trhodes@mib.com Abstract
More informationBinary Logistic Regression
Binary Logistic Regression Main Effects Model Logistic regression will accept quantitative, binary or categorical predictors and will code the latter two in various ways. Here s a simple model including
More informationDECISION AND SUMMARY OF RATIONALE
DECISION AND SUMMARY OF RATIONALE Indication under consideration Clinical evidence Clofarabine in the treatment of relapsed acute myeloid leukaemia (AML) The application was for clofarabine to remain in
More informationModelling spousal mortality dependence: evidence of heterogeneities and implications
1/23 Modelling spousal mortality dependence: evidence of heterogeneities and implications Yang Lu Scor and Aix-Marseille School of Economics Lyon, September 2015 2/23 INTRODUCTION 3/23 Motivation It has
More informationModeling the Claim Duration of Income Protection Insurance Policyholders Using Parametric Mixture Models
Modeling the Claim Duration of Income Protection Insurance Policyholders Using Parametric Mixture Models Abstract This paper considers the modeling of claim durations for existing claimants under income
More informationPackage dsmodellingclient
Package dsmodellingclient Maintainer Author Version 4.1.0 License GPL-3 August 20, 2015 Title DataSHIELD client site functions for statistical modelling DataSHIELD
More informationHealth Disparities in Multiple Myeloma. Kenneth R. Bridges, M.D. Senior Medical Director Onyx Pharmaceuticals, Inc.
Health Disparities in Multiple Myeloma Kenneth R. Bridges, M.D. Senior Medical Director Onyx Pharmaceuticals, Inc. Multiple Myeloma Overview Multiple myeloma (MM) is a type of blood cancer that develops
More informationTime Series Analysis with R - Part I. Walter Zucchini, Oleg Nenadić
Time Series Analysis with R - Part I Walter Zucchini, Oleg Nenadić Contents 1 Getting started 2 1.1 Downloading and Installing R.................... 2 1.2 Data Preparation and Import in R.................
More informationSupplementary Online Content
Supplementary Online Content Arterburn DE, Olsen MK, Smith VA, Livingston EH, Van Scoyoc L, Yancy WS, Jr. Association Between Bariatric Surgery and Long-Term Survival. JAMA. doi:10.1001/jama.2014.16968.
More informationA LONGITUDINAL AND SURVIVAL MODEL WITH HEALTH CARE USAGE FOR INSURED ELDERLY. Workshop
A LONGITUDINAL AND SURVIVAL MODEL WITH HEALTH CARE USAGE FOR INSURED ELDERLY Ramon Alemany Montserrat Guillén Xavier Piulachs Lozada Riskcenter - IREA Universitat de Barcelona http://www.ub.edu/riskcenter
More informationCenter Activity (07/01/2009-06/30/2010) Center Region United States Tables for More Information
Program Summary Center Activity (07/01/2009-06/30/2010) Center Region United States Tables for More Information Deceased donor transplants (n=number) 0 169 0 07C,08C,09C On waitlist at start (n) 0 0 0
More informationCancer Survival - How Long Do People Survive?
A research briefing paper by Macmillan Cancer Support Introduction Key findings 3 People with cancer are surviving longer 4 Median survival time has seen dramatic improvement for some cancers 5 Median
More informationSCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES
SCHOOL OF HEALTH AND HUMAN SCIENCES Using SPSS Topics addressed today: 1. Differences between groups 2. Graphing Use the s4data.sav file for the first part of this session. DON T FORGET TO RECODE YOUR
More informationSAS Enterprise Guide in Pharmaceutical Applications: Automated Analysis and Reporting Alex Dmitrienko, Ph.D., Eli Lilly and Company, Indianapolis, IN
Paper PH200 SAS Enterprise Guide in Pharmaceutical Applications: Automated Analysis and Reporting Alex Dmitrienko, Ph.D., Eli Lilly and Company, Indianapolis, IN ABSTRACT SAS Enterprise Guide is a member
More information