EXAM IN TMA4275 LIFETIME ANALYSIS Saturday 4 June 2005 Time: 09:00 13:00

Similar documents
SUMAN DUVVURU STAT 567 PROJECT REPORT

Statistics in Retail Finance. Chapter 6: Behavioural models

12.5: CHI-SQUARE GOODNESS OF FIT TESTS

Introduction. Survival Analysis. Censoring. Plan of Talk

Interpretation of Somers D under four simple models

Regression Modeling Strategies

Tips for surviving the analysis of survival data. Philip Twumasi-Ankrah, PhD

TMA4213/4215 Matematikk 4M/N Vår 2013

Applied Statistics. J. Blanchet and J. Wadsworth. Institute of Mathematics, Analysis, and Applications EPF Lausanne

Survival Analysis of Left Truncated Income Protection Insurance Data. [March 29, 2012]

Modeling the Claim Duration of Income Protection Insurance Policyholders Using Parametric Mixture Models

Generalized Linear Models

ATV - Lifetime Data Analysis

Modeling and Analysis of Call Center Arrival Data: A Bayesian Approach

Introduction to nonparametric regression: Least squares vs. Nearest neighbors

Survival analysis methods in Insurance Applications in car insurance contracts

Service courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics.

Multivariate Normal Distribution

Name of the module: Multivariate biostatistics and SPSS Number of module:

Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics

Survey, Statistics and Psychometrics Core Research Facility University of Nebraska-Lincoln. Log-Rank Test for More Than Two Groups

Statistics Graduate Courses

Comparison of resampling method applied to censored data

Master s Theory Exam Spring 2006

Tests for Two Survival Curves Using Cox s Proportional Hazards Model

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

171:290 Model Selection Lecture II: The Akaike Information Criterion

Lecture 15 Introduction to Survival Analysis

Classification Problems

7.1 The Hazard and Survival Functions

Nominal and ordinal logistic regression

SAS Software to Fit the Generalized Linear Model

Survival Analysis of Dental Implants. Abstracts

Modelling Cost of Maintenance Contract for Rail Infrastructure

LOGIT AND PROBIT ANALYSIS

Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not.

BayesX - Software for Bayesian Inference in Structured Additive Regression

Noncrash fire safety recall losses

BA 275 Review Problems - Week 6 (10/30/06-11/3/06) CD Lessons: 53, 54, 55, 56 Textbook: pp , ,

INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA)

Math 108 Exam 3 Solutions Spring 00

Statistical Analysis of Life Insurance Policy Termination and Survivorship

Pearson's Correlation Tests

Checking proportionality for Cox s regression model

Simple Linear Regression Inference

Ordinal Regression. Chapter

Chapter 13 Introduction to Nonlinear Regression( 非 線 性 迴 歸 )

Predicting Customer Default Times using Survival Analysis Methods in SAS

HYPOTHESIS TESTING: POWER OF THE TEST

Least Squares Estimation

Distance to Event vs. Propensity of Event A Survival Analysis vs. Logistic Regression Approach

Elements of statistics (MATH0487-1)

Fairfield Public Schools

CS 688 Pattern Recognition Lecture 4. Linear Models for Classification

Poisson Models for Count Data

LOGISTIC REGRESSION ANALYSIS

An Application of Weibull Analysis to Determine Failure Rates in Automotive Components

A Study to Predict No Show Probability for a Scheduled Appointment at Free Health Clinic

Introduction to General and Generalized Linear Models

Modeling Customer Lifetime Value Using Survival Analysis An Application in the Telecommunications Industry

Parametric Survival Models

Come scegliere un test statistico

CHAPTER 6: Continuous Uniform Distribution: 6.1. Definition: The density function of the continuous random variable X on the interval [A, B] is.

Multinomial and Ordinal Logistic Regression

Survival Analysis of the Patients Diagnosed with Non-Small Cell Lung Cancer Using SAS Enterprise Miner 13.1

Programme du parcours Clinical Epidemiology UMR 1. Methods in therapeutic evaluation A Dechartres/A Flahault

Server Load Prediction

STATISTICAL ANALYSIS OF SAFETY DATA IN LONG-TERM CLINICAL TRIALS

Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model

Chapter 3 RANDOM VARIATE GENERATION

Lecture 2 ESTIMATING THE SURVIVAL FUNCTION. One-sample nonparametric methods

Gamma Distribution Fitting

Regression III: Advanced Methods

EVALUATION OF PROBABILITY MODELS ON INSURANCE CLAIMS IN GHANA

Confidence Intervals for Exponential Reliability

An Application of the Cox Proportional Hazards Model to the Construction of Objective Vintages for Credit in Financial Institutions, Using PROC PHREG

Handling missing data in Stata a whirlwind tour

Automated Biosurveillance Data from England and Wales,

Regression with a Binary Dependent Variable

Data Mining and Data Warehousing. Henryk Maciejewski. Data Mining Predictive modelling: regression

STAT 360 Probability and Statistics. Fall 2012

WebFOCUS RStat. RStat. Predict the Future and Make Effective Decisions Today. WebFOCUS RStat

Christfried Webers. Canberra February June 2015

Personalized Predictive Medicine and Genomic Clinical Trials

Study Guide for the Final Exam

Portfolio Using Queuing Theory

List of Examples. Examples 319

Computer exercise 4 Poisson Regression

MEU. INSTITUTE OF HEALTH SCIENCES COURSE SYLLABUS. Biostatistics

13. Poisson Regression Analysis

Systematic Reviews and Meta-analyses

Research Methods & Experimental Design

Experimental Design. Power and Sample Size Determination. Proportions. Proportions. Confidence Interval for p. The Binomial Test

Statistical tests for SPSS

II. DISTRIBUTIONS distribution normal distribution. standard scores

Chapter 5 Analysis of variance SPSS Analysis of variance

Normality Testing in Excel

11. Analysis of Case-control Studies Logistic Regression

NEW YORK CITY COLLEGE OF TECHNOLOGY The City University of New York

Simple Regression Theory II 2010 Samuel L. Baker

Transcription:

Norges teknisk naturvitenskapelige universitet Institutt for matematiske fag Page 1 of 5 Contact during exam: Bo Lindqvist 975 89 418 EXAM IN TMA4275 LIFETIME ANALYSIS Saturday 4 June 2005 Time: 09:00 13:00 Aids: All printed and handwritten aids. All calculators permitted. Grading: 25 June 2005. Problem 1 A study has been conducted to find out how the lifetime of a particular brand of brake pads depends on the type of driving. Six cars of the same make and year were selected, and the type of driving for each of them was classified by the variable w with values 0 = mainly city driving, 1 = mixed driving, or 2 = mainly highway driving. The following data were obtained: y i d i w i x i1 x i2 22.7 1 0 0 0 41.0 0 1 1 0 54.2 1 2 0 1 59.8 0 0 0 0 62.4 1 1 1 0 73.1 0 2 0 1 Here y i (with unit 1000 km) is the observed brake pad lifetime and d i is the censoring status for car number i (d i = 1 means non-censored, d i = 0 means right censored). The covariate w i is the value of the variable w for type of driving. In the analysis we will represent w by the covariates x 1 and x 2, where x 1 = 1 if w = 1 and x 1 = 0 otherwise; and x 2 = 1 if w = 2 and x 2 = 0 otherwise. The values of x 1 and x 2 for each car are also given in the table for simplicity.

TMA4275 Lifetime Analysis Page 2 of 5 In the study one uses a Cox model for the data. a) Write down an expression for the hazard rate (failure rate) of the lifetime of a brake pad as a function of time t and the covariates x 1 and x 2. What are the underlying assumptions when a Cox model is used for this situation? Find an expression for the reliability function (survival function) of a brake pad of a car which is mainly used on highways. b) Calculate an expression for Cox partial likelihood for the given data. c) It turns out that the maximum value of the log partial likelihood is 3.67. Use this to compute the value of a test statistic for testing the null hypothesis that type of driving has no influence on the lifetime of the brake pads. What is the conclusion of this test? Use significance level 5%. Problem 2 One has recorded times of critical failures of a single compressor until τ = 650 days after start. It is assumed that repairs are minimal and take a negligible time. Thus we assume that failure times follow a non-homogeneous Poisson process with intensity function w(t) and cumulative intensity function W (t). The following data were observed (in days after start): i s i 1 120 2 347 3 420 4 512 5 595 a) How would you estimate the cumulative intensity W (t) when no parametric assumptions are made? Draw the estimated curve for W (t). Does the plot indicate a trend in the failure intensity? Find an approximate 95% confidence interval for the value of W (595). b) Perform the Laplace test to investigate whether there is a trend in the failure times. What is the conclusion if you use significance level 5%? What can the sign of this test statistic tell us about possible increasing or decreasing trend?

TMA4275 Lifetime Analysis Page 3 of 5 In the rest of the problem we assume that the ROCOF w(t) of the failure process is of so called log linear type, i.e. w(t) = e α+βt for t > 0, where α > 0 and < β < are unknown parameters. c) Show that W (t) = eα (e βt 1). β Show that the log likelihood function of the data can be written l(α, β) = 5α + 1994β eα (e 650β 1). β d) Calculate the profile log likelihood function l(β) of β. A graph of l(β) is given below. 29 29.5 30 30.5 31 31.5 32 32.5 0.004 0.002 0 0.002 0.004 0.006 0.008 0.01 beta

TMA4275 Lifetime Analysis Page 4 of 5 Use the graph (roughly) 1. to find the maximum likelihood estimate of β, 2. to find an approximate 95% confidence interval of β, 3. to test, with significance level 5%, the null hypothesis of no trend in the failure process. e) Suppose that the data were instead given in intervals only, i.e. as follows: Interval Number of failures (0, 100] 0 (100, 200] 1 (200, 300] 0 (300, 400] 1 (400, 500] 1 (500, 600] 2 (600, 650] 0 Find an expression for the log likelihood function in this case. Problem 3 Let Φ 0 (w) and φ 0 (w) be, respectively, the cumulative distribution function and the probability density function of a random variable W with possible values in (, ). Assume furthermore that φ 0 (w) > 0 for w (, ). a) Explain how Φ 0 (w) and φ 0 (w) are used to define a log-location-scale family of distributions for a lifetime T, indexed by parameters < µ < and σ > 0. In particular write down the cumulative distribution function and the probability density function for T, expressed in terms of Φ 0 (w), φ 0 (w), µ and σ. Let W have the standard logistic distribution, i.e. let Φ 0 (w) = ew for < w <. (1) 1 + ew Recall that the corresponding family of distributions for T is then called the log-logistic family. b) Show that the log-logistic family of distributions for T can be reparameterized with α > 0, γ > 0 so that the reliability function (survival function) can be written R T (t) = 1 for t > 0. 1 + γtα Explain how this can be used to construct a probability plot for checking whether a right censored data sample can be assumed to come from the log-logistic family.

TMA4275 Lifetime Analysis Page 5 of 5 c) Explain how W can be used to define a parametric regression model for lifetimes T with corresponding covariate vector x = (x 1,..., x k ). Give an expression for the reliability function of T with covariate vector x when Φ 0 (w) is the standard logistic distribution function given by (1).