Reliability and Risk Analysis. Analysis of Model Tasks from Selected Application Areas Using a Computer (software R, Excel)

Similar documents
Maximum Likelihood Estimation

Exact Confidence Intervals

MATH4427 Notebook 2 Spring MATH4427 Notebook Definitions and Examples Performance Measures for Estimators...

12.5: CHI-SQUARE GOODNESS OF FIT TESTS

Department of Mathematics, Indian Institute of Technology, Kharagpur Assignment 2-3, Probability and Statistics, March Due:-March 25, 2015.

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

Practice problems for Homework 11 - Point Estimation

Properties of Future Lifetime Distributions and Estimation

BA 275 Review Problems - Week 6 (10/30/06-11/3/06) CD Lessons: 53, 54, 55, 56 Textbook: pp , ,

SOCIETY OF ACTUARIES/CASUALTY ACTUARIAL SOCIETY EXAM C CONSTRUCTION AND EVALUATION OF ACTUARIAL MODELS EXAM C SAMPLE QUESTIONS

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013

A LOGNORMAL MODEL FOR INSURANCE CLAIMS DATA

Constrained Bayes and Empirical Bayes Estimator Applications in Insurance Pricing

EVALUATION OF PROBABILITY MODELS ON INSURANCE CLAIMS IN GHANA

Exam C, Fall 2006 PRELIMINARY ANSWER KEY

1 Prior Probability and Posterior Probability

Curriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools

Basics of Statistical Machine Learning

A revisit of the hierarchical insurance claims modeling

Aggregate Loss Models

Introduction to General and Generalized Linear Models

Dongfeng Li. Autumn 2010

Practice problems for Homework 12 - confidence intervals and hypothesis testing. Open the Homework Assignment 12 and solve the problems.

Likelihood Approaches for Trial Designs in Early Phase Oncology

Principle of Data Reduction

Nominal and ordinal logistic regression

How NOT to do a data analysis

Part 2: One-parameter models

Review #2. Statistics

Examining credit card consumption pattern


Comparing Multiple Proportions, Test of Independence and Goodness of Fit

Simple Linear Regression Inference

Chapter 19 The Chi-Square Test

Modeling Individual Claims for Motor Third Party Liability of Insurance Companies in Albania

Factors affecting online sales

Statistics in Retail Finance. Chapter 6: Behavioural models

Optimum allocation of weights to assets in a portfolio: the case of nominal annualisation versus effective annualisation of returns

Two-Sample T-Tests Assuming Equal Variance (Enter Means)

CHAPTER 6: Continuous Uniform Distribution: 6.1. Definition: The density function of the continuous random variable X on the interval [A, B] is.

Monte Carlo Simulation

BA 275 Review Problems - Week 5 (10/23/06-10/27/06) CD Lessons: 48, 49, 50, 51, 52 Textbook: pp

Least Squares Estimation

Statistical Machine Learning

How To Test For Significance On A Data Set

Chapter 3 RANDOM VARIATE GENERATION

Statistical Analysis of Life Insurance Policy Termination and Survivorship

Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing

Package cpm. July 28, 2015

A) B) C) D)

Detection of changes in variance using binary segmentation and optimal partitioning

Tests of Hypotheses Using Statistics

Is a Brownian motion skew?

Section 5.1 Continuous Random Variables: Introduction

Questions and Answers

Applied Statistics. J. Blanchet and J. Wadsworth. Institute of Mathematics, Analysis, and Applications EPF Lausanne

LOGNORMAL MODEL FOR STOCK PRICES

Math 108 Exam 3 Solutions Spring 00

Variations of Statistical Models

Non-Inferiority Tests for Two Means using Differences

Notes on the Negative Binomial Distribution

Calculating P-Values. Parkland College. Isela Guerra Parkland College. Recommended Citation

TEST 2 STUDY GUIDE. 1. Consider the data shown below.

Math 425 (Fall 08) Solutions Midterm 2 November 6, 2008

Hierarchical Insurance Claims Modeling

Experimental Design. Power and Sample Size Determination. Proportions. Proportions. Confidence Interval for p. The Binomial Test

Modelling the Scores of Premier League Football Matches

STT315 Chapter 4 Random Variables & Probability Distributions KM. Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables

3F3: Signal and Pattern Processing

Tests for Two Survival Curves Using Cox s Proportional Hazards Model

Using Stata for Categorical Data Analysis

Bivariate Statistics Session 2: Measuring Associations Chi-Square Test

Poisson Models for Count Data

GLM, insurance pricing & big data: paying attention to convergence issues.

Overview of Monte Carlo Simulation, Probability Review and Introduction to Matlab

Bayesian logistic betting strategy against probability forecasting. Akimichi Takemura, Univ. Tokyo. November 12, 2012

Joint Exam 1/P Sample Exam 1

Lecture 5 : The Poisson Distribution

2. Illustration of the Nikkei 225 option data

Introduction to Basic Reliability Statistics. James Wheeler, CMRP

Using the Delta Method to Construct Confidence Intervals for Predicted Probabilities, Rates, and Discrete Changes

Permutation Tests for Comparing Two Populations

2WB05 Simulation Lecture 8: Generating random variables

Module 2 Probability and Statistics

Is the Basis of the Stock Index Futures Markets Nonlinear?

Note on the EM Algorithm in Linear Regression Model

Chapter 7 Notes - Inference for Single Samples. You know already for a large sample, you can invoke the CLT so:

DERIVATIVE SECURITIES Lecture 2: Binomial Option Pricing and Call Options

Math 461 Fall 2006 Test 2 Solutions

APPLIED MATHEMATICS ADVANCED LEVEL

CREDIT SCORING MODEL APPLICATIONS:

Package EstCRM. July 13, 2015

Data Modeling & Analysis Techniques. Probability & Statistics. Manfred Huber

FISHER CAM-H300C-3F CAM-H650C-3F CAM-H1300C-3F

Testing a claim about a population mean

How To Calculate The Power Of A Cluster In Erlang (Orchestra)

How To Test Granger Causality Between Time Series

Stat 5102 Notes: Nonparametric Tests and. confidence interval

VISUALIZATION OF DENSITY FUNCTIONS WITH GEOGEBRA

Parametric Survival Models

Transcription:

Reliability and Risk Analysis Analysis of Model Tasks from Selected Application Areas Using a Computer (software R, Excel)

Task The aim of this study is to assess the wastewater treatment plant (WWTP) with given the presence of synthetic musk compounds in water passed this WWTP. The real sample consists of 60 fish from the carp family, specifically of the European chub (Leuciscus cephalus), which were caught in the Svratka River. Half of them were caught in front of (Group 1), and half of them behind (Group 2) the WWTP. Fish tissue samples (specifically in muscle) were analyzed, and two nitromusk compounds (musk ambrette (AMB), musk tibetene (TIB)), and two polycyclic musk compounds (phantolide (PH), traseolide (TR)) were explored. Fish of approximately the same age were chosen for the analysis. The measuring instrument had two limitations: the limit of detection LOD =x D and the limit of quantification LOQ=x Q, x D < x Q.

Task Data was left doubly time censored (the limit of detection and the limit quantification are in the role of censor time) and for each group we know the number of censored observations, ie the number observation n D prior the detection, the number of observations n Q in the interval (x D, x Q and the number of non-censored observations n 0 and their current values. Data can be found in the article Fusek, Michalek, Zouhar, Vavrova: Statistical Analysis of Musk Compounds Concentrations in Fish Tissue Based on Doubly Left-Censored Samples. DEED 2013.

Suppose that X 1,..., X n is the random sample from the exponential distribution with the parameter λ and the density f (x, λ) and the distribution function F (x, λ). X (1),..., X (n) is the ordered random sample. Double censoring wit the censors x D and x Q : n D is a frequency of censored in 0, x D, n Q is a frequency of censored in (x D, x Q, n O is a frequency of non-censored, bigger than x Q, X (n no +1),..., X (n). The vector (n D, n Q, n O ) Mu 3(n, θ D, θ Q, θ O ), n = n D + n Q + n O (the multinomial distribution), the marginal distributions are n D Bi(n, θ D ), n Q Bi(n, θ Q ), n O Bi(n, θ O ), where θ D = F (x D, λ), θ Q = F (x Q, λ) F (x D, λ), θ O = 1 F (x Q, λ).

The likelihood function is L(λ, n D, n Q, x (n no +1),..., x (n) ) = n! n D!n Q! [F (x D, λ)] n D [F (x Q, λ) F (x D, λ)] n Q The log-likelihood function is of the form ( ) n! l(λ, n D, n Q, x (n no +1),..., x (n) ) = log + n D log [F (x D, λ)] n D!n Q! n + n Q log [F (x Q, λ) F (x D, λ)] + j=n n O +1 n j=n n O +1 f (x j ). log [ f (x (j) ) ].

l λ = n D F λ(x D, λ) F (x D, λ) } {{ } H D (x D,λ) +n Q F λ(x Q, λ) F λ(x D, λ) F (x Q, λ) F (x D, λ) } {{ } H Q (x Q,λ) 2 l H D (x D, λ) H Q (x Q, λ) = n λ 2 D + n Q + λ λ λ + n j=n n O +1 n j=n n O +1 f λ(x (j) ) f (x (j) ) f λ(x (j) ) f (x (j) )

The exponential distribution with the distribution function F (x) = 1 exp ( λx) and the density f (x) = λ exp ( λx). The likelihood function is ( ) n! l(λ, n D, n Q, x (n no +1),..., x (n) ) = log + n D log [1 exp ( λx D )] n D!n Q! + n Q log [exp ( λx D ) exp ( λx Q )] + n O log (λ) λ n i=n n O +1 x (i) l λ x D exp ( λx D ) = n D 1 exp ( λx D ) + n Q n i=n n O +1 x (i) x Q exp ( λx Q ) x D exp ( λx D ) exp ( λx D ) exp ( λx Q ) + n O λ (1)

{ 2 l λ = n 2 D x D 2 exp ( λx D ) 1 exp ( λx D ) x } D 2 exp ( 2λx D ) [1 exp ( λx D )] 2 + n Q { x 2 D exp ( λx D ) x 2 Q exp ( λx Q ) exp ( λx D ) exp ( λx Q ) The empirical Fisher information is of the form [x Q exp ( λx Q ) x D exp ( λx D )] 2 [exp ( λx D ) exp ( λx Q )] 2 { J empir = 2 l x 2 λ = n D exp ( λx D ) 2 D 1 exp ( λx D ) + x } D 2 exp ( 2λx D ) [1 exp ( λx D )] 2 n Q { x 2 D exp ( λx D ) x 2 Q exp ( λx Q ) exp ( λx D ) exp ( λx Q ) [x Q exp ( λx Q ) x D exp ( λx D )] 2 [exp ( λx D ) exp ( λx Q )] 2 } n O λ 2 } + n O λ 2.

The theoretical Fisher information is E (n D ) = nθ D = n [1 exp ( λx D )], E (n Q ) = nθ Q = n [exp ( λx D ) exp ( λx Q )], E (n O ) = nθ O = n exp ( λx Q ). Jteor = E 2 l λ 2 = n x 2 D exp ( 2λx D ) [1 exp ( λx D )] + nx 2 Q exp ( λx Q ) + n [x Q exp ( λx Q ) x D exp ( λx D )] 2 [exp ( λx D ) exp ( λx Q )] + n λ 2 exp ( λx Q).

We will test the null hypothesish 0 : λ 1 λ 2 = 0 against the alternative H 1 : λ 1 λ 2 0, where λ 1 and λ 2 are parameters of the exponential distribution describing data in two locations. The test statistics T is T = λ 1 λ 2, (2) 2 σ 1 + σ 2 2 where σ 2 1 and σ 2 2 are estimated variances of λ 1 and λ 2.

The variance (asympt.) σ 2 k is σ 2 k = J 1 k, k = 1, 2, (3) where J k is the Fisher information and we can calculate is by the second derivatives of the log-likelihood function.

It was discovered that there is no significant difference between Group 1 and Group 2 in expected concentrations of Phantolide, Traseolide and Musk Ambrette. However, there is a difference between Group 1 and Group 2 in expected concentrations of Musk Tibeten. Compound H p-value PH 0 0.47 TR 0 0.58 AMB 0 1.00 TIB 1 0.00 Table: Comparison of the expected concentrations of musk compounds between Group 1 and Group 2. H = 0 (H = 1) denotes that the null hypothesis is not rejected (is rejected) on the significance level 0.05.