False-Alarm and Non-Detection Probabilities for On-line Quality Control via HMM



Similar documents
Nonparametric adaptive age replacement with a one-cycle criterion

How To Solve A Sequential Mca Problem

THE CENTRAL LIMIT THEOREM TORONTO

Mathematics Course 111: Algebra I Part IV: Vector Spaces

Monte Carlo and Empirical Methods for Stochastic Inference (MASM11/FMS091)

Properties of BMO functions whose reciprocals are also BMO

How To Find Out How To Calculate A Premeasure On A Set Of Two-Dimensional Algebra

Continued Fractions and the Euclidean Algorithm

Cycle transversals in bounded degree graphs

Universal Algorithm for Trading in Stock Market Based on the Method of Calibration

ON THE EXISTENCE AND LIMIT BEHAVIOR OF THE OPTIMAL BANDWIDTH FOR KERNEL DENSITY ESTIMATION

The Ergodic Theorem and randomness

How to Gamble If You Must

Section 6.4: Work. We illustrate with an example.

Monte Carlo-based statistical methods (MASM11/FMS091)

Average System Performance Evaluation using Markov Chain

Further Study on Strong Lagrangian Duality Property for Invex Programs via Penalty Functions 1

Metric Spaces. Chapter Metrics

ON LIMIT LAWS FOR CENTRAL ORDER STATISTICS UNDER POWER NORMALIZATION. E. I. Pancheva, A. Gacovska-Barandovska

Notes V General Equilibrium: Positive Theory. 1 Walrasian Equilibrium and Excess Demand

1 if 1 x 0 1 if 0 x 1

ROLLING HORIZON PROCEDURES FOR THE SOLUTION OF AN OPTIMAL REPLACEMENT

Definition: Suppose that two random variables, either continuous or discrete, X and Y have joint density

Some stability results of parameter identification in a jump diffusion model

Notes on metric spaces

Adaptive Search with Stochastic Acceptance Probabilities for Global Optimization

1. (First passage/hitting times/gambler s ruin problem:) Suppose that X has a discrete state space and let i be a fixed state. Let

Master s Theory Exam Spring 2006

MATH 10: Elementary Statistics and Probability Chapter 5: Continuous Random Variables

Some Research Problems in Uncertainty Theory

Irreducibility criteria for compositions and multiplicative convolutions of polynomials with integer coefficients

x a x 2 (1 + x 2 ) n.

About the inverse football pool problem for 9 games 1

The Expectation Maximization Algorithm A short tutorial

Testing predictive performance of binary choice models 1

LOGNORMAL MODEL FOR STOCK PRICES

The positive minimum degree game on sparse graphs

Optimal proportional reinsurance and dividend pay-out for insurance companies with switching reserves

Lectures 5-6: Taylor Series

On the mathematical theory of splitting and Russian roulette

BANACH AND HILBERT SPACE REVIEW

Inner Product Spaces

F. ABTAHI and M. ZARRIN. (Communicated by J. Goldstein)

Handout #1: Mathematical Reasoning

PUTNAM TRAINING POLYNOMIALS. Exercises 1. Find a polynomial with integral coefficients whose zeros include

RESULTANT AND DISCRIMINANT OF POLYNOMIALS

HT2015: SC4 Statistical Data Mining and Machine Learning

Discrete Convolution and the Discrete Fourier Transform

The Exponential Distribution

Metric Spaces Joseph Muscat 2003 (Last revised May 2009)

I. GROUPS: BASIC DEFINITIONS AND EXAMPLES

A Markovian Model for Investment Analysis in Advertising

Taylor and Maclaurin Series

On a comparison result for Markov processes

Mathematics for Econometrics, Fourth Edition

4. Expanding dynamical systems

How performance metrics depend on the traffic demand in large cellular networks

Monte Carlo-based statistical methods (MASM11/FMS091)

Math 370/408, Spring 2008 Prof. A.J. Hildebrand. Actuarial Exam Practice Problem Set 3 Solutions

Limits and Continuity

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 5 9/17/2008 RANDOM VARIABLES

1 Prior Probability and Posterior Probability

Some Problems of Second-Order Rational Difference Equations with Quadratic Terms

= = 3 4, Now assume that P (k) is true for some fixed k 2. This means that

THE FUNDAMENTAL THEOREM OF ARBITRAGE PRICING

9 More on differentiation

THE BANACH CONTRACTION PRINCIPLE. Contents

General Theory of Differential Equations Sections 2.8, , 4.1

An Introduction to Basic Statistics and Probability

Lecture 6: Discrete & Continuous Probability and Random Variables

LÉVY-DRIVEN PROCESSES IN BAYESIAN NONPARAMETRIC INFERENCE

Is a Brownian motion skew?

Hacking-proofness and Stability in a Model of Information Security Networks

Random graphs with a given degree sequence

INDISTINGUISHABILITY OF ABSOLUTELY CONTINUOUS AND SINGULAR DISTRIBUTIONS

THIS paper deals with a situation where a communication

Principle of Data Reduction

UNIT I: RANDOM VARIABLES PART- A -TWO MARKS

SHARP BOUNDS FOR THE SUM OF THE SQUARES OF THE DEGREES OF A GRAPH

Separation Properties for Locally Convex Cones

MATH10212 Linear Algebra. Systems of Linear Equations. Definition. An n-dimensional vector is a row or a column of n numbers (or letters): a 1.

Quasi-static evolution and congested transport

Detekce změn v autoregresních posloupnostech

Primality - Factorization

1. Prove that the empty set is a subset of every set.

Notes on Continuous Random Variables

Load Balancing and Switch Scheduling

Gambling and Data Compression

How To Prove The Dirichlet Unit Theorem

Online Appendix to Stochastic Imitative Game Dynamics with Committed Agents

Gambling Systems and Multiplication-Invariant Measures

MOP 2007 Black Group Integer Polynomials Yufei Zhao. Integer Polynomials. June 29, 2007 Yufei Zhao

ON GENERALIZED RELATIVE COMMUTATIVITY DEGREE OF A FINITE GROUP. A. K. Das and R. K. Nath

I. Pointwise convergence

Cycles and clique-minors in expanders

MATH 425, PRACTICE FINAL EXAM SOLUTIONS.

Mean Ramsey-Turán numbers

FACTORING POLYNOMIALS IN THE RING OF FORMAL POWER SERIES OVER Z

CITY UNIVERSITY LONDON. BEng Degree in Computer Systems Engineering Part II BSc Degree in Computer Systems Engineering Part III PART 2 EXAMINATION

Asymptotics for a discrete-time risk model with Gamma-like insurance risks. Pokfulam Road, Hong Kong

Transcription:

Int. Journal of Math. Analysis, Vol. 6, 2012, no. 24, 1153-1162 False-Alarm and Non-Detection Probabilities for On-line Quality Control via HMM C.C.Y. Dorea a,1, C.R. Gonçalves a, P.G. Medeiros b and W.B. Santos a a Universidade de Brasilia 70910-900 Brasilia-DF, Brazil b Universidade Federal do Rio Grande do Norte 59072-970 Natal-RN, Brazil Abstract On-line quality control during production calls for monitoring produced items according to some prescribed strategy. It is reasonable to assume the existence of system internal non-observable variables so that the carried out monitoring is only partially reliable. In this note, under the setting of a Hidden Markov Model (HMM) and assuming that the evolution of the internal state changes are governed by a two-state Markov chain, we derive estimates for false-alarm and non-detection malfunctioning probabilities. Kernel density methods are used to approximate the stable regime density and the stationary probabilities. As a side result, alternative monitoring strategies are proposed. Mathematics Subject Classification: 60J20, 60J10, 90B30 Keywords: Hidden Markov model; False-alarm; Kernel density methods 1. Introduction Limited by the cost function a sampling interval m is selected and classical on-line quality control during production adopts the procedure of monitoring the sequence of items, independently produced, by examining a single item at every m produced items. Based on the quality requirements and on the distribution of the examined variable a control region C is pre-specified. If the examined item satisfies the control limits, the process is said to be in 1 Corresponding author : changdorea@unb.br. Research partially supported by CNPq, CAPES, FAPDF and FINATEC.

1154 C.C.Y. Dorea, C.R. Gonçalves, P.G. Medeiros and W.B. Santos control and the production continues; otherwise, the process is declared out of control and the production is stopped for adjustments. After adjustments the production is resumed and it is in control again. At each stoppage a new production cycle is defined (see, for example, [4]). Suppose now the internal working status of the system is non-observable and may change from a good working condition (on control) to a deteriorating status (out of control). In [2] it is proposed a model where these changes are governed by a two-state Markov chain {θ n } n 0. When θ n = 0 the process is said to be in control at time n; and, if θ n = 1 the process is out of control. In their proposal, an observable random variable X n, related to characteristics of interest, is examined at every m produced items. It is assumed that X n has a Gaussian distribution, N (μ, σ 2 ), with μ = μ θn. And values of m as well as the parameter d of the control region C =(μ 0 dσ, μ 0 + dσ) were determined by considering a given cost function to be minimized. Here, in the framework of a Hidden Markov Model (HMM), we compute and propose estimation techniques for the false-alarm and non-detection probabilities. False-alarm occurs if the observed variable falls outside the control region but the non-observable internal system state is 0, a good working state. When the opposite occurs we have a non-detection situation. It is assumed that all the working (good) states are lumped together as state 0 and the deteriorating states are gathered as state 1. A Markov chain {θ n } n 0 describes the evolution of the state of the production system. Associated with this chain we observe a sequence of conditionally independent random variables {X n } n 1, with the distribution of each X n depending on the corresponding state θ n. This process {θ n,x n } is generally referred to as a HMM. More specifically, we have P (X n+1 A X 1,...,X n,θ 0,...,θ n )=P (X n+1 A θ n ) (1) and n P (X 1 A 1,...,X n A n θ 1,...,θ n )= P (X j A j θ j ). (2) j=1 In section 2, we treat the special case where the transition matrix of {θ n } as well as the conditional densities of X n given θ n are known, but not necessarily Gaussian densities. For any initial distribution of θ 0, Proposition 1 and Proposition 2 give the false-alarm and the non-detection probabilities. As a side result, upper bounds for the sampling interval m are proposed. In section 3, we assume that the transition probabilities of the hidden chain {θ n } are unknown. Based on the observable sample X 1,...,X n kernel density methods are used to approximate the stable regime density. Theorem

HMM for on-line quality control analysis 1155 1 shows that the limiting stable density is a mixture of densities, which in the particular case treated in [2] reduces to NORMIX (mixture of normal densities). Corollary 1 provide an estimate for the stationary distribution of {θ n } and leads to an alternative monitoring strategy that takes into account the false-alarm situations. And the estimates for false-alarm and non-detection probabilities can be found in Corollary 2. 2. False-alarm and Non-detection For the HMM process {θ n,x n } assume that the chain {θ n } has known transition probabilities given by ( ) 1 p p P = ɛ 1 ɛ 0 <p<1, ɛ > 0 and p + ɛ<1. (3) The conditional distribution of X n given θ n are also known and based on this distribution a control region C is pre-selected. It is assumed that P (X n A θ n = i) = f(x i)dx, 0 <q 0 = f(x 0)dx < 1 and 0 <q 1 = C c f(x 1)dx < 1. C c (4) If the sampling interval m = 1, the on-line quality monitoring adopts the following strategy : items X 1,X 2,... are inspected and verified whether X 1 C, X 2 C,...; maintenance is required at time n if X 1 C,...,X n 1 C and X n / C. Thus we can define the alert times by τ X = inf{k : k 1,X k / C}. False-alarm occurs at time k if τ X = k but the non-observable internal system state is 0, a good working state. Let τ θ be defined as the first time, after time 0, the system reaches state 1, A τ θ = inf{k : k 1,θ k =1}. (5) Then false-alarm and non-detection correspond, respectively, to the events (τ X < τ θ ) and (τ X > τ θ ). Let P ν denote the probability when the initial

1156 C.C.Y. Dorea, C.R. Gonçalves, P.G. Medeiros and W.B. Santos distribution of θ 0 is ν. Similarly, let P 0 and P 1 denote the probability when θ 0 = 0 and θ 0 = 1 respectively. Proposition 1. Let ν be the initial distribution of θ 0. Then the false-alarm probability is given by (1 p)q 0 P ν (τ X <τ θ )=ν(0) 1 (1 p)(1 q 0 ) + ν(1) ɛq 0 (6) 1 (1 p)(1 q 0 ) and the non-detection probability is given by p(1 q 1 ) P ν (τ X >τ θ )=ν(0) 1 (1 p)(1 q 0 ) + ν(1)[q 0(1 ɛ)+(1 q 0 )p](1 q 1 ). 1 (1 p)(1 q 0 ) (7) Proof. (i) For θ 0 = 0 we have from (3) and (4) P 0 (τ X =1,τ θ > 1) = P 0 (θ 1 =0,X 1 / C) =(1 p)q 0 and from (1) and (2) we have for k 2 P 0 (τ X = k, τ θ >k) = P 0 (X 1 C,,X k 1 C, X k / C, θ 1 = = θ k =0) = k 1 P 0 (θ 1 = = θ k = 0)[ P (X j C θ j = 0)]P (X k / C θ k =0) It follows that j=1 = (1 p)[(1 p)(1 q 0 )] k 1 q 0. P 0 (τ X <τ θ )= Similarly, if θ 0 = 1 we have and for k 2 (1 p)q 0 1 (1 p)(1 q 0 ). P 1 (τ X =1,τ θ > 1) = P (θ 1 =0 θ 0 =1)P (X 1 / C θ 1 =0)=ɛq 0 And (6) follows. (ii) To prove (7) note that and for k 2 P 1 (τ X = k, τ θ >k)=ɛ[(1 p)(1 q 0 )] k 1 q 0. P 0 (τ X = τ θ =1)=P (θ 1 =1 θ 0 =0)P (X 1 / C θ 1 =1)=pq 1 P 0 (τ X = τ θ = k) = [(1 p)(1 q 0 )] k 1 pq 1.

HMM for on-line quality control analysis 1157 Moreover, P 1 (τ X = τ θ =1)=(1 ɛ)q 1 and for k 2 P 1 (τ X = τ θ = k) =ɛ(1 q 0 )[(1 p)(1 q 0 )] k 2 pq 1. It follows that pq 1 P ν (τ X = τ θ )=ν(0) 1 (1 p)(1 q 0 ) + ν(1)[q 0(1 ɛ)+(1 q 0 )p]q 1. 1 (1 p)(1 q 0 ) This along with (6) give us (7). Now, assume that a single item is inspected at every m items produced. Then for m 2 we can define and and τ (m) X = inf{km : k 1,X km / C}. (8) Using the same type of arguments as above we get P 0 (τ (m) X = km, τ θ >km) = (1 p) km (1 q 0 ) k 1 q 0 = (1 p) m [(1 p) m (1 q 0 )] k 1 q 0 P 1 (τ (m) X = km, τ θ >km)=ɛ(1 p) m 1 [(1 p) m (1 q 0 )] k 1 q 0. Moreover, P 0 (τ (m) X = τ θ = km) = [(1 p) m (1 q 0 )] k 1 (1 p) m 1 pq 1 P 1 (τ (m) X = τ θ = km) = [(1 p) m (1 q 0 )] k 1 ɛ(1 p) m 2 pq 1. Proposition 2. Let ν be the initial distribution of θ 0. Assume that the monitoring strategy has sampling interval m. Then the false-alarm probability is given by P ν (τ (m) X <τ (1 p) m q 0 θ)=ν(0) 1 (1 p) m (1 q 0 ) + ν(1) ɛ(1 p) m 1 q 0 1 (1 p) m (1 q 0 ) (9)

1158 C.C.Y. Dorea, C.R. Gonçalves, P.G. Medeiros and W.B. Santos and for m 2 the non-detection probability is given by P ν (τ (m) X >τ θ) = ν(0) 1 (1 p)m 1 +(1 p) m 1 p(1 q 1 ) 1 (1 p) m (1 q 0 ) + ν(1) 1 (1 p)m (1 q 0 ) ɛ(1 p) m 2 [(1 p)q 0 + pq 1 ]. 1 (1 p) m (1 q 0 ) (10) Next, we propose upper bounds for the sampling interval m. Wemay assume that the process starts at good working conditions (θ 0 =0,X 0 C). Our aim is to detect whether the chain {θ n } has changed to state 1. A trivial upper bound is given by the mean changing time from state 0 to state 1. From (3) and (5) we have m kp(τ θ = k θ 0 =0)= k(1 p) k 1 p = 1 p. k 1 k 1 Since {θ n } is non-observable, the designed monitoring strategy relies on the observable process {X n }. Taking this into account one should choose m smaller so that in average we will detect the first time the process {θ n,x n } reaches the alert zone. Alert occurs at time k if either X k C c or θ k =1. And this can be expressed by τ A = inf{k : k 1, (θ k,x k ) A}. where A =({0} C c ) ({1} C) ({1} C c ). Since P ((θ k+1,x k+1 ) A c (θ k,x k ) A c )=(1 p)(1 q 0 ), the expected first hitting time in alert zone is given by m E(τ A θ 0 =0,X 0 C) = 3. Estimation Results 1 1 (1 p)(1 q 0 ). In this section we assume that the hidden Markov chain {θ n } has transition matrix P of the form (3), but unknown. As for the process {X n }, we assume that the conditional densities are known and satisfy condition (4). Note that since all entries of P are strictly positive, {θ n } is an ergodic chain and the stationary (limiting) distribution exists, πp = π, π(0) = ɛ p + ɛ and π(1) = p p + ɛ. (11)

HMM for on-line quality control analysis 1159 Define the mixture density function f(x) = π(0)f(x 0) + π(1)f(x 1). (12) Then, in some sense, f( ) represents the density of {X n } when the process reaches some stable regime. Mixture models have been well recognized as useful in many practical applications. It is a popular model-based approach to dealing with data in the presence of population heterogeneity, in the sense that, data consist of unlabelled observations, each of which is thought to belong to one of the distinct classes. For a comprehensive list of applications and literature survey in this area see, for example, [3]. Though we are assuming known conditional densities, the stable regime density (12) may indicate which type of distribution one should assume for the variables X n as well as which control region C one should select. Results from [1] show that, based on the observable sample X 1,...,X n, kernel density methods can be successfully used to estimate the stable regime density. For a probability density K on R define ˆf n (x) = 1 nh n k=1 K( X k x ),h= h n 0,nh n as n. (13) h Typically one takes K( ) either a Gaussian density or an uniform density centered at x. Using these results we have Theorem 1 which allows us to make inference on P. Its proof will be postponed to the end of this section. Theorem 1. Assume that the process {θ n,x n } satisfies (3) and (4) and let ν be any initial distribution of θ 0. Then given δ>0 there exist c 1 = c 1 (δ) > 0 and c 2 = c 2 (δ) > 0 such that P ν ( ˆf n (y) f(y) dy δ) c 1 exp{ c 2 n} (14) and almost surely (a.s.) lim n ˆf n (y) f(y) dy = 0 a.s. (15) Next, we estimate the stationary probabilities (11). Choose x so that f(x 0) f(x 1) and define ˆf n (x ) f(x 1) ˆπ n (0) = f(x 0) f(x 1) and ˆπ n(1) = 1 ˆπ n (0). (16)

1160 C.C.Y. Dorea, C.R. Gonçalves, P.G. Medeiros and W.B. Santos From (15) it follows that, Corollary 1. For any initial distribution of θ 0 we have with probability 1, ˆπ n (0) π(0) and ˆπ n (1) π(1) as n. (17) This result suggests an alternative on-line control procedure that will take into account the false-alarm situations. Let ˆf n (m) (x )= 1 n K( X km x ). nh h Then, from (15) we have lim n k=1 (m) ˆf n (x ) f(x ) a.s. (m) Thus, with ˆf n (x ) in place of ˆf n (x ) in (16) we get (17). It follows that the posterior distribution P (θ nm =0 X m,...,x nm ) can be approximated by ˆf n ˆP (m) (x ) f(x 1) f(x 0) (θ nm =0 X m,...,x nm )= f(x 0) f(x 1) ˆf n (m) (x ). Monitoring Strategy. Assume that the conditional densities f( 0) and f( 1) are both known. Suppose that the control region C and the sampling interval m have been pre-selected. Let τ (m) X be defined by (8) and let 0 <η<1 be a given significance level. Then intervention is carried out at time nm if τ (m) X = nm and ˆP (θnm =0 Xm,...,Xnm) <η. From (16) we can also estimate false-alarm and non-detection probabilities. Observe that, in long-run, the false-alarm can be computed as P (θ km =0 X km / C) = P (X km / C θ km =0)P (θ km =0). P (X km / C) But P (X km / C θ km =0)=q 0 and by (11) we have lim P (θ km =0)=π(0). n From Theorem 1 with f( ) given by (12) we have lim P (X km / C) = f(x)dx. n C c Similarly, we have for non-detection P (θ km =1 X km C) = P (X km C θ km =1)P (θ km =1). P (X km C)

HMM for on-line quality control analysis 1161 These results can be summarized by: Corollary 2. For any initial distribution of θ 0 we have with probability 1, q 0ˆπ n (0) ˆf C c n (x)dx P (false-alarm) and (1 q 1)ˆπ n (1) C ˆf P (non-detection). (18) n (x)dx Proof of Theorem 1. Enough to verify that the hypotheses of Theorem 2 and Corollary 1 from [1] are satisfied. First, we shown that {θ n } is is uniformly ergodic, that is, there exists 0 <ρ<1 such that From (3) we have, ɛ P n = p + ɛ P n ij π(j) ρ n i, j =0, 1. (19) + p(1 p ɛ)n p + ɛ ɛ p ɛ)n ɛ(1 p + ɛ p + ɛ p p ɛ)n p(1 p + ɛ p + ɛ p p ɛ)n + ɛ(1 p + ɛ p + ɛ and from (11) we get (19) by taking ρ =1 p ɛ. Next, we show that {θ n } satisfies the φ-mixing condition : A F k and B Fk+n P π (A B) P π (A)P π (B) φ(n)p π (A) (20) with n 1 φ(n) < and σ-algebras defined by F n = σ(θ 0,...,θ n ) and F n = σ(θ n,θ n+1,...). Note that for A =(θ 0 = a 0,...,θ k = a k ), B =(θ k+n = b k+n,...) and P π (A) > 0 we have and P π (B A) =P π (B θ k = a k )=P n a k b k+n P π (θ k+n+1 = b k+n+1,... θ k+n = b k+n ) It follows that P π (B) =π(b k+n )P π (θ k+n+1 = b k+n+1,... θ k+n = b k+n ). P π (B A) P π (B) P n a k b k+n π(b k+n ) ρ n = φ(n) and (20) is satisfied. References :

1162 C.C.Y. Dorea, C.R. Gonçalves, P.G. Medeiros and W.B. Santos [1] Dorea, C.C.Y. and Zhao, L.C. (2002) - Nonparametric density estimation in Hidden Markov Models, Statis. Inference for Stoch. Processes, vol. 5, 55-64. [2] Ho, L.L.; Medeiros, P.G. and Borges, W. (2007) - An alternative model for on-line quality monitoring for variables, Intern. Jour. of Production Economics, vol. 107, 202-222. [3] McLachlan, G.L. and Peel, D. (2000) - Finite Mixture Models,John Wiley, New York. [4] Taguchi, G.; Elsayed, E.D. and Hsiang, T. (1989) - Quality Engineering in Production Systems, McGraw-Hill, New York. Received: November, 2011