Download Cumulative Sum (CUSUM) Control Schemes

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

UniPro protocol stack wikipedia , lookup

HD-MAC wikipedia , lookup

Transcript
CUMULATIVE SUM (CUSUM) CONTROL SCHEMES
James M. Lucas
E. I. du Pont de Nemours and Co., Inc.
Wilmington, DE
the production of nonconforming product.
Because CUSUM procedures give an early
ABSTRACT
Cumulative
Sum
(CUSUM)
quality
indication of process changes,
control
sistent
schemes are becoming widely used in industry
because they are powerful, versatile, and easy
to use. They cumul ate recent process data to
quickly
They
detect
also
tool.
out-of-control
serve
as
a
19898
There are now more
than
they are con-
philosophy
encourages doing it right the first time.
use of CUSUM procedures
situations.
powerful
a management
with
;s
that
The
also consistent
with a management by exception philosophy as
the CUSUM will point out areas needing
diagnostic
attention.
10,000 CUSUM
control schemes in use daily ;n Du Pont. This
talk
describes
design
and
implementation
procedures for CUSUM control schemes with
emphasis on properties that are recorded as
counts.
The talk will describe recent developments which make CUSUM procedures more
useful and more powerful.
Cumulative sum control schemes are current ly used more for the control of vari ab 1es
than for the control of counts (attributes).
To help remedy this, we will give counts more
emphasis here. We will show that the design
and implementation procedure for Counted Data
CUSUMs
(Lucas
1985)
is
very
similar
to
the
The recent developments described are:
procedure for CUSUMs for variables (Lucas
1976). We will use a Poisson distribution in
o
Fast Initial Response (FIR) CUSUM which
our examples.
gives extra sensitivity to out-ofcontrol situations at start up or after
a (possibly ineffective) control action.
is administratively convenient to record the
number of counts in a given sampling interval. When the number of counts per interval
follows a Poisson distribution, the time
between counts follows an exponential distribution. A time-between-events CUSUM which we
discuss elsewhere (Lucas 1985) should be used
if it ;s convenient to update the CUSUM with
each new count, and it is possible to record
the time since the last count.
o
A
Combined
Shewhart-CUSUM
which
com-
bines the key features of CUSUM schemes
and Shewhart Schemes by adding Shewhart
Control Limits to a CUSUM scheme.
o
ROBUST CUSUM schemes whi ch are not
unduly influenced by a few outliers or
A Poisson CUSUM is used when it
fl iers occurring in the stream of data.
We discuss the use of CUSUMs for the
detection of increases and/or decreases in
count rate; we discuss both one and two-sided
The philosophy of continual improvement of
a process is very compatible with CUSUM procedures. As CUSUM procedures gi ve much more
responsive control, a CUSUM signal does not
mean that the process is producing bad product. Rather it means that action should be
taken so that the process does not produce bad
product.
CUSUMs.
I NTRODUCTI ON
Cumulative
sum
(CUSUM)
quality
While
the most widely used applica-
tion of counted data CUSUMs will probably be
the control of nonconformances, they can also
be used to monitor changes in count level in
other situations such as accident rate or the
rate of occurrence of congenital malformations. We use the terminology counts, rather
than nonconformances, as counts has a broader
applicability.
control
schemes for variables are widely used in
industry. CUSUM procedures will usually give
tighter process control than classical quality
control schemes, such as Shewhart schemes.
With the tighter control that is available
with CUSUM schemes, there will be more emphasis placed on keeping the process on-aim
rather than allowing it to drift in limits.
Because of the tighter control, an out-ofcontrol (Signal will seldom indicate that the
process is. produci ng nonconforming product;
rather an out-of-control Signal will indicate
that control action should be taken to prevent
The recognition that a CUSUM control
scheme is a sequence of Wald Sequential
Probability
Ratio
Tests
(SPRT)
allows
the
optimal ity properties of CUSUM procedures to
be developed.
Johnson and Leone
(1962)
9ave
an early discussion of CUSUM procedures using
the relationship between SPRT1s and CUSUM.
Lorden (1971) showed the asymptotic optimality
of CUSUM procedures for detect i ng a change in
distribution.
Kenett and Pollak (1983) showed
the superiority of a CUSUM scheme for detecting a rare event over a non-CUSUM scheme
proposed by Chen (1978).
916
Yi
Recent enhancements to CUSUM quality
control schemes have included the Fast Initial
Response (FIR) feature (Lucas and Crosier
1982A). The FIR feature recognizes that when
will
the initial SPRT to test a different null
hypothesis than the following SPRTs.
In
A Robust CUSUM (Lucas and Crosier, 19828)
Robust
CUSUM
counts
in
the
ith
use
a
positive
starting
value.
We
for the initial SPRT.
has been recommended when isolated outliers or
extreme values occur for reasons other than a
A
of
the process is off-aim at start-up.
For a
process running at the desired level, the head
start will soon !Izera!!, so it has little
effect. It is instructive to view the choice
of So in terms of the II all and lIa n errors of
the equivalent SPRT.
The a-error is the
probabi 1ity of declaring the process off-aim
when it is not.
The a-error is the probability of declaring the process on-aim when it
is not. When the CUSUM is cons i dered to be a
sequence of Wald SPRTs, one finds that the
SPRTs have small a and large a errors. Using
a head start value approximately equal to h/2
is equivalent to equating the a and a errors
between-event CUSUMs see Lucas (1985).
shift.
number
generally recommend a head start (SO) value
approximately equal to h/2. With such a head
start, the CUSUM will more quickly Signal if
process control work, it is more likely that
the process is not at the desired aim value at
start-up than after the process has been
The FIR
running smoothly for some time.
feature gives a simple procedure for more
quickly detecting an out-of-control situation
at start-up. If the process is initially in
control, the Fast Initial Response feature has
little effect while if the process starts out
in an out-of-control condition, an out-ofcontrol signal is given much more quickly. In
this paper, we demonstrate the benefits of the
FIR feature for a Poisson CUSUM.
For a
discussion of the FIR feature for a time-
process
the
A standard CUSUM will have starting value
So = 0, while a Fast Initial Response CUSUM
a CUSUM is started, it may be appropriate for
true
is
interval.
A Robust CUSUM (Lucas and Crosier, 19828)
is obtained by using the Utwo-in-a-row rule".
To robustify the CUSUM, an outl ier 1 imit is
specified. A single observation outside this
limit does not enter the CUSUM. However, two
outliers in a row are an out-of-control Signal.
will
quickly detect true changes in level that
occur in the process; yet it will be insensit i ve to the occurrence of an occas i on a 1
outlier or flyer. In this paper, we discuss a
Robust Poisson CUSUM.
CUSUM
evaluated
The properties of a combined ShewhartCUSUM control scheme are described by Lucas
(I982).
For variables, the combined scheme
has proven of most value for measurement
control.
In this application a Shewhart
signal can indicate a bad sample while a CUSUM
Signal often indicates a calibration problem.
length (ARL).
bution of run lengths (Brook and Evans 1972).
For a standard CUSUM, the ARL distribution is
nearly geometric except that there ;s a lower
probability of extremely short run lengths.
When the FIR feature is used, the distribution
is nearly geometric except that there is an
increased probability of short run lengths due
A cumulative sum control scheme cumulates
the difference between an observed value Vi
and a reference value k. If this cumulation
equals or exceeds the decision interval value
h, an out-of-control signal is given.
The
CUSUM statistics are:
max (0, Yi - k + SH(i-l))
SLi
max (0, k - Yi + SL(i-l))
The ARL is the average number
of samples taken before an out-of-control
signal is obtained. The ARL should be large
when the process is at its aim level and short
when the process shifts to an undesirable
level.
In-control
ARLs
are
often well
approximated by a geometric distribution;
hence, the ARL also characterizes the distri-
IMPLEMENTATION EXAMPLE
SHi
contro 1
schemes
are
usually
by calculating their average run
to the head start (Lucas and Crosier 1982A).
Table 1 illustrates the implementation of
a Poisson CUSUM with parameter values k=5 and
h=lO.
A CUSUM with and without a head start
value SO=5 is shown.
where max (a,b) is the maximum of a and b.
The first formula is used to detect an
increase in level, while the second formula is
used to detect a decrease in level.
A two
sided CUSUM uses both formulas Simultaneously.
a
would
work
We wi11 find that such
well
if
the
acceptable
count 1eve 1 was about 4 and it was des ired to
quickly detect if the count level increased to
7. I n such a case, the i n-contro 1 average run
1engt would be approximately 397 or 422 and
the out-of-control ARL would be 3.35 or 5.59
for the FIR CUSUM and the standard CUSUM,
For the control of variables, the Vi in
the above formulas is a standardized variable:
Vi =
CUSUM
respectively. In Table 1, the first column is
the observation number, i=1. •• 15. The second
column gives the observed number of counts,
Vi. The th i rd co 1umn records an i ntermediate calculation step.
It calculates Vi-k.
The fourth column calculates Si for a
Xi - AIM
S
where Xi is the observed sample average and
S is the standard deviation of X. For Counts
standard CUSUM (with SO=O) , and the fifth
column calculates Si for a FIR CUSUM with
917
SO=5.
In Table 1,
the first
AVERAGE RUN LENGTHS FOR
THE MOST WIDELY USED CUSUM PROCEDURES
ten observa-
tions are from a process at the desired mean
level while the last five are from a process
with a higher mean level. The process starts
out i nit i ally in contro 1, so the FI R feature
has little effect; the head start soon zeroes
out. The benefit of the FIR feature can be
Deviation From Aim
(Multiple of S)
seen by implementing the CUSUM using only the
last five observations.
This represents a
process that is out of control when the CUSUM
is started.
In this case, the CUSUM with the
FIR feature would give an out-af-control
signal at the first observation (Observation
11), while the standard CUSUM would not signal
until the fourth observation (Observation 14).
POISSON CUSUM
EXAMPLE
h=lO k=5
Y -k
S
No FIR
o
1
2
3
7
3
4
5
6
2
0
2
8
7
4
8
9
10
13
0
2
3
10
8
4
14
15
1l
1l
12
9
-2
2
-3
-5
-3
3
-1
-5
-3
-2
5
3
-1
4
6
0
0
2
0
0
0
3
2
0
0
0
FIR
5
3
5
2
0+
0
3
2
0
0
0
5
5
8
7
1l*
17*
8
7
1l*
17*
So
4
4
5
5
.5
.5
.5
.5
0.0
2.0
0.0
2.5
0
.5
168. 26.6
150 20.1
465. 38.0
432 28.7
1.
2.
4
8.4
5.3
10.4
6.4
3.3
2.0
4.0
2.4
1.7
1.0
2.0
1.2
should be selected to be close to:
This reference value is the same as the
reference value for an SPRT testing the null
hypotheSiS that the mean is ua and alternate
hypothes i s that the mean is ud. When kp 21, the kp value will usually be rounded to
the nearest integer. Then, the CUSUM computations require only integer arithmetic.
After k is selected, the decision interval
value (h)
*Out of Control Signal
+
k
We will give a more detailed discussion of
the design procedure for a Poisson CUSUM for
detecting an increase in count rate.
For a
Poisson CUSUM, the k value will be between the
accept ab 1e proces s mean (jJ a) and the mean
1eve 1 of counts ("d) whi ch the CUSUM scheme
is to detect quickly. While the desired value
(or goal value) for J.la is often zero, a ).Ia
value of zero requires that the CUSUM be
designed with h=l and k=O. Any occurrence of
a count gives a Signal; when count levels are
low and a decrease in count level indicates an
improved system, every count must followed
up. The circumstances surrounding every count
must be examined to find and remove assignable
causes.
In practice, ).I a is often chosen
near to the current mean 1eve 1; th i s represents
current
system
performance.
The
reference value for the Poisson CUSUM (k p )
TABLE 1
Y
h
is chosen using a table
look-up
procedure.
The value of h Should give an
appropriately large ARL when the counts are at
the
after the FIR CUSUM zeroes, the standard
acceptable
level.
It
should
also
be
chosen to give an appropriately small ARL
value when the process is running at the count
CUSUM and the FIR CUSUM are equivalent
level Which should be detected quickly.
The CUSUM parameter k is determined by the
a Poisson CUSUM
of counts is 4
7) is
("a = 4) and a count rate of 7 ("d
to be detected quickly. The Poisson CUSUM is
designed with a k value close to:
acceptable mean level (]Ja) and by the unacceptab 1e mean ("d) 1eve 1 whi ch the CUSUM
scheme is to detect quickly.
For variables
the k value is chosen half way between the
acceptable mean level and the unacceptable
mean level.
when
The CUSUM parameter h is the
("d-"a)/( In("d)-ln("a))
chosen by a table look up procedure to give an
acceptably long in-control ARL.
In practice
the parameter values h = 4 or 5 and k = .5 are
often used.
Consider the design of
the acceptable number
= (7-4)/(ln 7-ln 4)= 5.36
The FIR feature with S(O) = h/2
For ease in implementation using integer
arithmetic, a kp value of 5 is recommended.
usually should be used. With these parameters
you get the following ARL values for a two
With this kp
(Lucas 1985):
sided CUSUM procedure (Lucas and Crosier 1982):
918
value we find
for
the
ARL
THE FAST INITIAL RESPONSE FEATURE
AVERAGE RUN LENGTH
FOR THE POISSON CUSUM EXAMPLE
At start up, or following a possibly
ineffective control action, control of the
process is less certain. Extra sensitivity is
often des i red at these times.
The Fast
Initial Response feature for CUSUM control
schemes was developed to give this extra
sensitivity.
For a standard CUSUM scheme
SO, the initial CU5UM value, is set equal to
" (multiple of k p )
kp
hp
7
7
So
5
5
5
5
5
5
10
10
15
15
0
4
0
5
0
8
4(.8)
7(1.4)
108
95
422
397
3740
3630
4.09
2.37
5.59
3.35
8.09
4.36
zero.
The
FIR
CUSUM
uses
an
initial
CUSUM
value, SO, that is greater than zero.
Our
recommended starting value is h/2. With this
starting value,
a FIR CUSUM detects an
initially out-at-control situation about 40
percent faster than a standard CUSUM.
An appropriate Poisson CUSUM could be the
scheme with hp : 10, kp:5, SO=5.
TWO-SIDED CUSUMs
For the control
CUSUMs
The
of variables,
two-sided
are usually used even when
shifts
in
FIR features.
only one direction indicate a deterioration in
quality.
Different control actions may be
taken depending on which side of aim the CUSUM
scheme
is
usually
out-of-control
signal
indicate
need
the
on
for
recommended.
the
low
control
side
when
the
CUSUM
The CUSUM scheme with
the FIR
Shewhart charts provide rapid detection of
large deviations, while CUSUM schemes provide
rapid detection of small to moderate deviations.
The Shewhart-CUSUM scheme (Lucas,
1982) incorporates the good features of both.
The prime motive for Shewhart-CUSUM is to
obtain faster response to 1arge shifts with
only small changes in performance otherwise.
With the Shewhart-CUSUM scheme. Shewhart
Control Limits (SCL) are added to the CUSUM
scheme. An out-of-control signal is given if
the most recent sample ;s outside of the SCL
or if a regular CUSUM signal is given. Since
the control action is modified only for the
current observation, the combined scheme is
easy to implement.
The performance of the
combi ned Shewhart-CUSUM approach is comparab le
to that of more in olved modifications of
however.
for
seen
COMBINED SHEWHART-CUSUM CONTROL PROCEDURE
it is usually easier to obtain the properties
of a two-sided scheme by combining results for
two one-sided schemes.
When the parameter
values are such that one side gives an out-ofonly
be
to
For attributes, also we generally recommend two-sided control schemes where, for
example, a significant decrease in count rate
might trigger an investigation which could
lead to performance improvements. Properties
of a two-s i ded scheme may be evaluated
signal
can
An
increase the strength, while an out-af-control
control
CUSUM
is used both a 1arger i n-contro 1 ARL and a
quicker detection of an initial out-of-control
situation can be achieved.
would
action
and Cros i er. 1982A);
the FIR
The ARL tab les show that when the FIR feature
signal on the high side would indicate a need
to go into a detective mode to find why the
high strength was achieved. The investigation
could indicate ways to improve the product.
di rect ly (Lucas
of
feature usually should have a larger h value.
signals. For example, it may be important to
contro 1 the strength of a gi yen product
property. Even if there is no prob 1em when a
higher strength is achieved, a two-sided
control
value
from the ARL tables which we have given.
Compare ARLls for CUSUMls with and without the
the
other side has zeroed, the ARL for a two-sided
scheme can be obtained directly from the ARL
of two one-sided schemes. (Lucas and Crosier
CUSUMs such as Parabolic CUSUM (Lucas (1973)).
1982A. Lucas 1985) as:
-~
Des i gn
ARL (SOH. SOL)
easy.
a
a
Shewhart-CUSUM
CUSUM
is
scheme
designed
the
is
ARL
tables are examined. The Shewhart Limits are
included if they make a significant improvement in the ARL curve.
The combined scheme
should use the same k value and the same or
slightly larger h value to compensate for
decreases in in-control ARL.
We generally
recommend SCL of 3.5 or 4.0 rather than 3.0 as
the larger limits have less effect on the incontrol ARL. The following gives ARL values
LH(SOH) LL(O) + LH(O) LL(SOL) - LH(O) LL(O)
LH(O) + LdO)
where the subscript Hand L indicate parameter
values for the high and low side respectively,
and L *( SO*)
is the ARL for a one-s i ded
CUSUM scheme with head start SO* and L or H
may be substituted for
of
After
for h = 4 or 5 k = .5 CUSUM schemes.
ditional tables are given by Lucas (1982).
*
919
Ad-
AVERAGE RUN LENGTH
FOR A COMBINED SHEWHART-CUSUM
combined with a cluster distribution having
probability c.
If the cluster was sampled, a
large number of cou.nts (sufficient to trigger
a standard
Deviation From Aim
(Multiple of S1
CUSUM)
would
h
k
SCL
0
.5
1
2
4
4.
4.0
5.0
5.0
5.0
.5
.5
.5
.5
•5
3.0
3.5
3.0
3.5
4.0
125.
159.
223 •
391.
459 •
25.
26.
34.
37.
38.
8.0
8.3
9.8
10.2
10.4
3.0
3.2
3.5
3.8
4.0
1.2
1.3
1.2
1.3
1.6
We
to be 0, .001, and .01.
Continuing our design
example with h=10, k=5, 50=0,5 we find:
AVERAGE RUN LENGTHS FOR A ROBUST CUSUM
We often select a Shewhart-CUSUM scheme to
process control
observed.
row rule (with and without the FIR feature).
We considered the probability (c) of a cluster
Average Run Lengths
with 50=5(50=0)
contro 1 measurement processes (test 1aboratory
control)~ whereas we usually employ a standard
CUSUM to control production processes.
In
both cases we normally include the FIR
feature.
When the Shewhart-CUSUM is chosen
for measurement
be
compared the performance of a standard Poisson
CUSUM and a Poisson CUSUM using the two-in-a-
and a mean of:
Out 1 i er
Standard
a Shewhart
Cusum
signal often indicates a bad test sample while
a CUSUM signal indicates a calibration problem
or
instrument
offset.
If a measurement
process is prone to outliers, then a standard
Robust
Cusum
CUSUM or a Robust CUSUM scheme may be better.
Probabil ity
4
7
.000
.001
.010
397 (422)
281 (298)
77 (82)
3.35 (5.59)
3.34 (5.58)
3.28 (5.43)
.000
.001
.010
401 (425)
401 (425)
388 (412)
3.40 (5.72)
3.41 (5.73)
3.44 (5.78)
ROBUST POISSON CUSUM
More extens; ve compari son of the perforA Robust CUSUM
for
vari ab 1es
has
mance of a Robust CUSUM and a standard CU5UM
proven
valuable when isolated outliers or flyers
occur fairly often for reasons other than a
(Lucas and Crosier,
true process shift
1982B).
A
Robustification
procedure
for a representat; ve set of parameter values
are contained in (Lucas 1985).
for
variables is the IItwo-in-a-row rule ll in which
a single suspected outlier does not enter a
CUSUM, whi le two suspected outl iers in a row
are an out-of-control signal.
In evaluating
the properties of a Robust CUSUM for variables, we used the widely used contaminated
normal distribution (Andrews, et a1 1972; Chen
and Box 1979).
clusters occur (c = .001 or .01).
Using this distribution, we
CONCLUSION
We have discussed the design and use of
CUSUM procedures. We have shown that they are
simple to use, versatile in that they can be
specifically tailored to detect shifts in
level, and powerful in that they use all
i nformat; on in the data to qui ck 1y detect the
shift.
The design and implementation procedure for counts is essentially the same as
the procedure used
for
the
control
of
variables.
The FIR feature,
a combined
Shewhart-CUSUM scheme and robustification have
all proved to be valuable enhancements for
CUSUM control schemes.
It has been our experience that occasionally a cluster of nonconformances occurs in a
single sample and that it is sometimes valid
to ignore the occurrence of a single sample
having
a
cluster
of
nonconformances
(especially if this cluster could be due to
sampling or measuring instrument problems).
To model the Robust CUSUM, we assumed that the
underlying distribution was a mixture of a
distribution
(with
In these
cases, a Robust CUSUM gives a very large
percentage increase in the in-control ARLs and
on ly a Sma 11 percentage increase in the outof-control ARLs.
evaluated the performance of various CUSUM
schemes.
We demonstrated that the two-in-arow rule was a good robustification procedure
and showed that more complicated rules could
give little improvement. In this section, we
evaluate the properties of the two-in-a-row
rule for robustifying a Poisson CUSUM for
detecting an increase in count rate.
Poisson
The cost of
the robustification procedure can be seen in
the no outl ier case (c = 0). In these cases,
equivalent increases in in-control ARLs can be
achieved with less inflation of the out-ofcontrol ARLs by simply increasing the h
value.
The benefits of the robustification
procedures show up in the cases where outlier
probability I-c)
920
BI BLIOGRAPHY
Andrews, O. F., P. J. Bickel F. R. Hampel, P.
J. Huber, W. H. Rogers,
and J. W.
Tukey
(1972). Robust Estimates of Location: Survey
and Advances.
Brook,
D.
Approach
Lorden, G. (1971).
"Procedures for React ing
to a Change in Distribution ll , Ann. Math.
Stat., 42, 1897-1908.
Princeton University Press.
and
D.
A.
Evans
(1972).
Lucas, J. M. (1973).
"A Modified "V" Mask
Control Scheme," Technometrics, 15, 833-847.
"An
to the Probability Distribution of
CUSUM Run Lengths", Biometrika, 59, 539-549.
Chen, G.
and G. E.
P.
Box (1979).
Lucas, J. M. (1976). "The Design and Use of
Cumulative Sum Quality Control
Schemes",
Journal of Quality Technology, ~, 1-12.
"Implied
Assumptions for Some Proposed Robust Estimatorsll.
University of Wisconsin.
Math.
Research Center Technical Summary Report, 1979.
Chen, R.
Congenital
(1978).
Lucas, J. M. (1982). "Combined Shewhart-CUSUM
Quality Control Schemes II , Journal of Quality
Technology, li, 51-59.
A Surveillance System for
Malformations,
Journal
of
the
Lucas, J. M. (1985). "Counted Data CUSUMs",
Technometrics to appear, May 1985.
Ameri.can Statistical Association, 73. 323 327.
Johnson, N. L. and F. C. Leone (1962). "Cumulative Sum Control Charts - Mathematical
Principles Applied to their Construction and
Use"
(in three parts)
Industrial Quality
Control, June 15-21, July 29-36, AU9ust 22-28.
Lucas, J. M. and R. B. Crosier (1982A). "Fast
Initial Response (FIR) for Cumulative Sum
Quality Control Schemes". Technometrics, 24,
199-205.
-
Kenett,
R.
and
M.
Pollak
(1983).
"On
Sequential Detection of a Shift in the Probability of a Rare Event", Journal of the
Lucas, J. M. and R. B. Crosier (1982B).
"Robust CUSUM'I, Corrrnunications in Statistics,
Theor. Meth. 11 (23), 2669 2687.
American Statistical Association. 78, 389-395.
921