Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Predicting Equipment Failures Using Weibu11 Analysis and SAS* Software Bruce Kay, Federal Express Corporation Douglas E. Price, American Airlines ABSTRACT Predicting engine failures is a two-step process. First, the parameters must be estimated. Then the estimates are used to Ball bearings, aircraft engines and missile guidance systems have one thing in common: they are all subject to unexpected mechanical failure. Practically every industry is concerned with increased cost~ and loss of productivity and customer satisfaction associated with equipment failure. Although breakdowns are inevitable, it is possible to predict when a failure is likely to occur using statistical techniques.' predict subsequent failures. We shall begin with a brief discussion of the theory behind Weibull analysis. Next, we shall describe a step-by-step procedure for estimating the Weibull parameters. Then we shall consider the unique circumstances surrounding predicting engine failures and some typical problems that might arise during an analysis. We shall conclude with a description of how the analysis might be used This paper will investigate a method for estimating failure parameters and how to use these parameters to predict equipment failure. to forecast engine failures. BACKGROUND In particular, it will examine Weibull analysis, a statistical technique of wide applicability. Using this technique in Although there are a variety of techniques for analyzing failure data, three conjunction with historical data, we can predict failures more accurately and minimize their impact. Our discussion will center on predicting aircraft engine failures; however, requirements are common to each: an unambiguously defined time origin, a scale for measuring the passage of time, and a clear meaning for the term failure. this technique applies to a wide range of applications. Provided these three ingredients are present, the technique to be described applies equally well to failures of ball bearings, aircraft HITRODUCTION engines or missile guidance systems. Although the Weibull distribution contains three parameters in its most general form, it Federal Express Corporation specializes in the door-to-door express delivery of packages and documents throughout the United States is commonly expressed as a two-parameter distribution. The frequently omitted third parameter is called the location parameter and to many foreign countries. The company operates an extensive fleet of aircraft that serves sorting facilities in the United because it serves as a time offset for products where failure cannot occur until at States and Europe. Reliability of its aircraft is a key ingredient in the Federal or after the value of the location Express commitment to service. Unfortunately, aircraft or equipment of any parameter. Since most products can normally fail at any time ~fter time zero, the kind is subject to unexpected breakdown. Since Federal Express offers a money-back two-parameter Weibull distribution is the one which we shall discuss here. failures Can be especially costly. If a random variable t has a Weibull distribution, then the Weibull probability density function (pdf) is: guarantee for service failures, equipment Because equipment failures cannot be eliminated entirely, our objective is'to reduce the uncertainty as much as possible. f(t) One method used to predict equipment failures is based on the Weibull distribution. Waloddi Weibull (1951) introduced "A Statistical Distribution Function of Wide Applicability" to model a variety of product life and reliability problems. The widespread use of the Weibull distribution is due to the great range of shapes of the distribution which enables it to fit many types of data. Our interest centers on using the Weibull distribution to predict aircraft engine failures. = (y/ay)ty-1exp[-(t/a)y], where t > O. The parameter y is the shape or Weibull slope. The parameter a is the scale parameter; it is also called the "characteristic life since it is always the ll value of t where the cumulative probability of failure is 63.2 percent (i.e., it is the 63.2 percentile of the cumulative distribution function). Both parameters must be positive. "( is unitless while expressed in the same units as t. 0: is However, the technique that will be described is equally applicable to The Weibull cumulative distribution function (CDF) describes the cumulative probability of numerous other problems. 219 3. 4. failure of an item at a given point in time. The reliability function, which is a term more familiar to some, is related to the COF as the difference between unity and the CDF. The cumulative distribution function is: F(t) = The first step in the estimation of Weibull parameters is to define precisely the time origin for the analysis and the meaning of the tenns "timet! and IIfailure. For aircraft engines the time origin is marked by the installation of the engine on the aircraft. The passage of time is measured in flight hours of operation, and failure is the inability of an engine to perform to standards. 1 - exp[-(t/a)Y], 1I and the reliability function is: R(t) = exp[-(t/a)Y]. Additional information on theWeibull distribution is available in Kalbfleisch and Prentice (1980) and Nelson (1982). After defining the scope, the next step is to obtain data on each engine in the analysis. Each observation is identified as either a failure or censored, where the latter term refers to engines which had not failed prior to the conclusion of the sampling period. This type of censoring is often called Type I or time censoring (Kalbfleisch and Prentice, 1980) • One feature which explains the Weibull distribution's wide applicability is that several other statistical distributions are either related to, or can be approximated by, the Weibull distribution. For example, when r equals 1, the distribution is exponential; when Y equals 2, it is the Rayleigh distribution. For values of y near 3.4, the shape of the Weibull distribution is approximately that of the normal distribution; for r greater than 10, the shape of the Weibull is nearly that of the smallest extreme value distribution. The third step is to plot the failure data to verify that they conform to a Weibull distribution. Computing the plotting positions for the failures involves approximating the true values of the Weibull cumulative distribution function. Nelson (1982) presents a variety of methods for computing both probability and hazard plotting positions. Regardless of the technique employed, the data must be plotted. Only the failures are plotted although the censored data define the relative plotting position. The LIFETEST procedure eliminates most of the drudgery of transforming and plotting the data. In addition to providing graphical representations of the data, PROC LIFETEST can be used to compute nonparametric estimates of the survival distribution. A probability plot of data on our Pratt and Whitney JTBD-7 engine appears in Figure 1. The plotted points should lie in a relatively straight line if the data conform to a Weibull distribution. Once this requirement has been satisfied, the Weibull parameters may be estimated. The flexibility of the Weibull distribution explains why it applies to so many different types of data. Nelson (1982) provides numerous examples of Weibull analysis applied to batteries, alarm clocks, generator field windings, circuit breakers and pistons. Mann (1968) mentions applications of the Weibull distribution to the incidence of droughts, the analysis of water droplet size and thrust levels of rocket engine systems. Of particular importance here is the applicability of the Weibull distribution to the analysis of aircraft engine failures. Three articles provide the foundation for the present study. Pratt and Whitney Aircraft (1967) presents an empirical discussion of estimating Weibull parameters. Pratt and Whitney (1980) describes the results of an extensive study to improve engine reliability and reduce operating costs. The third article, Cavoli and Rokicki (1981), summarizes how to use the estimates to predict engine failures. Much of this paper is based on improvements to ideas presented in those articles and advantages provided by the SAS* system for such an analysis. There are two generally accepted methods for estimating the Weibull parameters: ordinary least squares regression and maximum likelihood estimation. Fortunately, the SAS system can accommodate either technique. We have compared the results from PROC REG with those from PROC LIFEREG and have found that the latter procedure provides more consistent results. The LIFEREG procedure fits parametric models to failure-time data that may be time censored. The parameters are estimated by maximum likelihood using a ridge-stabilized Newton-Raphson algorithm. The procedure iteratively solves for the parameters until they reach a specified converge,nce criteri on. For further detai 1s, consult ·Nel son (1982). METHODOLOGY Now that we have discussed the theoretical underpinnings of a Weibull analysis, let us turn our attention to a step-by-step description of the methodology. There is a four-step procedure to the estimation of the Weibull parameters. 1. 2. Plot the data. Estimate the parameters. Define the scope. Collect the data. 220 ANALYSIS the planner and are not true failures. Instead, convenience is another example of time censoring. Now that the reader has a general understanding of the technique, we would like to take a closer look at the analysis of our JT8D-7 engines. Figure 2 shows the data as they were originally collected and plotted. It represents three years of failure data: there are 104 failures and 126 time censored values. The data as they are plotted obviously do not fit a Weibull distribution We have chosen to focus on premature and foreign object damage failures because they represent over three-fourths of our engine removals and are very costly in terms of our service commitment. Since life-limited failures and convenience are both scheduled events, they have considerably less impact on our system. We have computed the results for premature and foreign object damage failures' separately. The results are shown in Figures 3 and 4. because one observation has severely distorted the linearity of the graph. The SAS statistics guide cautions that extreme observations can unduly influence the fitted model. The results of our empirical studies for both ordinary least squares and maximum Before turning to the prediction of engine failures, one other topic deserves a likelihood indicate that if the number of outliers is small, the best strategy is to eliminate them. Considerable improvement is obtained by removing only one extreme observation (See Figure 1). comment. Since the lIFEREG procedure uses the maximum likelihood technique, it is possible that the model will not converge. It has been our experience that nonconvergence can he a problem when sample sizes are small. In those rare instances, an If there is still a problem with the fit, there are a variety of goodness-of-fit tests for the Weibull distribution (Stephens, 1974, 1977l. In addition, SAS Communications* (1987) provides details and appropriate code to accompl ish this task using the Anderson-Darling statistic. Since our model is intended to be used by someone with little statistical expertise~ estimate of the Weibull parameters can be obtained using the Benard and Bos-Levenbach formula for median plotting positions (Nelson, 1982) and the REG procedure. However, the results should be interpreted with caution. PREDICTION we have chosen not to include a goodness-of-fit test. Although plots do have limitations, they are more easily understood by the 1ayman, and they are extremely useful in detecting outliers. To model future engine failures, we use the Weibull parameters in a simulation. The six-step procedure is diagrammed in Fi gure 5. The analysis of failure data for a mechanism as complex as a jet aircraft. engine is considerably more complicated than we have presented thus far. Aircraft engines actually fit a mixed distribution model. For the sake of simplicity, we shall develop our 1. Increment the time variable. 2. Check for a life-limited failure. 3. Calculate the conditional probability. own taxonomy for engine failure. 6. Repeat the process. 4. Generate a random number. S. Compare the conditional probability and random number. Engine Failure Premature life1imited Foreign Object Damage The procedure begins with four inputs for each engine type: the Weibull parameters, the current engine times, the time until the next scheduled life-limited removal, and an estimate of the average utilization. We have chosen to increment the time variable on a monthly basis to reduce the amount of Conveni eoce computer time required. Our interest will center on predicting premature failures which result from a part or parts wearing out. We shall also consider fails, it time, the continues fail, the foreign object damage (FOD); this occurs when a bird or other object is drawn into the engine causing sudden or catastrophic failure. Life-limited failures are a category of parts or modules of an engine which have a mandated replacement time. life-limited failures are easy to forecast since it is simply a matter of tracking the life-limited parts. The final category is convenience. Each engine is treated separately during the course of the simulation. After the monthly utilization is added to the current engine time, it is checked for a life-limited failure. If it is replaced by an engine with zero event is recorded, and a new engine the simulation. If it does not engine is checked for a premature or foreign object damage failure. The first step in identifying a random failure is to calculate the conditional probability of the event. Given that an engine has reached time t1, the probability it will reach t2 can be calculated using These are at the discretion of 221 the following relationship: REFERENCES Cavo1i, N.C. and G.J. Rokicki, (1981), IICommercial Engine Logistics Cost Projections - A Dynamic and Flexible Approach, II We can use this equation in conjunction with the cumulative distribution function to calculate a probability of failure. This probability is then compared to a random number generated from a uniform distribution. If the probability is less than the random number, the engine begins a new month. If the probability is greater than or equal to the random number, the engine fails and is replaced by a zero-time engine. Kalbfleisch, J.D. and R.L. Prentice, (1980), The Statistical Analysis of Failure Tlme Data, New York: John Wiley and Sons. Mann, N.R., (1968), "Point and Interval Estimation Procedures for the We have chosen one year as the Two-Parameter Weibu11 and Extreme-Value forecast horizon and typically run the model 100 times to improve its accuracy. Oistributions, II Technometrics, 10, The 231-256. results are reported in tabular form by engine serial number and reason for failure. Nelson, W., (1982), Applied Life Data Analysis, New York: John Wlley and Sons. They are also reported in a histogram (Figure 6) so the planner can judge the range of failures for each type of engine. Of course, there is no guarantee that a particular engine will fail since the process is based on a random number generator. Paper presented at the Gas Turbine Conference and Products Show, Houston, TX. Pratt and Whitney Aircraft, (1967), Introduction to Weibul1 Analysis, PWA 3001, East Hartford, CT. On the other hand, this method can provide considerably more insight than applying a flat rate. It also takes the age of the engine into account since engines with more running time are more likely to fail. Pratt and Whitney Aircraft, (1980), JT8D Maintenance Technology Study, PWA 5738, East Hartford, CT. SAS Communications*, (1987), "Distribution Testing Using Base SAS* Software," SAS Institute Inc., XIII, 30-31. CONCLUSION This paper has considered the application of Weibu11 analysis to the prediction of aircraft engine failures. There are distinct advantages and disadvantages to this approach. On the negative side, Weibu11 analysis has more startup time and requires more technical expertise. It is also more difficult for the layman to understand and apply the technique. On the positive side, SAS software eliminates much of the mundane work with predefined procedures. That means that more time can be spent analyzing the data and less time coding. In addition, by Stephens, M.A., (1974), "EOF Statistics for Goodness of Fit and Some Comparisons,1I Journal of the American Statistical Association, 69, 730-736. Stephens fl.A., (1977), "Goodness of Fit for the Extreme Value Distribution,lI Biometrika, 64, 583-588. Weibull, W., (1951), "A Statistical Distribution Function of Wide Applicability," Journal of Applied Mechanics, 18, 293-297. taking the equipment age into consideration, Weibul1 analysis provides a more comprehensive approach to equipment failure than rate analysis. * SAS and SAS Communications are the registered trademarks of SAS Institute Inc., Cary, NC, USA. Although this project is still relatively young, we believe the advantages of this approach will far outweigh the increased For further information please contact: startup costs by improving our planning, budgeting and inventory control. Furthermore, the versatility of the approach means that it can be applied in other areas of the company as well. Bruce Kay Federal Express Corp. 2831 Airways Blvd. Memphis, TN 38132 222 JT8D-7 CUMUUTlVE DISTRI8UTION fUNCTION GA .. M.... 2 .• 1 ALPH ..... SOi5 , • •, G G ,• ,,, , 0 • •0, JT80-7 ORIGIN .. l OU" GUIM .... 2.25 .. LPH ..... 598 ... .. . ... .. ... .. ... ........ ,. L(-LUll I -, I ,. -, i i i ..i .. .. .. ... ... .... , • ,• ,•,• ,, , ....... ,i .i ,i ., i .i Ji . :. . o G , •, +10+1. .. .... L(_lIS» o .. 1.1.1.10 -, ++++ - ...I• +-~-- 7 7.5 8 8.5 ..• .. .• ... .. ...... ..•• ++10++ +--- ----- -+- - - ------+- +- - - - - - - - -+- - - - - - - - - + - - - - - - - -+- - - - - - - - -+- - - - - - - --+ 8.5 • I • • ...•.. •• ++++ ----~--+- - - - - ----+- -- - ----+------ - --+ 8 7 8 fiGURE 1 FIGURE 2 JUO-l PRE/oIATURE f .. ILURES _ .. _2 .•5 ALPHA_517l LOO(-lOO(SURVIVAL)) ESTlynES ,i .i ,i ., i .i ... -. i ..... -. i··:....· L( -LIS)) G ........ "+++1. ..... .. 1.+++ • "1.++1. ...... A+++A+ .•. ++.. + A.. ,l++A A+++ .. +++A +A++,l +A++A ++++,l+ +1.++++ .. - +----- ----+ S.25 ---~-+----~----+---------+--------~+- 8.5 8.75 7 7.25 -------+------ --+--- -----+- 1.5 7.75 8 -------+------~--+ 8.25 8.15 8.16 ----~---+ 9 lOO TSI flOOf'lE 3 JUO-7 FOREIGN 08JECT DAMAGE GA ..... A.. 1.73 ALPHA_U715 LOO(_lOO(SURVIUl») ESTIMATES ,i ·,,,, .,i .i , • .i •,• ., i -.. , • •, L(_lIS)) G G . .. .. ... ++1.+1. .. ...• .. ... •• +++.. 1.+++ + - - - - - - - - -+- - - 8 .... ++,l+++ o 8.6 -~ 9 LOO lSI LOO TSI •, •,• , ,, , • ,•• . .......• .. LOO(-LOO(SURVIVAl)) eSTI .... 'ES lOO(-lOO(8URVIVALl) ESTIMATES - - -+- - - - - - - - -+- - - - - - - - -+- - - - - - - - -+- - -7 7.6 8 •. 5 lOG TSI FIGURE. 223 STARTUP AND LlFELIMITED TIMES WEIBULL PARAMETERS UTILIZATION RATE CALCULATE CONDITIONAL PROBABILI'IY GENERATE RANDOM NUMBER y N FIGURE 5 JT80-1 FAILURE DISTf'llBUTlON fREQUENCY BAR CHART fREQUENCY "i .i .i ,i ,i ,i ,i ,i ,i ,i 10 11 12 13 U Hi 18 17 18 HI 20 21 22 23 FAILURES flGURE 8 224 2.. 25 28 27 28 29 30 31 32 33 3" 35