Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Chapter 5 Process Analysis 1 Analysis Analysis is the examination of processes, facts, and data to gain an understanding of why problems occur and where opportunities for improvement exist. – Statistical inference is the process of drawing conclusions about unknown characteristics of a population from which data were taken. – Predictive statistics focus on cause-and-effect relationships and predictions of future performance from historical data. 2 Basic Probability Concepts An experiment is a process that results in some outcome. The outcome of an experiment is a result that we observe The collection of all possible outcomes of an experiment is called the sample space. Probability is the likelihood that an outcome occurs. 3 Probability Properties Label the n outcomes in a sample space as O1, O2, … On, where Oi represents the ith outcome in the sample space. The probability associated with any outcome must be between 0 and 1 – 0 ≤ P(Oi) ≤ 1 for each outcome Oi The sum of the probabilities over all possible outcomes must be 1.0 – P(O1) + P(O2) + … + P(On) = 1 4 Events An event is a collection of one or more outcomes from a sample space If A is any event, the complement of A, denoted as Ac, consists of all outcomes in the sample space not in A. Two events are mutually exclusive if they have no outcomes in common. 5 Calculating Probabilities Rule 1: The probability of any event is the sum of the probabilities of the outcomes that compose that event. Rule 2: The probability of the complement of any event A is P(Ac) = 1 – P(A). 6 Calculating Probabilities Rule 3: If events A and B are mutually exclusive, then P(A or B) = P(A) + P(B) Rule 4: If two events A and B are not mutually exclusive, then P(A or B) = P(A) + P(B) – P(A and B) 7 Conditional Probability Conditional probability is the probability of occurrence of one event A, given that another event B is known to be true or have already occurred. Multiplication rule of probability: 8 Tree Diagram 9 Calculation of Joint Probabilities 10 Independent Events Two events A and B are independent if P(A | B) = P(A). 11 Random Variables A random variable, X, is a numerical description of the outcome of an experiment. Formally, a random variable is a function that assigns a numerical value to every possible outcome in a sample space. 12 Probability Distributions A probability distribution, f(x), is a characterization of the possible values that a random variable may assume along with the probability of assuming these values. The cumulative distribution function, F(x), specifies the probability that the random variable X will assume a value less than or equal to a specified value, x, denoted as P(X ≤ x). 13 Important Probability Distributions 14 Discrete – Binomial – Poisson Continuous – Normal – Exponential 14 Binomial Distribution The binomial distribution describes the probability of obtaining exactly x “successes” in a sequence of n identical experiments, called trials. 15 Computing the Binomial Distribution using Excel BINOM.DIST(number_s, trials, probability_s, cumulative) 16 Poisson Distribution = expected value or average number of occurrences x = 0, 1, 2, 3, … e = 2.71828… 17 Computing the Poisson Distribution Using Excel POISSON.DIST(x, mean, cumulative) 18 Probability Density Function A curve that characterizes outcomes of a continuous random variable is called a probability density function, and is described by a mathematical function f(x). – Probabilities are only defined over intervals. – The cumulative distribution function, F(x), represents the probability P(X ≤ x). 19 Normal Distribution Familiar bell-shaped curve. 20 Standard Normal Distribution If a normal random variable has a mean μ = 0 and a standard deviation σ = 1, it is called a standard normal distribution, represented by z. 21 Calculating Normal Probabilities If x is any value from a normal distribution with mean μ and standard deviation σ, we may easily convert it to an equivalent value from a standard normal distribution using: 22 Calculating Normal Probabilities Using Excel Excel function NORM.DIST(x, mean, standard deviation, true) calculates the cumulative probability F(x) for a specified mean and standard deviation. The Excel function NORM.S.DIST(z) calculates the cumulative probability for the standard normal distribution. 23 NORM.INV Function The Excel function NORM.INV(probability, mean, standard_dev) can be used when we know the cumulative probability (probability) but don’t know the value of x. 24 Exponential Distribution The exponential distribution models the time between randomly occurring events, such as the time to or between failures of mechanical or electrical components. 25 Calculating the Exponential Distribution Using Excel The Excel function EXPON.DIST(x, lambda, true) can be used to compute cumulative exponential probabilities. 26 Sampling Distributions A sampling distribution is the distribution of a statistic for all possible samples of a fixed size. Sampling distribution of the mean – Expected value of the sample mean is the population mean – Standard deviation of the sample mean (called the standard error of the mean) is the population standard deviation divided by the square root of the sample size 27 Central Limit Theorem 28 Illustrating the Central Limit Theorem 29 29 Confidence Intervals 30 A confidence interval (CI) is an interval estimate of a population parameter that also specifies the likelihood that the interval contains the true population parameter. This probability is called the level of confidence, denoted by 1 − α, and is usually expressed as a percentage. 30 Common Confidence Intervals 31 Confidence Interval Template The Student Companion Site provides an Excel workbook, Confidence Intervals.xlsx, with worksheet templates for formulas (5.17) through (5.19). 32 Hypothesis Testing Hypothesis testing involves drawing inferences about two contrasting propositions (hypotheses) relating to the value of a population parameter, one of which is assumed to be true in the absence of contradictory data (called the null hypothesis), and the other which must be true if the null hypothesis is rejected (called the alternative hypothesis). 33 Hypothesis Testing Process Steps 1. Formulate the hypotheses to test. 2. Select a level of significance. 3. Determine a decision rule on which to base a conclusion. 4. Collect data and calculate a test statistic. 5. Apply the decision rule to the test statistic and draw a conclusion. 34 Excel Procedures 35 Regression Analysis Regression analysis is a tool for building statistical models that characterize relationships between a dependent variable and one or more independent variables, all of which are numerical. – A regression model that involves a single independent variable is called simple regression. A regression model that involves several independent variables is called multiple regression. 36 Correlation Correlation is a measure of a linear relationship between two variables, X and Y, and is measured by the (population) correlation coefficient. Correlation coefficients will range from −1 to +1. 37 Analysis of Variance Analysis of Variance, or ANOVA, is a hypothesis-testing methodology for drawing conclusions about equality of means of multiple populations. 38 One Way Analysis of Variance In its simplest form—one-way ANOVA—we are interested in comparing means of observed responses of several different levels of a single factor. ANOVA tests the hypothesis that the means of all populations are equal against the alternative hypothesis that at least one mean differs from the others. 39 Multi-Vari Studies A multi-vari study investigates three types of process variation: positional (variation within the same item or sample), cyclical (variation between parts or samples), and temporal (over time, such as between different production shifts). 40 Design of Experiments A designed experiment is a test or series of tests that enables the experimenter to compare two or more methods to determine which is better, or determine levels of controllable factors to optimize the yield of a process or minimize the variability of a response variable. 41 Factorial Experiments Factorial experiment – one that considers all combinations of levels of each factor The simplest type of factorial experiment is one with two factors at two levels Each combination of different levels of the factor is called a treatment. 42 Main Effects A main effect measures the difference that a factor has on the response. – What is the effect of increasing the temperature regardless of the value of the reaction time? – What is the effect of increasing the reaction time regardless of the temperature? Main effect = (Average response at high level) - (Average response at low level) (5.22) 43 Interactions An interaction is the effect of changing one factor has on the level of other factors. – For example, increasing factor 1 when factor 2 is at the low level might result in an increase in the response variable; however, increasing factor 1 when factor 2 is at the high level might result in a decrease in the response variable. Interaction effect = (Average response with both factors at the same level) – (Average response with both factors at opposite levels) (5.23) 44 Excel Templates for Factorial Experiments The Student Companion Site has Excel templates for factorial experiments. 45 Note to Instructors 46 The following slides provide an experiential exercise found on the Web for applying DOE. Students usually have a lot of fun with this. 46 Paper Helicopter Design 1. 2. 3. 4. 47 Cut along all the solid lines on the diagram to the right. Fold flap A forward and flap B to the back. Fold flaps C and D both forward along the dotted lines. Fold along the line E upward to give a weight at the bottom. 47 Making the Helicopter 48 48 Helicopter Project Description Response goal: Maximize the length of time before hitting the ground from a fixed height. Identify 3 potential sources that might influence the response, for example, AB length, CD length, and width Develop an experimental design; collect data, and analyze results. Each group should submit a report of their findings next class. Show all results; calculate the main effects and interactions, and draw conclusions about what is the best design. 49 Root Cause Analysis Root cause – “that condition (or interrelated set of conditions) having allowed or caused a defect to occur, which once corrected properly, permanently prevents recurrence of the defect in the same, or subsequent, product or service generated by the process.” 50 Five Why Technique Redefine a problem statement as a chain of causes and effects to identify the source of the symptoms by asking why, ideally five times. 51 Cause-and-Effect Diagrams Cause-and-effect diagram – a simple graphical method for presenting a chain of causes and effects and for sorting out causes and organizing relationships between variables. 52 Project Review – Analyze (1 of 2) Team members have received any necessary “justin-time” training Team members understood how to use analysis tools appropriately and effectively The data collected in the Measure Phase have been fully understood and studied Appropriate statistical tools have been used to conduct the analyses of data Variation is thoroughly understood Root causes and hypotheses that explain problems have been identified 53 Project Review – Analyze (2 of 2) Data provide confirmation of key conclusions and validation of root causes Process maps are accurate and representative of actual or desired process flow (in the case of a re-design activity) The process has been studied to identify bottlenecks, sources of error, and non-value added activities Preliminary improvement or re-design goals have been set 54