Download (a) When would you use Chebyshev`s theorem and the empirical

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Birthday problem wikipedia , lookup

Infinite monkey theorem wikipedia , lookup

Inductive probability wikipedia , lookup

Ars Conjectandi wikipedia , lookup

Probability box wikipedia , lookup

Probability interpretations wikipedia , lookup

Central limit theorem wikipedia , lookup

Transcript
(a) When would you use Chebyshev’s theorem and the empirical rule in business? How are they calculated? Provide one real-life
example that requires Chebyshev’s theorem and one that requires the empirical rule.
(b) Why is using Bayes’ theorem important to help answer business-related questions?
(c) What does this theorem allow you to do that traditional statistics do not?
(d) What are some prerequisites for using Bayesian statistics?
(a) (i) Chebyshev's theorem gives an upper bound for the proportion of the data falling within k standard deviations of
the mean of a distribution. The theorem states that, for any population or sample, at least [1 - (1 / k)^2] of the
observations in the data set fall within k standard deviations of the mean, where k  1.
For example we could use Chebyshev's theorem to state the upper limit on the proportion of observations that fall
within, say, 1.5 standard deviation or 2 standard deviations of the mean salary of the employees of an organization. In
this way, we would be able to construct an approximate (rough) confidence interval for the mean salary.
1 - (1/1.5)^2 = 0.5555 → about 55.55% of the salaries are within 1.5 standard deviations of the mean salary
1 - (1/2)^2 = 0.75 → about 75% of the salaries are within 2 standard deviations of the mean salary
(ii) The empirical rule is used for the same purpose as Chebyshev’s theorem, but when we know that the original
observations conform to a normal distribution. The empirical rule states that 68.3%, 95.5%, and 99.7% of all
observations fall within 1, 2, and 3 standard deviations, respectively of the mean.
For example, the control limits of a control chart are drawn 3 standard deviations away from the mean. That means that
99.7% of all observations are expected (or required) to fall within the control limits (assuming the process is in control).
Using such a control chart, we can check if the process is in control or not.
(b) The simplest version of the theorem states that P(A|B) = P(B|A) * P(A) / P(B)
In words, the probability of A given B is equal to the probability of B given A times the probability of A, divided by the
probability of B.
For example, suppose that for a given disease, there is a test that gives accurate positive and negative results 98% of the
time; Also, 3% of the population has the disease.
We can use Baye’s Theorem, to determine the probability that a person has the disease given that the test is positive:
P(has the disease | test is positive) = P(test is positive | has the disease) * P(has the disease) / P(test is positive)
= 0.98 * 0.03 / {0.98 * 0.03 + (1 - 0.98) * (1 - 0.03)}
= 0.6025
Thus, there is a 60.25% chance that given a person tests positive, she actually has the disease.
(c) In Baye’s Theorem, evidence helps confirm our suspicions more if we thought it were more unlikely to occur.
(d) To use Bayesian statistics, one must have some prior knowledge of the opposite probability than one is trying to
determine. For example, if one needs to determine the probability that a person is sick, given the test results, one needs
to have an estimate for the probability that the test comes back positive, given that the person is sick. One also needs to
know the probability of being sick, independent of any test.