Download 6.2 Transforming and Combining Random

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Bootstrapping (statistics) wikipedia , lookup

History of statistics wikipedia , lookup

Degrees of freedom (statistics) wikipedia , lookup

Central limit theorem wikipedia , lookup

Law of large numbers wikipedia , lookup

Transcript
6.2 Transforming and Combining
Random Variables
Objectives
SWBAT:
• DESCRIBE the effects of transforming a random variable by adding or subtracting a constant and
multiplying or dividing by a constant.
• FIND the mean and standard deviation of the sum or difference of independent random variables.
• FIND probabilities involving the sum or difference of independent Normal random variables.
El Dorado Community College considers a student to be full-time if he or she
is taking between 12 and 18 units. The number of units X that a randomly
selected El Dorado Community College full-time student is taking in the fall
semester has the following distribution.
Calculate and interpret the mean and standard deviation of X.
If we were to randomly select many, many full-time students at El Dorado Community College, the
average number of units they would be taking in the fall semester is 14.65.
The number of units taken by randomly selected El Dorado
Community College full-time students in the fall typically
varies by about 2.06 units from the mean (14.65).
At El Dorado Community College, the tuition for full-time students is $50
per unit. So, if T = tuition charge for a randomly selected full-time
student, T = 50X. Here’s the probability distribution for T:
(note: 600=50(12 credits))
Calculate the mean and standard deviation of T.
What is the effect of multiplying or dividing a random variable by a constant?
Effect on a Random Variable of Multiplying (Dividing) by a Constant
Multiplying (or dividing) each value of a random variable by a number b:
• Multiplies (divides) measures of center and location (mean,
median, quartiles, percentiles) by b.
• Multiplies (divides) measures of spread (range, IQR, standard
deviation) by |b|.
• Does not change the shape of the distribution.
In addition to tuition charges, each full-time student at El Dorado Community College is
assessed student fees of $100 per semester. If C = overall cost for a randomly selected fulltime student, C = 100 + T.
Here is the probability distribution for C and a histogram of the probability distribution:
Calculate the mean and standard deviation of C.
What is the effect of adding (or subtracting) a constant to a random variable?
Effect on a Random Variable of Adding (or Subtracting) a Constant
Adding the same number a (which could be negative) to each value of a
random variable:
• Adds a to measures of center and location (mean, median,
quartiles, percentiles).
• Does not change measures of spread (range, IQR, standard
deviation).
• Does not change the shape of the distribution.
What is a linear transformation? How does a linear transformation
affect the mean and standard deviation of a random variable?
A linear transformation is a transformation of a random variable that involves adding a constant a,
multiplying by a constant b, or both.
Effect on a Linear Transformation on the Mean and Standard Deviation
If Y = a + bX is a linear transformation of the random variable X, then
•The probability distribution of Y has the same shape as the probability distribution of X.
•µY = a + bµX.
•σY = |b|σX (since b could be a negative number).
Linear transformations have similar effects on other measures of center or location (median,
quartiles, percentiles) and spread (range, IQR).
Whether we’re dealing with data or random variables, the effects of a linear transformation are
the same.
Note: These results apply to both discrete and continuous random variables.
In a large introductory statistics class, the distribution of X = raw scores on a test was
approximately normally distributed with a mean of 17.2 and a standard deviation of 3.8. The
professor decides to scale the scores by multiplying the raw scores by 4 and adding 10.
(a) Define the variable Y to be the scaled score of a randomly selected student from this class.
Find the mean and standard deviation of Y.
(b) What is the probability that a randomly selected student has a scaled test score of at least
90?
N(78.8, 15.2)
There is a 23.06% chance that a randomly selected student has a
scaled test score of at least 90.
Rules for combining random variables
Many interesting statistics problems require us to examine two or more random variables.
Mean of the Sum of Random Variables
For any two random variables X and Y, if T = X + Y, then the expected value of T is
E(T) = µT = µX + µY
In general, the mean of the sum of several random variables is the sum of their means.
Variance of the Sum of Random Variables
For any two independent random variables X and Y, if T = X + Y, then the variance of T is
sT2 = sX2 + sY2
In general, the variance of the sum of several independent random variables is the sum of their
variances.
Rules for combining random variables
Mean of the Difference of Random Variables
For any two random variables X and Y, if D = X - Y, then the expected value of D is
E(D) = µD = µX - µY
In general, the mean of the difference of several random variables is the difference of their means.
The order of subtraction is important!
Variance of the Difference of Random Variables
For any two random variables X and Y, if D = X - Y, then the variance of D is
sD2 = sX2 + sY2
In general, the variance of the difference of two independent random variables is the sum of their
variances.
• With regards to variance, you may be asking yourself “why add even for the
difference of the variables?” Think about this analogy:
We buy some cereal. The box says "16 ounces." We know that's not precisely the
weight of the cereal in the box, just close; after all, one corn flake more or less would
change the weight ever so slightly. Weights of such boxes of cereal vary somewhat,
and our uncertainty about the exact weight is expressed by the variance (or standard
deviation) of those weights.
Next we get out a bowl that holds 3 ounces of cereal and pour it full. Our pouring
skill certainly is not very precise, so the bowl now contains about 3 ounces with
some variability (uncertainty).
How much cereal is left in the box? Well, we'd assume about 13 ounces. But notice
that we're less certain about this remaining weight than we were about the weight
before we poured out the bowlful. The variability of the weight in the box has
increased even though we subtracted cereal.
Moral: Every time something happens at random, whether it adds to the pile or
subtracts from it, uncertainty (read "variance") increases.
Speed Dating: Suppose that the height M of male speed daters follows a Normal
distribution with a mean of 68.5 inches and a standard deviation of 4 inches and
the height F of female speed daters follows a Normal distribution with a mean of
64 inches and a standard deviation of 3 inches. What is the probability that a
randomly selected male speed dater is taller than the randomly selected female
speed dater he is paired with? (only non-simulation approach)
Let’s define the random variable D = M - F
D will represent the difference between the male’s height and the female’s height.
Because D is the difference of two independent Normal random variables, D follows a
Normal distribution.
Our goal is to find P(M > F), or P(D > 0).
We are working with a Normal distribution that has a mean difference of 4.5
inches and a standard deviation of 5 inches.
If we were going to calculate this manually, we would substitute into our zscore formula using 0 for our x. We have to operate under the assumption that
there is no difference between male and female heights.
Using the standard Normal table, this gives us an
area to the left of 0.1841. However, we are
looking for the difference to be greater than 0
(M>F), so we need to find the area to the right,
which is 1-0.1841 = 0.8159.
On the calculator:
There is a 0.8159 chance that a randomly selected male speed dater will be taller than the
randomly selected female speed dater with whom he is paired.
Suppose that a certain variety of apples have weights that are approximately
Normally distributed with a mean of 9 ounces and a standard deviation of 1.5
ounces. If bags of apples are filled by randomly selecting 12 apples, what is the
probability that the sum of the 12 apples is less than 100 ounces?
Let X = weight of a randomly selected apple
Our goal: Find P(T<100)
Because T is the sum of 12 independent Normal random variables, T follows a
Normal distribution with:
N(108, 5.2)
We want to find P(T<100)
There is a 6.2% chance that the 12 randomly selected apples will have a total
weight of less than 100 ounces.
The shape is also difficult to
determine.
The standard deviation cannot be
calculated since the cost for tuition and
fees and the cost for books are not
independent. Students who take more
units will typically have to buy more
books.