Download Briefly, what is probability (include in this the 3 "approaches" to

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Statistics wikipedia , lookup

History of statistics wikipedia , lookup

Ars Conjectandi wikipedia , lookup

Probability interpretations wikipedia , lookup

Probability wikipedia , lookup

Transcript
1. Briefly, what is probability (include in this the 3 "approaches" to probability discussed in the
early part of chapter 5).
Classical approach to Probability
Suppose there are 𝑛 outcomes for a random experiment, which are equally likely, mutually
exclusive and exhaustive. If π‘š of the 𝑛 outcomes are favorable to an event 𝐴, then the
π‘š
probability of the event is defined as 𝑃(𝐴) = .
𝑛
Frequency approach to probability
Let the experiment be repeated 𝑛 times. Suppose the event 𝐴 occurs f times and does not
occur in 𝑛 βˆ’ 𝑓 times. Then 𝑓 is called the frequency of the event 𝐴 in 𝑛 repetitions and 𝑓/𝑛 is called the
relative frequency. The limit of frequency ration as 𝑛 becomes larger and larger and tends to infinity is
defined as the probability of the event 𝐴.
Axiomatic approach to probability.
Consider the collection of all events. Then the function 𝑃 defined for every event is a probability
function if it satisfies the following three axioms.
Axiom 1 (Axiom of nonnegativity)
If A is any event, then P(A)β‰₯0
Axiom 2(axiom of certainty)
Let S be the sample space. Then P(S)=1
Axiom 3EAxiom of additivity)
If A and b are mutually exclusive events, then P(Aβˆͺ 𝐡) =P(A)+P(B).
2. Please (1) work the following problems, (2) tell me what rule or principle you used to solve
them and (3) then look at others' answers to check yourself and that person -- let's see if we
can come to consensus on them!
1. Ninety students will graduate from Lima Shawnee High School this spring. Of the 90 students,
50 are planning to attend college. Two students are to be picked at random to carry flags at
graduation.
a. What is the probability both of the selected students plan to attend college?
Total number of ways in which 2 students can be selected from 90 students = (
4005
Number of ways in which 2 students can be selected from 50 students = (
50×49
50
)=
= 1225
1×2
2
Probability that both of the selected students are planning to attend college
=
π‘π‘’π‘šπ‘π‘’π‘Ÿ π‘œπ‘“ π‘“π‘Žπ‘£π‘œπ‘Ÿπ‘Žπ‘π‘™π‘’ π‘π‘Žπ‘ π‘’π‘  1225 245
=
=
= 0.3059
π‘‡π‘œπ‘‘π‘Žπ‘™ π‘›π‘’π‘šπ‘π‘’π‘Ÿ π‘œπ‘“ π‘π‘Žπ‘ π‘’π‘ 
4005 801
90×89
90
)=
=
1×2
2
b. What is the probability one of the two selected students plans to attend college?
Total number of ways in which 2 students can be selected from 90 students = (
4005
90×89
90
)=
=
1×2
2
Number of ways in which 1 student can be selected from the group of 50 students and 1
student from the remaining group of 40 students = 50 × 40 = 2000
Probability that one of the selected students plans to attend college
=
π‘π‘’π‘šπ‘π‘’π‘Ÿ π‘œπ‘“ π‘“π‘Žπ‘£π‘œπ‘Ÿπ‘Žπ‘π‘™π‘’ π‘π‘Žπ‘ π‘’π‘  2000 400
=
=
= 0.4994
π‘‡π‘œπ‘‘π‘Žπ‘™ π‘›π‘’π‘šπ‘π‘’π‘Ÿ π‘œπ‘“ π‘π‘Žπ‘ π‘’π‘ 
4005 801
2. A survey of undergraduate students in the School of Business at Northern University revealed
the following regarding the gender and majors of the students:
Major
Gender Accounting Management Finance total
Male 100 150 50 300
Female 100 50 50 200
total 200 200 100 500
Let us define the following events.
Let 𝐹 denote the event a randomly selected student is a female.
Let M denote the event a randomly selected student is a male.
Let A denote the event a randomly selected student is a accounting major
Let B denote the event a randomly selected student is a management major.
Let C denote the event a randomly selected student is a finance major.
a.
What is the probability of selecting a female student?
Total number of students = 500
Number of female students = 200
π‘π‘’π‘šπ‘π‘’π‘Ÿ π‘œπ‘“ π‘“π‘Žπ‘£π‘œπ‘Ÿπ‘Žπ‘π‘™π‘’ π‘π‘Žπ‘ π‘’π‘ 
200
2
Probability that a randomly selected student is a female =
=
= = 0.4
π‘‡π‘œπ‘‘π‘Žπ‘™ π‘›π‘’π‘šπ‘π‘’π‘Ÿ π‘œπ‘“ π‘π‘Žπ‘ π‘’π‘ 
𝑃(𝐹) = 0.4
b.
What is the probability of selecting a finance or accounting major?
Total number of students = 500
Number of finance major students = 100
500
5
Probability that a randomly selected student is a finance major student
=
π‘π‘’π‘šπ‘π‘’π‘Ÿ π‘œπ‘“ π‘“π‘Žπ‘£π‘œπ‘Ÿπ‘Žπ‘π‘™π‘’ π‘π‘Žπ‘ π‘’π‘ 
π‘‡π‘œπ‘‘π‘Žπ‘™ π‘›π‘’π‘šπ‘π‘’π‘Ÿ π‘œπ‘“ π‘π‘Žπ‘ π‘’π‘ 
=
100
500
1
= = 0.2
5
𝑃(𝐢) = 0.2
Number of accounting major students = 200
Probability that a randomly selected student is a accounting major student
=
π‘π‘’π‘šπ‘π‘’π‘Ÿ π‘œπ‘“ π‘“π‘Žπ‘£π‘œπ‘Ÿπ‘Žπ‘π‘™π‘’ π‘π‘Žπ‘ π‘’π‘ 
π‘‡π‘œπ‘‘π‘Žπ‘™ π‘›π‘’π‘šπ‘π‘’π‘Ÿ π‘œπ‘“ π‘π‘Žπ‘ π‘’π‘ 
=
200
500
2
= = 0.4
5
𝑃(𝐴) = 0.4
Then the event that a randomly selected student is finance major or accounting major is 𝐴 βˆͺ 𝐢.
A student cannot be both accounting major and finance major simultaneously. So the events 𝐴 and C
are mutually exclusive. By the axiom of additivity,
𝑃(𝐴 βˆͺ 𝐢) = 𝑃(𝐴) + 𝑃(𝐢) =
100 200 300
+
=
= 0.6
500 500 500
The probability that a randomly selected student is finance major or accounting major = 0.6
c. What is the probability of selecting a female or an accounting major? Which rule of
addition did you apply?
The required probability is 𝑃(𝐹 βˆͺ 𝐴)
By the addition theorem of probability,
𝑃(𝐹 βˆͺ 𝐴) = 𝑃(𝐹) + 𝑃(𝐴) βˆ’ 𝑃(𝐹 ∩ 𝐴)
𝑃(𝐹) =
π‘π‘’π‘šπ‘π‘’π‘Ÿ π‘œπ‘“ π‘“π‘’π‘šπ‘Žπ‘™π‘’ 𝑠𝑑𝑒𝑑𝑒𝑛𝑑𝑠 200
=
π‘‘π‘œπ‘‘π‘Žπ‘™ π‘›π‘’π‘šπ‘π‘’π‘Ÿ π‘œπ‘“ 𝑠𝑑𝑒𝑑𝑒𝑛𝑑𝑠
500
π‘π‘’π‘šπ‘π‘’π‘Ÿ π‘œπ‘“ π‘Žπ‘π‘π‘œπ‘’π‘›π‘‘π‘–π‘›π‘” π‘šπ‘Žπ‘—π‘œπ‘Ÿ 𝑠𝑑𝑒𝑑𝑒𝑛𝑑𝑠 200
=
π‘‡π‘œπ‘‘π‘Žπ‘™ π‘›π‘’π‘šπ‘π‘’π‘Ÿ π‘œπ‘“ 𝑠𝑑𝑒𝑑𝑒𝑛𝑑𝑠
500
π‘π‘’π‘šπ‘π‘’π‘Ÿ π‘œπ‘“ π‘“π‘’π‘šπ‘Žπ‘™π‘’ π‘Žπ‘π‘π‘œπ‘’π‘›π‘‘π‘–π‘›π‘” 𝑠𝑑𝑒𝑑𝑒𝑛𝑑𝑠 100
𝑃(𝐹 ∩ 𝐴) =
=
π‘‡π‘œπ‘‘π‘Žπ‘™ π‘›π‘’π‘šπ‘π‘’π‘Ÿ π‘œπ‘“ 𝑠𝑑𝑒𝑑𝑒𝑛𝑑𝑠
500
𝑃(𝐴) =
Then
𝑃(𝐹 βˆͺ 𝐴) = 𝑃(𝐹) + 𝑃(𝐴) βˆ’ 𝑃(𝐹 ∩ 𝐴) =
c.
200 200 100 300 3
+
βˆ’
=
= = 0.6
500 500 500 500 5
Are gender and major independent? Why?
Gender and major are independent if and only if all of the following conditions is satisfied.
If any one of the conditions is not satisfied, then gender and major are not independent.
𝑃(𝐹 ∩ 𝐴) = 𝑃(𝐹)𝑃(𝐴)
𝑃(𝐹 ∩ 𝐡) = 𝑃(𝐹)𝑃(𝐡)
𝑃(𝐹 ∩ 𝐢) = 𝑃(𝐹)𝑃(𝐢)
𝑃(𝑀 ∩ 𝐴) = 𝑃(𝑀)𝑃(𝐴)
𝑃(𝑀 ∩ 𝐡) = 𝑃(𝑀)𝑃(𝐡)
𝑃(𝑀 ∩ 𝐢) = 𝑃(𝑀)𝑃(𝐢)
A
B
C
Total
M
100
150
50
300
F
100
50
50
200
Total
200
200
100
500
From the above table
300 3
=
500 5
200 2
𝑃(𝐴) =
=
500 5
100 1
𝑃(𝑀 ∩ 𝐴) =
= = 0.20
500 5
3 2
6
𝑃(𝑀). 𝑃(𝐴) = . =
= 0.24
5 5 25
𝑃(𝑀) =
Clearly
𝑃(𝑀 ∩ 𝐴) β‰  𝑃(𝑀)𝑃(𝐴)
Therefore the events 𝑀 and 𝐴 are not independent.
Gender and major are not independent.
d.
What is the probability of selecting an accounting major, given that the person selected is a
male?
The required probability is the conditional probability of the event 𝐴 given that the event 𝑀 has
occurred. By the definition of conditional probability,
𝑃(𝑀 ∩ 𝐴) 1/5 1
𝑃(𝐴/𝑀) =
=
=
𝑃(𝑀)
3/5 3
e.
Suppose two students are selected randomly to attend a lunch with the president of the
university. What is the probability that both of those selected are accounting majors?
500
Total number of ways in which 2 students can be selected from 500 students = (
)=
2
500×499
= 124,750
1×2
Number of ways in which 2 accounting major students can be selected from 200 accounting
200×199
200
major students = (
)=
= 1,990
1×2
2
Probability that the 2 students are accounting major
π‘“π‘Žπ‘£π‘œπ‘Ÿπ‘Žπ‘π‘’ π‘›π‘’π‘šπ‘π‘’π‘Ÿ π‘œπ‘“ π‘π‘Žπ‘ π‘’π‘ 
1,990
199
=
=
=
= 0.015952
π‘‘π‘œπ‘‘π‘Žπ‘™ π‘›π‘’π‘šπ‘π‘’π‘Ÿ π‘œπ‘“ π‘π‘Žπ‘ π‘’π‘ 
124,750 12,475
1. Reynolds Construction Company has agreed not to erect all "look-alike" homes in a new
subdivision. Five exterior designs are offered to potential home buyers. The builder has
standardized three interior plans that can be incorporated in any of the five exteriors. How
many different ways can the exterior and interior plans be offered to potential home buyers?
There are 5 exterior designs and there are 3 interior designs.
Number of ways in which 1 exterior design from 5 exterior designs 1 interior design fro 3 interior
design can be selected = 5 × 3 = 15
1. A puzzle in the newspaper presents a matching problem. The names of 10 in one U.S.
presidents are listed column, and their vice presidents are listed in random order in the second
column. The puzzle asks the reader to match each president with his vice president. If you
make the matches randomly, how many matches are possible? What is the probability all 10 of
your matches are correct?
The total number ways in which 10 vice presidents can be arranged against the 10 presidents is
= 10! = 10 × 9 × 8 × 7 × 6 × 5 × 4 × 3 × 2 × 1 = 3628800
Of the 3628800 arrangements only is in the correct order.
1
Therefore, the required probability is
3628800
1. What is a continuous probability distribution (use the information from Applied Statistics in
Business and Economics, Chapters 7) as compared with a discrete probability distribution which
is discussed in chapter 6 (same text)?
Let 𝑋 be a random variable. If the variable 𝑋 assumes a finite number of values or a countably
infinite number of values, then it is known as a discrete random variable. The number of
students in a randomly selected school or the number of children in a randomly selected family
are examples of discrete random variables.
If the variable assumes a continuum of values it is called a continuous random variable. A
continuous variable assumes all values in a given interval. The height of randomly selected
student, the weight of a new born baby etc are examples of continuous random variables.
1. How does the Empirical Rule apply to continuous discrete probability distributions (Chapter 6)?
Does the Empirical Rule always apply to discrete probability distributions? Why or why not?
Does the Empirical Rule apply to continuous probability distributions (Chapter 7)? If so, how
and when?
If the distribution is normal 68% of the data lie within 1 standard deviation from the mean, 99% lie
within 2 standard deviations from the mean and 99.7% lie within 3 standard deviations. If the
distribution is approximately normal, then approximately 68% of the data lie within 1 standard
deviation from the mean, approximately 99% lie within 2 standard deviations from the mean and
approximately 99.7% lie within 3 standard deviations. This the empirical rule. Since normal
distribution is a continuous distribution, the empirical rule is applied to continuous distributions.
1. Use Chapter 7 (Applied Statistics in Business and Economics) to address the following 2
problems:
o The mean starting salary for college graduates in the spring of 2004 was $36,280.
Assume that the distribution of starting salaries follows the normal distribution with a
standard deviation of $3,300. What percent of the graduates have starting salaries:
1. Between $35,000 and $40,000?
2. More than $45,000?
3. Between $40,000 and $45,000?
Let 𝑋 denote the starting salary of a randomly selected college graduate. Then X follows normal
π‘‹βˆ’36,280
distribution with mean 36,280 and standard deviation 3,300. Then Z =
follows standard normal
3,300
distribution.
1. 𝑃(35,000 < 𝑋 < 40,000) = 𝑃 (
35,000βˆ’36,280
3,300
<
π‘‹βˆ’36,280
3,300
<
40,000βˆ’36,280
3,300
) = 𝑃(βˆ’0.3879 < 𝑍 < 1.1273)
= 𝑃(𝑍 < 1.1273) βˆ’ 𝑃(𝑍 < βˆ’0.3879) = 0.8702 βˆ’ 0.3491 = 0.5211
2. 𝑃(𝑋 > 45,000) = 𝑃 (
π‘‹βˆ’36,280
3,300
>
3. 𝑃(40,000 < 𝑋 < 45,000) = 𝑃 (
45,000βˆ’36,280
3,300
40,000βˆ’36,280
3,300
) = 𝑃(𝑍 > 2.6424) = 1 βˆ’ 𝑃(𝑍, 2.6424) = 1 βˆ’ 0.9959 = 0.0041
<
π‘‹βˆ’36,280
3,300
<
45,000βˆ’36,280
3,300
) = 𝑃(1.1273 < 𝑍 < 2.6424)
= 𝑃(𝑍 < 2.6424) βˆ’ 𝑃(𝑍 < 1.1273) = 0.9959 βˆ’ 0.8702 = 0.1257
The price of shares of Bank of Florida at the end of trading each day for the last year
followed the normal distribution. Assume there were 240 trading days in the year. The
mean price was $42.00 per share and the standard deviation was $2.25 per share.
1. What percent of the days was the price over $45.00? How many days would you
estimate?
2. What percent of the days was the price between $38.00 and $40.00?
3. What was the stock's price on the highest 15 percent of days?
Let 𝑋 denote the price of shares of Bank of Florida on at the end of a randomly selected
trading day. Then 𝑋 follows normal distribution with mean 42.00 and standard deviation 2.25.
o
1. 𝑃(𝑋 > 45.00) = 𝑃 (𝑍 >
45.00βˆ’42.00
2.25
) = 𝑃(𝑍 > 1.3333) = 1 βˆ’ 𝑃(𝑍 < 1.3333) = 1 βˆ’ 0.9088 = 0.0912
The price was above 45.00 in 9.12% of days which is 240 ×
38βˆ’42
9.12
100
= 21.89 β‰… 22 π‘‘π‘Žπ‘¦π‘ 
40βˆ’42
2. 𝑃(38 < 𝑋 < 40) = 𝑃 (
<𝑍<
) = 𝑃(βˆ’1.7778 < 𝑍 < βˆ’0.8889)
2.25
2.25
= 𝑃(𝑍 < βˆ’0.8889) βˆ’ 𝑃(𝑍 < βˆ’1.7778) = 0.1870 βˆ’ 0.0377 = 0.1493
The price was between 38 and 40 in 14.93% of the days.
3. From tables of standard normal distribution 𝑃(𝑍 > 1.0364) = 0.15 = 15%
The corresponding value of 𝑋 is 𝑋 = 42.00 + 1.0364 × 2.25 = 44.3320 β‰… 44.33
The stock’s price on the highest 15 percent of days was above $44.33
***********************************************************************
Statistical Symbols and Definitions Matching Assignment
Match the letter of the definition on the right to the appropriate symbol on the left.
Symbols
ο‚· 1.
ο‚· 2.
ο‚· 3.
ο‚· 4.
ο‚· 5.
ο‚· 6.
ο‚· 7.
ο‚· 8.
S (Uppercase Sigma) ____
m (Mu) ____
s (Lowercase Sigma) ____
p (Pi) ____
e (Epsilon) ____
c2 (Chi Square) ____
! ____
H0 ____
Definitions
ο‚· a. Null hypothesis
ο‚· b. Summation
ο‚· c. Factorial
ο‚· d. Nonparametric hypothesis test
ο‚· e. Population standard deviation
ο‚· f. Alternate hypothesis
ο‚· g. Maximum allowable error
ο‚· h. Population mean
ο‚·
9. H1 ____
i. Probability of success in a binomial trial
Symbol
Definition
Ξ£
Summation
πœ‡
Population mean
𝜎
Population standard deviation
πœ‹
Probability of success in binomial trial
πœ–
Maximum allowable error
πœ’2
Nonparametric hypothesis test
!
Factorial
𝐻0
Null hypothesis
𝐻1
Alternate hypothesis
Match the letter of the term on the right to the definition of that term on the left.
Definitions
Terms
ο‚· 1. The average of the squared deviation scores from a
ο‚· a. Reliability
distribution mean. ____
ο‚· 2. Midpoint in the distribution of numbers. ____
ο‚· b. Mode
ο‚· 3. It has to do with the accuracy and precision of a
ο‚· c. Generalization
measurement procedure. ____
ο‚· 4. Examines if an observed causal relationship
ο‚· d. Variance
generalizes across persons, settings, and times. ____
ο‚· 5. The difference between the largest and smallest
ο‚· e. Median
score in a distribution. ____
ο‚· 6. The arithmetic average. ____
ο‚· f. External validity
ο‚· 7. Refers to the extent to which a test measures what
ο‚· g. Mean
we actually wish to measure. ____
ο‚· 8. The most frequently occurring value in a set of
ο‚· h. Internal validity
numbers. ____
ο‚· 9. The conclusion from research conducted on a sample ο‚· i. Range
population to the population as a whole. ____
ο‚· j. Standard deviation
ο‚· 10. Examines whether the conclusion that we draw
ο‚· k. Validity
about a demonstrated experimental relationship truly
implies cause. ____
ο‚· 11. Determines how far away the data values are from
the average. ____
Definitions
The average of the squared deviation scores from
a distribution mean. ____
. Midpoint in the distribution of numbers. ____
It has to do with the accuracy and precision of a
Terms
Variance
Median
Reliability
measurement procedure. ____
. Examines if an observed causal relationship
generalizes across persons, settings, and times.
____
The difference between the largest and smallest
score in a distribution. ____
The arithmetic average. ____
. Refers to the extent to which a test measures
what we actually wish to measure. ____
The most frequently occurring value in a set of
numbers. ____
The conclusion from research conducted on a
sample population to the population as a whole.
____
Examines whether the conclusion that we draw
about a demonstrated experimental relationship
truly implies cause. ____
Determines how far away the data values are
from the average. ____
External validity
Range
Mean
Validity
Mode
Generalization
Internal validity
Standard deviation