Download Final test Statistics 2

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Hardware random number generator wikipedia , lookup

Computational electromagnetics wikipedia , lookup

Fisher–Yates shuffle wikipedia , lookup

Generalized linear model wikipedia , lookup

Birthday problem wikipedia , lookup

Randomness wikipedia , lookup

Risk aversion (psychology) wikipedia , lookup

Transcript
VOORBLAD VOOR EEN SCHRIFTELIJK TENTAMEN/TOETS
Verplicht gedeelte:
Vaknaam
: Statistics 1 for IBA
Vakcode
: 30J109
Datum tentamen
: 26 maart 2010
Duur tentamen
: 3 uur
Docent
: Gert Nieuwenhuis (tel. 2350 of 0650504640)
Telefoon secretariaat departement: 2430
Studenten worden geacht zich tijdens het tentamen correct te gedragen en de instructies van
examinator en surveillant op te volgen.
Bij constatering van fraude wordt streng opgetreden.
Alle studenten moeten uiteindelijk inleveren:
1) de door hen geproduceerde oplossingen van de opgaven;
2) ook de formulieren met alle vragen.
Vriendelijk verzoek aan de surveillanten: zou u de uitwerkingen van de studenten
per blauwe lijst bij elkaar willen leggen? Zodat onder elke blauwe lijst meteen de
toetsen liggen van de studenten die op deze lijst staan. Wel graag de formulieren met
de opgaven apart op een stapel. Alvast zeer bedankt.
Facultatief gedeelte:
Open boek examen
Schrapkaart
Gebruik eenvoudige calculator
Gebruik grafische calculator
Enkelzijdig afdrukken
Kladpapier beschikbaar
neen
neen
ja
neen
nee
ja
Uitwerkingen dienen op tentamenpapier gemaakt te worden, iedereen dient zijn werk
en de formulieren met de vragen in te leveren.
1
Mid-Exam Statistics 1 for IBA
March 26, 2010, 3 hours
It is NOT allowed to use a graphical, programmable calculator; only a simple pocket
calculator is allowed. Apart from the cover page, this exam contains three pages.
Always explain your answers.
At the end, you have to hand in:
1) the solutions to the six exercises 2) the sheets with the six exercises
On Monday, March 29, these six questions and their solutions will be posted on
Blackboard.
Exercise 1
Last year’s population of the participants of the Mid-Exam of the course Statistics 1
(March 27, 2009) consisted of 464 students. On that population we consider the
continuous variable X = ‘grade (scale 0 – 10) scored for the Mid-Exam’. The dataset
of X is summarized by way of the following classified relative frequency distribution,
with unequal class widths.
[0, 4] (4, 6] (6, 8] (8, 10] total
class
1
rel frequency 0.1272 0.2565 0.3879 0.2284
a. Determine the frequency density at 2.5.
b. Calculate the mean and the variance of this (relative) frequency distribution.
Let F be the cumulative (relative) distribution function of that frequency distribution.
c. Calculate the value of F at the mean you have calculated in b.
Exercise 2
In this exercise we (among others) ask ourselves the question whether Statistics 1
students with higher grades for the team assignment (TA) in general will also score
higher grades for the Mid-Exam. For a random sample of 340 students from previous
years, we get the following summarized results concerning the variables X = ‘grade
(scale 0 – 10) for the TA’ and Y = ‘grade (scale 0 – 10) for the Mid-Exam’:
𝑥̅ = 6.9750;
𝑦̅ = 6.2782;
𝑠𝑋2 = 1.8841;
𝑠𝑌2 = 4.6194
𝑠𝑋,𝑌 = 0.5513
a. Calculate the correlation coefficient of the x- and y-data. What do you think of
the strength of the linear relationship between y and x?
b. What can be concluded from the fact that the variance of the y-data is so much
larger than the variance of the x-data?
c. Write down the regression equation of y on x.
d. One student in the sample scored 7 for the TA and 6.8 for the Mid-Exam.
Calculate the residual of that student.
e. Using Chebyshev’s rule and the above summarized results, find a lower bound
for the percentage of the students in the sample scoring more than 1.5 for the
Mid-Exam.
2
Exercise 3
In a certain random experiment, the events A, B, C, D and E are considered. The
following facts are known:
𝑃(𝐴) = 0.4; 𝑃(𝐵) = 0.3; 𝑃(𝐶) = 0.3; 𝑃(𝐷) = 0.25; 𝑃(𝐸) = 0.45;
A and E are independent; 𝑃(𝐴 ∩ 𝐷) = 0.05; B and E are disjoint;
the events A, B and C constitute a partition of the sample space .
a.
b.
c.
d.
e.
f.
g.
h.
Calculate 𝑃(𝐴 ∪ 𝐶).
Calculate 𝑃(𝐴 ∪ 𝐸).
Calculate 𝑃(𝐵 𝑐 ∪ 𝐸).
Are A and E also disjoint? Explain.
Calculate 𝑃(𝐷 𝑐 |𝐴).
Calculate 𝑃(𝐴 ∪ 𝐵|𝐸).
Calculate 𝑃(𝐷 𝑐 |𝐴𝑐 ).
Are 𝐵 and 𝐶 independent? Explain.
Exercise 4
We reconsider the so-called paradox of Galilei:
“When three fair dice are thrown, then there are six outcomes that all have 10 eyes in
total: 631, 622, 541, 532, 442 and 433. To obtain a total of 9 eyes, there also are six
outcomes: 621, 522, 531, 441, 432 and 333. Still it turns out that the total 10 happens
more frequently than the total 9. How is this possible?”
Galilei solved the problem for three fair dice. However, his wife Galilinda asked
herself the question whether a similar problem comes up when throwing the die of the
swindler three times. Recall that for this die the random variable X = ‘number of eyes
of one roll’ has the following probability density function f :
1
2
3
4
5
6
x
𝒇(𝒙) 1/12 1/6 1/6 1/6 1/6 1/4
You are asked to solve Galilinda’s problem by considering the outcomes of the
random experiment as ordered 3-tuples, thus renaming Galilei’s six “outcomes” 632,
…, 433 with ten eyes and the six “outcomes” 621, …, 333 with nine eyes as six basic
events of ordered outcomes. For instance, Galilei’s “outcome” 442 then becomes the
basic event {(4, 4, 2), (4, 2, 4), (2, 4, 4)}, which consists of three ordered outcomes.
Below, Y denotes the total number of eyes when the die of the swindler is thrown
three times.
a. Are the ordered outcomes of this random experiment (throwing the swindler’s
die three times) equally likely? Formulate all ordered outcomes of the basic
event 622. What is the probability of this basic event?
b. Calculate the probabilities 𝑃(𝑌 = 10) and 𝑃(𝑌 = 9) by first calculating the
probabilities of each of the 12 basic events.
c. Give your comments regarding the problem raised by Galilinda.
3
Exercise 5
The CEO of a company adopts the following discrete probability density function
(pdf) 𝑓 to model the probability distribution of the random variable X = ‘next year’s
profit (in millions of euro)’:
0
1
2
3
x
−2 −1
𝒇(𝒙) 0.08 0.12 0.20 0.30 0.22 0.08
Below, F denotes the cumulative distribution function (cdf) of X.
a.
b.
c.
d.
Calculate the expectation  of X.
Calculate 𝑃(−2 < 𝑋 ≤ 1) and 𝑃(|𝑋 − | ≤ 1.5).
Consider F on the set of all real numbers. Draw the graph of F.
Calculate the variance of X.
The contract of the CEO promises for the year 2012 the bonus 𝑌 = 0.05𝑋 + 0.1 (in
millions of euro), which depends linearly on the profit X in 2011 and might be
negative.
e. Calculate the expectation and the standard deviation of Y.
f. Calculate the probability that Y will be positive.
Exercise 6
The CEO of another company makes use of a continuous pdf 𝑓 to model the
probability distribution of the random variable X = ‘next year’s profit (in millions of
euro)’:
1 1
+ 𝑥
if − 2 < 𝑥 ≤ 0
4 8
1
if 0 < 𝑥 ≤ 2
𝑓(𝑥) = 4
1 1
− 𝑥
if 2 < 𝑥 < 4
2 8
{0
if 𝑥 ≤ −2 or 𝑥 ≥ 4
1
1
So: 𝑓(𝑥) is the straight (increasing) line 𝑓(𝑥) = 4 + 8 𝑥 on the interval (−2, 0], it is
1
1
constantly equal to 0.25 on (0, 2] and it is a straight (decreasing) line 𝑓(𝑥) = 2 − 8 𝑥
on (2, 4). Outside the interval (−2, 4) it is constantly equal to zero: 𝑓(𝑥) = 0.
a.
b.
c.
d.
Draw the graph of f and check whether f indeed is a pdf.
Calculate 𝑃(𝑋 > 2.5).
Calculate the conditional probability 𝑃(𝑋 < 2.5 | 𝑋 > 2).
Use symmetry to determine 𝑃(𝑋 < 1) and 𝐸(𝑋).
The CEO decides to invest in the year 2012 the amount 𝑌 = 0.1𝑋 − 0.05 (in millions
of euro), which depends linearly on the profit X in 2011 and might be negative.
e. Calculate the (15/16)-quantile of Y; that is, calculate the real number b for
which 𝑃(𝑌 ≤ 𝑏) = 15/16.
4