Download Exam of Statistics 6C

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

History of statistics wikipedia , lookup

Statistics wikipedia , lookup

Transcript
1
Università di Venezia - Corso di Laurea Economics & Management
Exam of Statistics 6C - Prof. M. Romanazzi
May 31st, 2016
Full Name
Matricola
• Total (nominal) score: 30/30 (2/30 for each question).
• Pass score: 18/30. Highest (29, 30) grades must be confirmed by oral discussion.
• Pocket calculator and portable computer are allowed, textbooks or class notes are not.
• Detailed solutions to questions must be given on the draft sheet (foglio di brutta copia);
final answers/results must be copied on the exam sheet, beside the small squares.
2
Exercise 1 The stem-and-leaf display in Table 1 shows the distribution of a sample of n = 80 ENI stock price
variations (%) from 1998 to 2013.
-5
-4
-3
-2
-1
-0
0
1
2
3
n = 80
−1|2
Pn is read −1.2%
xi = −14.632
Pi=1
n
2
i=1 xi = 228.503
60
7
41
5310
8544332211
99998888776666444333211
0011113355555566777788
14556678
023499
02
Table 1: Stem-and-leaf of ENI stock price variations (%).
Q1 Compute the percentiles of the order 25% and 75%.
x0.25 = (x(20) + x(21) )/2 = −0.9, x0.75 = (x(60) + x(61) )/2 = 0.7
Q2 Consider a variation equal to −3.0%. What is the position of such a data with respect to the
observed sample (central, left or right tail, outlier)?
Left tail, but not an outlier as Tukey’s lower fence is −3.3.
Q3 Compute the sample mean and the sample standard deviation of ENI stock price variations.
x̄n = −0.183, sX = 1.691
Q4 Derive from the stem-and-leaf display the % of data in the interval x̄n ∓ 2sX . How does it
compare with the corresponding % from the empirical rule of areas under normality?
% of data in x̄n ∓ 2sX : 76 observations, corresponding to 95%
Comparison with empirical rule: result very close to empirical rule.
Exercise 2
1. Is there a relation between ENI stock price variations (X, %) and FTSE MIB index variations 1
(Y , %)? Figure 1 shows the joint distribution of X and Y in the sample of 80 units of Ex. 1. Table
2 gives the summary statistics.
Q1 Compute means, standard deviations, covariance and correlation and report the results in
Table 2.
See Table 2.
Q2 Count from the scatter plot the number of points with both X and Y lower than −3.
2 points.
Q3 Compute the coefficients of the least square line ŷ = α̂ + β̂x and the goodness-of-fit measure.
α̂ = 0.185 , β̂ = 0.689
GOF: R2 = (rX,Y )2 = 0.605
1 FTSE
MIB index is a summary index of Milan stock market.
n
80
x
−0.183
Pn
xi
−14.632
y
0.0594
i=1
Pn
i=1
4.751
sX
1.691
yi
Pn
x2i
228.503
sY
1.498
i=1
Pn
yi2
177.530
sX,Y
1.970
i=1
Table 2: Summary statistics.
Pn
i=1 xi yi
154.792
rX,Y
0.778
4
3
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
0
●
●
●
●
●●●
●
●
● ●
●
● ●
●●
● ●●
●
●
● ●●● ●●
● ●●●●
●
●
●
●
●●●
●
●
● ●
●
●
● ●
●
●●
●
●
●
●
●
●
●
●
−2
●
●
●
●
−4
FTSE MIB index variation (%)
2
●
−6
●
−6
−4
−2
0
2
4
ENI stock price variation (%))
Figure 1: Scatter plot.
Q4 Compute the prediction of FTSE MIB index variation for a variation of ENI stock price equal
to +1% and evaluate the corresponding prediction error.
√
Prediction: ŷ(x = 1) = 0.874, Prediction error: se = sY 1 − R2 = 0.941
Exercise 3 Suppose to sample at random n values X1 , ..., Xn from a standard normal distribution.
Q1 Put n = 20. What is the probability that exactly 10 observations out of 20 are positive?
Due to properties of standard normal distribution, the problem
amounts to compute the
20
probability of 10 heads when a fair coin is flipped 20 times, 20
(0.5)
= 0.1761971, using the
10
binomial distribution.
Pn
Q2 Put again n = 20 and let Tn = =1 Xi . What are the expectation and the standard deviation
of Tn ?
√
√
Since the standard normal has µ = 0 and σ = 1, E(Tn ) = nµ = 0, SD(Tn ) = nσ = 20 '
4.472.
Pn
Q3 Put n = 100 and let X̄n = ( =1 Xi )/n. Evaluate the probability of the event |X̄n | < 0.05.
√
Using sampling theory, E(X̄n ) = µ = 0 and SD(X̄n ) = σ/ n = 1/10. By the CLT,
P (|X̄n | < 0.05) = P (−0.05 < X̄n < 0.05) ' P (−0.5 < N (0, 1) < 0.5) = 0.383.
4
Exercise 4 Consider again the sample of FTSE MIB index variations in Ex. 2 and denote with µ the average
variation in the reference population.
Q1 What is the point estimate of µ and what is the measure of the sampling error?
d Ȳn ) = sY /√n = 0.167.
Point estimate of µ: ȳn = 0.0594, measure of sampling error: SE(
Q2 Derive the confidence interval for µ (confidence level 95%).
d Ȳn ) = (−0.268, 0.387)
ȳn ∓ z0.975 SE(
Q3 Consider the test H0 : µ = 0, H1 : µ 6= 0 (significance level 5%). What is the observed
value of the test statistic and what is the rejection region of the test? Do the results involve
rejection of H0 ? Explain carefully.
d Ȳn )|H0 = 0.356, Rejection region: |T S| ≥ z0.975
Observed value of test statistic: (ȳn −µ)/SE(
The results do not involve rejection of H0 because the observed value of test statistic T S
does not belong to the rejection region of the test. In fact, once the sampling error is taken
into account, the observed value of sample average is fully consistent with the null hypothesis
µ = 0.
Exercise 5 Consider again the sample of ENI stock price variations in the stem-and-leaf display of Table 1 and
denote with pA the proportion of positive variations in the reference population. We want to test
the null hypothesis H0 : pA = 0.5 against the alternative H1 : pA 6= 0.5 (significance level 5%).
Compute the observed value of the test statistic and describe the rejection region of the test. What
is the final decision, rejection or non rejection of H0 ?
From Table 1, the point estimate of pA is p̂A = 38/80 = 0.475
Observed value of test statistic: (p̂A − pA )/SE(p̂A )|H0 ' −0.447, Rejection region: |T S| ≥ z0.975
Final decision: Do not reject H0