Download The effects of Bank Prime Loans Rate and Unemployment Rate on

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Categorical variable wikipedia , lookup

Omnibus test wikipedia , lookup

Transcript
The effects of Bank Prime Loans Rate and Unemployment Rate
on the US Public Debt during from 1949 to 1980
Alexandre Catsicas, CBA1, Fordham University, Bronx, New York 10458,
[email protected]
Abstract
The public debt is an issue that every nation has to face. It can be considered as the total
nation’s debts; an indicator of how much public spending is financed by borrowing instead of
taxation, and which at the end, can be indirectly interpreted as a debt of the citizens. In this work,
we will explore the interesting effects of two different socio-economic phenomenon such as the
Bank Prime Loan Rate (X) and the Unemployment Rate (Z), and the, try to see if their
consequences can be related to the Public Debt of the United States (Y) between 1949 and 1980.
I. Introduction
When watching the news or reading the newspaper, it has been difficult not to be exposed
to a topic discussing about the economical crisis we are facing at this very moment. Terms like
“prime loan rate”, “unemployment rate”, as well as “public debt” of a nation, can become issues
that will most likely affect each of us in a more or less direct way. In this paper, we will
concentrate during the years between 1949 and 1980, in order to try to build a model where the
potential correlations between these three terms will be explored, and which at the end will lead
us to the elaboration of the kind of connections that can be established within our three variables.
Regression, as a way that will help us to break down the data into a form that can provide the
desired connections and correlations between the data, will be particularly helpful in our
research. Indeed, it is the regression that will allow us to prove if prime loan and unemployment
rate can have particular consequences on the public debt of a nation. In this work, we will
explore some of the mechanisms of the Economy and try to see if it can be statistically be proved
that the higher the unemployment and bank prime loan rates have been in the US from 1949 to
1980, the higher the public debt of the country has been.
1
This term paper will be written in March/April 2009 in partial fulfillment of Statistical Decision Making course by
Prof. Vinod
II. Model and data
As discussed earlier, the model proposed in this work will evaluate the eventual possibility
of a causal relationship between a nation’s unemployment rate (Z) and bank prime loan (X) on
that nation’s public debt (Y). For this, we shall proceed with a linear regression model which will
be represented by :
Y=β0+β1X+ β2Z+
This formula will represent the true population regression equation, in which  stands for
the unknown errors in our model. Once plugged into R, the estimated model becomes :
Y=b0+b1X+b2Z+residual
Where Y, the public debt of a nation, would be our dependent variable, caused by our
independent variables X and Z, the bank prime loan and the unemployment rate respectively.
The whole set of data that we will analyze in this work has been taken from the following
website : www.economagic.com .
The United-Sates’ public dept (Y) is the amount of money owed by the Federal
Government of the US to holders of US debt instruments. In other words, the public debt is the
amount of money owed by the Government to its creditors, whether they are nationals or
foreigners. The bank prime loan (X) is a term which designates a reference interest rate used by
banks. This interest rate at which banks will lend to their customers is a very important reference,
for it can be used as an index in calculating rate changes to adjustable rate mortgages and other
variable rate implicating short term loans, such as private student loans for example. The
unemployment rate (Z) is commonly defined as the percentage of people in the labor force but
who are unemployed.
While looking at our variables, we can see that two of them have a distribution skewed to
the right. Indeed, Y and X have both a Median that is smaller than their respective Mean.
Whereas Z is skewed to the left with a Median slightly bigger than its Mean. Variable Y as a
mean of $39.32 (in tens of billion), with a variance of 327.47. Variable X has a mean of 5.77%
with a variance of 9.15, and variable Z has a mean of 5.32% with a variance of only 2.07 (all
values have been rounded up to two decimals for this description).
a) Units of measurement
My two independent variables, X and Z, are measured in the same magnitudes. Indeed,
both the unemployment rate and the bank prime loan are expressed in percentage. For practical
reasons, such as an easier reading of the analysis and an easier manipulation of the data, the
liberty to round up each initial value to one decimal has been taken in this work. Regarding our
dependent variable Y, the public debt, the values were initially expressed in hundreds of billion.
Adjustments were made, and each value of Y has been divided by 1’000’000’000. The values
have then been rounded up once again to one decimal in order to facilitate the lecture and the
manipulation. Since not all the variables will be of the same magnitude, we are to expect that the
visual plots will be slightly skewed, but at the end, the regression analysis should remain the
same.
b) Basic descriptive statistics
In this work, all the data analysis has been performed at the 95% confidence interval. The
following description contains value that have all been rounded up to two decimals.
Variable Y : the public debt, has a mean of 39.32, the standard error is 3.2, the median is
31.45, the standard deviation is 18.1, the sample variance is 327.47, the kurtosis is 1.23, and the
skewness is 1.52.
Variable X : the bank prime loan, has a mean of 5.77, the standard error is 0.53, the
median is 4.65, the standard deviation is 3.02, the sample variance is 9.15, the kurtosis is 1.54,
and the skewness is 1.32
Variable Z : the unemployment rate, has a mean of 5.32, the standard error is 0.25, the
median is 5.5, the standard deviation is 1.44, the sample variance is 2.07, the kurtosis is -0.75,
and the skewness is 0.21.
From the data analysis and descriptive statistics, we can see that the multiple R=0.8978,
R =0.806, and adjusted R2 =0.7926 are all under one, as they should be (by definition), but also
pretty high. The adjusted R2 suggests that our model is about 79.26% successful in explaining
the United-States public debt. See Appendix of this paper.
2
y
x
z
nobs
32.000000 32.000000 32.000000
NAs
0.000000
0.000000
0.000000
Minimum
25.300000
2.000000
2.900000
Maximum
90.800000 15.300000
8.500000
1. Quartile
27.350000
3.800000
4.250000
3. Quartile
43.475000
7.075000
6.025000
Mean
39.318750
5.765625
5.315625
Median
31.450000
4.650000
5.500000
Sum
1258.200000 184.500000 170.100000
SE Mean
3.198977
0.534714
0.254401
LCL Mean
32.794393
4.675068
4.796770
UCL Mean
45.843107
6.856182
5.834480
Variance
327.470605
9.149425
2.071038
Stdev
18.096149
3.024802
1.439110
Skewness
1.521094
1.319349
0.212147
Kurtosis
1.126328
1.541404 -0.754337
We can also test the Correlation Coefficients :
For X and Y, the Correlation is 0.8655, with a p-value of 1.620e-10. The p-value being so
small, we can reject the null hypothesis and therefore conclude that there is a good correlation
between the public debt and the bank prime loan.
For Z and Y, the Correlation is 0.4998, with a p-value of 0.003586. Since the p-value is
quiet small, we can also reject the null hypothesis, but this time the correlation between the
public debt and the unemployment rate will not be as strong as between the public debt and the
prime loan.
For X and Z, the Correlation is only 0.316, with a p-value of 0.07805. Since the p-value
remains small, we can also reject the null hypothesis. But we can see that the correlation between
the unemployment rate and the bank prime loan is the smallest of our three relationships.
y
x
z
y 1.0000000 0.8655309 0.4997698
x 0.8655309 1.0000000 0.3160369
z 0.4997698 0.3160369 1.0000000
c) Outlier detection for X, Y, and Z
IQR=Q3-Q1
LOW=Q1-1.5*IQR
UPPER=Q3+1.5*IQR
The outlier detection limits for Y are :
Q1-1.5*(inter quartile range)= 3.15500000000001
Number of outliers below it are= 0
Q3+1.5*(inter quartile range)= 67.675
Number of outliers above it are= 4
Actual values above the upper limit are: 70, 77.2, 82.7, 90.8
The outlier detection limits for X are :
Q1-1.5*(inter quartile range)= -1.1125
Number of outliers below it are= 0
Q3+1.5*(inter quartile range)= 11.9875
Number of outliers above it are= 2
Actual values above the upper limit are: 12.7 and 15.3
The outlier detection limits for Z are :
Q1-1.5*(inter quartile range)= 1.5875
Number of outliers below it are= 0
Q3+1.5*(inter quartile range)= 8.6875
Number of outliers above it are= 0
Even though we do have four outliers in the variable Y, and two outliers in the variable
X, the fact that our adjusted R2 is so high means that the outliers should not provoke any
misreading in our future results.
Scatter Plots:
The relationships between our two independent variables (X and Z) and the dependent
variable (Y) will be shown in the following scatter plots.
Figure1 shows us the relationship between the public debt and the bank prime loan. The
positive correaltion appears pretty strong, with a regression line of almost 45 degres. Since our
two varaibles are increasing at an almost similar rate, we can conclude that there is a significant
relationship between Y and X.
80
70
60
50
40
30
Public Debt (in tens of billion)
90
Fig. 1: Public Dept versus Bank Prime Loan Rate
2
4
6
8
10
Bank Prime Loan Rate
12
14
Figure2 shows us the relationship between the public debt and the unemployment rate.
The scatter plott on itself is quiet spread and doesn’t seem to indicate a lof of information. But
once the regression line is added, we can see positive relationship between Y and Z. But this
positive correlation still remains smaller than the one for Y and X.
80
70
60
50
40
30
Public Debt (in tens of billion)
90
Fig. 2: Public Dept versus Unemployment Rate
3
4
5
6
Unemployment Rate
7
8
Figure 3 will depict the relationship between the residuals against the bank prime loan
rate, and Figure 4 will depict the relationship between the residuals against the unemployment
rate. Ideally, there should be no relations in the plots for the model to be significant, which is our
case.
0
-10
Residuals
10
20
Fig. 3: Residuals against Bank Prime Loan Rate
2
4
6
8
10
12
14
Bank Prime Loan Rate
0
-10
Residuals
10
20
Fig. 4: Residuals against Unemployment Rate
3
4
5
6
Unemployment Rate
7
8
Our last plot, Figure 5, will depict the actual Y (the public debt) compared to the fitted Y.
Although we do not have a perfect linear correlation, we can see that the different values still
seem to follow a certain constant pattern. We have to remember that our model is trying to
explain a very complicated macroeconomic mechanism, a mechanism which as the Figure 5
tends to show us, will involve much more independent variables than the only two that we are
working with. Nevertheless, let us remark that an almost perfect virtual 45 degree line can still be
perceived in this model.
60
40
20
Fitted Y
80
Fig. 5: Fitted against Actual Y
30
40
50
60
Actual Y
70
80
90
Our Fitted Regression equation is :
Y=b0+b1X+b2Z
Where b0=-4.5960, b1= 4.7029, and b2= 3.04
Once these figures are plugged into the equation we get :
Y=- - 4.596 + 4.7029X + 3.1604Z + residual
Intercept
X
Z
Coefficients
Standard Error
-4.5960
5.7739
4.7029
0.5157
3.1604
1.0840
t Stat
-0.796
9.119
2.915
Pr(>|t|)
0.43250
5.13e-10
0.00678
The “Coefficients” column gives us the regression intercept and slope coefficients b1and
b2. The slope b1 measures the change in Y from a unit change in X. The slope b2 measures the
change in Y from a unit change in Z.
Regression Statistics
Multiple R Squared
Adjusted R Squared
Standard Error
Observations
0.806
0.7926
8.241
32
As described earlier, the “Regression Statistics” are in rather high, confirming the
reasoning behind our model. Both of our Multiple R are under 1.0, as they have to be. The
standard error is 8.241. The degree of freedom is 29 and the regression degree of freedom is 2.
Now in order to determine how close a line is to the data, it becomes necessary to
measure the predicted value of Y given X, versus the actual value of Y. The difference between
the observed actual Y value, and the predicted Y value is what call the model error. Therefore,
the estimated model will change to :
Ŷ =b0+b1X+b2Z+e
With the following fitted value :
Ŷ= - 4.596 + 4.7029X + 3.1604Z + 8.241
III. Statistical testing of the model
In order the validity of our model, we will conduct two T-tests and one F-test. For the Ttests, each of the regressor variables (X and Z) will be considered in relation to the dependent
variable (Y). The F-test will determine if both X and Z against Y can measure the overall fit of
data in describing the relationship between the bank prime loan rate and the unemployment rate.
Y Intercept
Prime Loan
Unemployment
Coefficients
-4.5960
4.7029
3.1604
Standard Error
5.7739
0.5157
1.0840
t Stat
-0.796
9.119
2.915
P-value
0.43250
5.13e-10
0.00678
Lower 95%
-16.405
-3.648
0.9433
Upper 95%
7.213
5.758
5.377
The first t-Test :
Null hypothesis H0: β1 = 0
Alternative hypothesis: β1≠ 0
Rule : Reject the Null if | t | > 2.045
The observed value of the t-statistic for β1 is 9.119 at a 29 DF. The critical value defines
the rejection region. If the observed value is greater than the critical value, then the null
hypothesis is rejected. Since the critical value at a 95% confidence level is 2.045, we can
therefore reject the null. We can conclude that the bank prime loan (X) has definitely had a
statistically significance on the public debt of the United-States between 1949 and 1980.
The second t-Test :
Null hypothesis H0: β1 = 0
Alternative hypothesis: β1≠ 0
Rule : Reject the Null if | t | > 2.045
The observed value of the t-statistic for β2 is 2.915 at a 29 DF. The critical value defines
the rejection region. If the observed value is greater than the critical value, then the null
hypothesis will again be rejected. Since the critical value at a 95% confidence level is 2.045, we
can therefore reject the null. We can conclude that the unemployment rate (Z) has also had a
statistically significance, but less notable this time, on the public debt of the United-States
between 1949 and 1980.
The F-Test :
The F-Test indicates whether or not the overall model is useful as a tool for explaining
the theory of our model. As seen previously, the null hypothesis states that the model is not
useful, whereas on the other hand, the alternative states that both of our independent variables
are useful.
Null hypothesis: H0: β1 = β2 = … βk = 0
Alternative hypothesis: Ha: at least some βi ≠ 0
For our model, the F-value is 60.24 and the p-value is 4.71e-11. Since we are testing our
model at the 95% confidence interval in which  = 0.05, we can reject the null because  > p.
Also, we could just have compared the F statistic to the critical value 3.328 in order to test our
hypothesis. Since the observed value of F is larger than the critical value, we can therefore reject
the null. We can conclude that both of our two independent variables are significant with positive
slope. Our model is therefore rather significant in explaining the effects of the bank prime loan
and the unemployment rate on the public debt of the United-States between 1949 and 1980.
Conclusion and final remarks :
The conduction of this work has been very interesting on several different points. The
Economy in which we are living in appears to be a very complex notion, to which it seems
difficult to attribute a defined spectrum of causes and effects. This work has allowed to establish
quiet strong links between three different factors. The statistical evidences that we have
discovered throughout this entire analysis, showed us the repercussions of a social factor such as
the unemployment rate, as well as the effects of a financial factor such as the bank prime loan
rate on the public dept of a nation. In both cases, we saw that an increase of our independent
variables X and Z caused an increase of the public dept of the United-States from 1949 to 1980.
Of course our model doesn’t allow us to clearly explain how these two independent variables act
on our dependent variable, but at least the model showed us that positive correlation exists.
If look back at our R2 (0.7926), our model is almost 80% effective in explaining the
consequences on the public debt. And here is a way where we can try to explain the
consequences of (X) and (Z) in the macroeconomic sphere :
According to our research, we can observe that the bank prime loan rate (X), had a t-Test that
could definitely allow us to reject the hypothesis stipulating that it had no effect on the public
debt. Which after all makes a certain sense when we remember one of the basic principles of
macroeconomics where the equilibrium state is defined by : Y = C + I + G + (EX-IM). Now we
can see that a decrease in C (Consumption Expenditure / Spending of Households) and a
decrease in I (Domestic Investment / Spending by Firms) will cause a decrease on Y. If we go
back to our variable (X), an increase of the bank prime loan rate could have the following effects
: if the interest rates at which households and firms usually borrow money increases, the general
consumption C, and the general investment I in the economy will decrease. But since our
equilibrium must be maintained, one way will be to increase G (Government expenditures). But
if G increases, it becomes inevitable that the public debt of the country will also increase due to
new expenses. The public dept is therefore clearly affected by our variable (X), the bank prime
loan rate. We could therefore imagine that if a country has a public debt becoming too important,
one solution would be to decrease the prime rates in order to give rise to a general increase of
households’ consumption and firms’ investment. Like this, our equilibrium will be maintained
without any increase of G.
Now regarding our second independent variable (Z), the unemployment rate, we have seen that
correlation was not as explicit as with the variable (X). Indeed, even if we have been able to
reject the null hypothesis with the help of the t-Test, the effects of the unemployment rate are a
little bit less noticeable than the bank prime loan rate. But at the end we can still establish a
positive correlation between this unemployment rate and our public debt. If you look again at
our equilibrium formula, we can deduct that an increase in the unemployment rate, will cause
households to consume less (with no job there is no money to spend), if the consumption C
decreases, one of the alternative to maintain the equilibrium could be taken by the government G
to spend more. Which once again, will cause an increase in the public debt.
At the end, our model can let us imagine that a good way to decrease the public debt of a
country by still maintaining the equilibrium state, would be a combination of a decrease in bank
prime loan rates, and a general increase in hiring the unemployed force of the population.
Appendix
Y=c(25.3, 25.7, 25.5, 25.9, 26.6, 27.1, 27.4, 27.2, 27.1, 27.6, 28.5, 28.6, 28.9, 29.8, 30.6,
31.2, 31.7, 31.9, 32.6, 34.7, 35.4, 37.1, 39.8, 42.7, 45.8, 47.5, 53.3, 62.0, 70.0, 77.2, 82.7, 90.8)
X=c(2.0, 2.1, 2.6, 3.0, 3.2, 3.1, 3.2, 3.8, 4.2, 3.8, 4.5, 4.8, 4.5, 4.5, 4.5, 4.5, 4.5, 5.6, 5.6, 6.3, 8.0, 7.9, 5.7, 5.2, 8 .0,
10.8, 7.9, 6.8, 6.8, 9.1)
Z=c(7.6, 5.2, 3.3, 3.0, 2.9, 5.6, 4.4, 4.1, 4.3, 6.8, 5.5, 5.5, 6.7, 5.6, 5.6, 5.2, 4.5, 3.8, 3.8,
3.6, 3.5, 5.0, 6.0, 5.6, 4.9, 5.6, 8.5, 7.7, 7.1, 6.1, 5.9, 7.2)
x
y
z
Min. : 2
Min. :25.3
Min. : 2.9
1st Qu.:3.8
1st Qu.:27.35
1st Qu.: 4.25
Median :4.65
Median :31.45
Median : 5.5
Mean :5.77
Mean :39.32
Mean : 5.32
3rd Qu.:7.08
3rd Qu.:43.48
3rd Qu.: 6.03
Max. :15.3
Max. :90.8
Max. :8.5
Coefficients:
Estimate Std. Error t value Pr(>|t|)
(Intercept) -4.5960 5.7739 -0.796 0.4325
x
4.7029 0.5157 9.119 5.13 e-10 ***
z
3.1604 1.0840 2.915 0.00678 **
--Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' '
1
Residual standard error: 8.241 on 29 degrees of
freedom
Multiple R-Squared: 0.806, Adjusted R-squared:
0.7926
F-statistic: 60.24 on 2 and 29 DF, p-value: 4.71e-11
Residuals:
Min
1Q Median
3Q
Max
-16.127 -4.917 -1.259 3.426 20.996
Response: y
Df Sum Sq Mean Sq F value Pr(>F)
x
1 7605.0 7605.0 109.2567 3.59e-11 ***
z
1 577.2 577.2 8.2925 0.007548 **
I(x^2) 1 20.4 20.4
0.2929 0.592629
Residuals 28
1949
69.9
--Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' '
1
> confint(reg1)
2.5 %
(Intercept) -16.4049585
x
3.6480938
z
0.9433127
97.5 %
7.212971
5.757740
5.377483
data: x and y
t = 9.4652, df = 230, p-value = 1.620e-10
alternative hypothesis: true correlation is not equal to
0
95 percent confidence interval: 0.7402478
0.9327238
sample estimates:
cor 0.865531
data: z and y
t = 3.1603, df = 30, p-value = 0.003586
alternative hypothesis: true correlation is not equal to
0
95 percent confidence interval: 0.1829596 0.7225477
sample estimates:
cor 3160369
data: x and z
t = 1.8245, df = 30, p-value = 0.07805
alternative hypothesis: true correlation is not equal to
0
95 percent confidence interval: -0.03670165
0.59874863
sample estimates:
cor 0.3160369
CHECKLIST
PART 1 your name and title of the paper.
1) Is the title sufficiently descriptive? Is it typed centered, bold, properly capitalized, in large font and along the top
line?
0.5 points. I claim full credit for this 
2) Is there your Name, affiliation and E-mail along second and third line?
0.5 points. I claim full credit for this 
3) Is there the (*) Footnote with date, the phrase “partial fulfillment,” etc.
0.5 points. I claim full credit for this 
4) Is the Abstract LESS THAN 125 words and descriptive of the model? (Please do give data sources in the initial
version due march 9 but delete it from the final version of the abstract as attached to the paper due March 23, where
the data information goes in the data appendix)
2.5 points. I claim full credit for this 
5) Are there exactly four numbered sections with bold titles present? How many pages? (Cut a point if raw data are
NOT attached in the program and data appendix).
0.5 points. I claim full credit for this 
Points earned in PART 1 so far are:
4.5 Out of 4.5 .
PART 2
6) Are data sources mentioned? 0.5 points I claim full credit for this 
7) Are descriptive stats including correlation coefficients mentioned?
0.5 points I claim full credit for this 
8) Is outlier detection done for Y, X and Z variables?
1 point I claim full credit for this 
9) Are 4 figures present? Fig 1 has Y vertical &X horizontal (Y vs. X), Fig 2 has Y vs. Z,
Fig 3 has residuals vs X and Fig.4 has residuals vs. Z. Note that if the residuals do not show any particular pattern,
the model as a whole is good.
0.5 points I claim full credit for this 
10) Are the figures labeled correctly? 0.5 points I claim full credit for this 
11) Are the Correct variables represented on the vertical axis ?
0.5 points I claim full credit for this 
12) Are all four figures explicitly discussed in the text? 0.5 points I claim full credit for this 
Points earned in PART 2 so far are:
3.5 Out of 4
PART 3
13) Is regression equation with equation numbers reported? e.g.:
Y = 0.0123 + 2.3345 X  0.3567 Z + residual (numbers will be different for each)
14) Is it centered? It should be centered.0.5 points I claim full credit for this 
15) Are there two t tests ? Did the student correctly look up critical t values from the t table in the textbook or from
EXCEL functions?
16) The NULL and ALTERNATIVE hypotheses should be correctly stated for each test.
1.5 points I claim full credit for this 
17) Is there one F test? Did the student correctly look up critical values for F from the F table in the textbook? 1
point I claim full credit for this 
18) Are the t tests and F tests discussed somewhere in the text?
0.5 points I claim full credit for this 
19) You should mention possible future work to improve the model in the conclusion section.
Is there a punch line for conclusion? Usually this will comprise the gist of the following 3 things:
The X (spell out the name) variable has (has not) a statistically significant effect on Y (Spell out name).
The Z (spell out the name) variable has (has not) a statistically significant effect on Y (Spell out name).
The overall model is (is not) statistically significant. Note that possible future work is not called the punch line.
1
point I claim full credit for this 
Total Points earned in PART 3 are:
3 Out of 4.5
Grand Total of All points claimed are:
11 Out of 13. Auditor to state his itemized scores in red pen and sign.
Grand total of points given by the auditor are:
Out of 13