Download Chapter 7 - Lone Star College

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
Regression
Regression
 A regression line attempts to predict one variable
based on the relationship with another variable (its
correlation).
 The regression line is placed so that the error (distance
from the line to each data point) is minimized.
 The placement of the regression line minimizes the
total squared predictive error. (That way there are no
negative values.)
Chapter 7-Regression
2
Prediction
 What is predictive error?
 The amount of error associated with the placement of a
best fitting regression line.
 The placement of the regression line:


Minimizes the total predictive error, and
Minimizes the total squared predictive error
Chapter 7-Regression
3
Progress Check 7.1
20
15
10
5
1
5
10
15
20
a) Predict the approximate rate of inflation, given an unemployment rate of 5 percent.
b) Predict the approximate rate of inflation, given an unemployment rate of 15 percent.
Chapter 7-Regression
4
Estimating regression and relationship
 Witte web demonstration
Chapter 7-Regression
5
Least Squares Regression Equation
 Y = bX + a
 Where Y = the predicted value
 X = a known value
 b = slope of the line
 a = Y-intercept
 Ability to predict an outcome for a variable, given a
regression line and a value of a second paired variable.
Chapter 7-Regression
6
Calculating Least Squares Regression
Page 159
Calculate SSx, SSy, and r for the data.
2. Substitute numbers into the formula below
SSy
(r)
b=
SSx
3. Find the mean for X and mean for Y
4. Solve for a
1.
√
a = Y-(b)(X)
5. Solve for the predicted value
Y’ = (b)(X) + a
Chapter 7-Regression
7
Standard error of estimate
 This represents a special kind of standard deviation
that reflects the magnitude of predictive error.
 It is the difference between known values and
predicted values based on the regression equation.
 It is how much we over/under estimate a value based
on the regression equation, which is related to the
strength of the correlation.
Chapter 7-Regression
8
Calculation of standard error of estimate
Page 162
 Square root of the quantity of sum of squares for Y
times one minus r squared divided by n minus 2.
 (n-2 because 2 paired variable results in n-2 degrees of
freedom)
SSy(1-r2)
 Sy|x=
n-2
√
Chapter 7-Regression
9
Assumptions
 Use of regression equation requires that the
underlying relationship be linear.
 Use of the standard error of estimate assumes that
except for chance, the dots in the original scatterplot
will be dispersed equally about all segments of the
regression line. (homoscedasticity)
Chapter 7-Regression
10
Progress Check 7.2 Page 160-1
Educational Level (X)
Weekly Reading Time (Y)
X = 13
Y=8
SSx = 25
SSy = 50
R = .30
a) Determine the least squares equation for predicting
weekly reading time from educational level.
b) Faith’s education level is 15. What is her predicted
reading time?
c) Keegan’s educational level is 11. What is his predicted
reading time?
Chapter 7-Regression
11
Calculate Standard Error of Estimate
 Calculate the Standard Error of Estimate using the
data in 7.2 on page 160
Educational Level (X)
Weekly Reading Time (Y)
X = 13
Y=8
SSx = 25
SSy = 50
R = .30
Chapter 7-Regression
12
Correlation, Prediction, Error
X
Y
5
4
4
6
1
4
2
3
4
5
2
4
X2
Y2
X*Y
X - mean number of cases of influenza in a month among employees
Y - mean number of bacteria x 1 million on a door knob on front door
Chapter 7-Regression
13
Regression toward the mean
 Extreme scores on multiple trials will tend toward the
mean.
 The regression fallacy: accepting that regression
toward the mean is real, rather than a chance effect.
Chapter 7-Regression
14
Regression toward the mean
 Tversky and Kahnemann
 Study of Israeli Air Force pilots in 1974
 Some trainees were praised after good landings, while
others were reprimanded after bad landings.
 On their next landings, praised trainees did more
poorly and reprimanded trainees did better.
 Conclusion:
 Praise hinders but a reprimand helps performance!
Chapter 7-Regression
15