Download Excel Correlation Assignment - Shoreline Community College

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
Don S. Christensen
Psyc 209
Shoreline Community College
Psychology Department
HOMEWORK #1: EXCEL CORRELATION ASSIGNMENT (22 Points)
Due Date: Thursday October 13th in class
Purpose:
This assignment will increase your understanding of the concept of “correlation” and provides practice using
Microsoft Excel to enter data, calculate descriptive statistics, calculate correlations, and create and interpret scatter
plots.
Suggested Time Table:
Information relative to this assignment will be covered in week 2, so I don’t expect you to work on it right away.
You may already be familiar with Excel. If not, don’t worry. One of the goals of this assignment is to introduce
you to Excel. However, the assumption of this assignment is that students have not calculated correlations or
created scatterplots using this application.
Late Assignments:
Assignments handed in after the due date listed above will lose 2 points per day late.
Suggested Formatting:
Please staple together the following:
Page 1: Excel printout of Data Set1 with your Excel computations and scatterplot (see next page of this
assignment for sample output).
Page 2: Excel printout of Data Set 2 with your Excel computations and scatterplot.
Page 3: Hand calculation of correlation for Data Set 1 and short answers to Data Set 1 question 9 and Data Set
number 2 question 9 (please use Calculation and Answer Worksheet provided on the last page of the
assignment)
Don S. Christensen
Psyc 209
Shoreline Community College
Psychology Department
Don S. Christensen
Psyc 209
Shoreline Community College
Psychology Department
DATA SET 1
General Scenario: Dr. Truong wants to examine the relation between the amount of television that people watch and how
happy they are. Ten adults participate in her study. Each participant records the total number of hours of TV that he or she
watches for a particular week. In addition, at the end of the week, each participant completes a questionnaire that measures the
person’s overall happiness. The questionnaire scores can range from 0(extremely unhappy) to 10 (extremely happy). The
results from Dr. Truong’s experiment are listed below.
Participant Number
1
2
3
4
5
6
7
8
9
10
Hours of TV Watched
8
5
3
3
0
1
6
4
5
5
Happiness Score
4
6
9
8
10
10
5
8
5
5
Directions for Data Set 1:
1.
Enter the data from the table above into an Excel spreadsheet. Label row 1 of columns A, B, and C with “Participant
Number,” “Hours of TV Watched,” and “Happiness Score,” respectively.
2. In the “Participant Number column,” in cells 14 through 18 respectively, type in the following terms: mean, median, mode,
variance, standard deviation, and Pearson r.
3. Use Excel to compute the mean, median, mode, variance, and standard deviation (round to two decimal places). NOTE:
Excel has a several ways of calculating the standard deviation & variance. Make sure that you have selected the
correct ones by calculating the standard deviation of Hours of TV Watched by hand and compare your calculation
with Excel’s. If it doesn’t match, change your formula.
4. Use Excel to compute the Pearson correlation coefficient between hours of TV watched and happiness (round to two
decimal places).
5. Use Excel to create a scatterplot for these two variables. Label the X-axis “Hours of TV Watched” and the Y-axis
“Happiness Score,” and make sure that the data along each axis correspond to the correct variable (e.g., make sure that
data plotted on an axis matches your label for it).
6. Enter your name as part of the X-axis label. To do this, create the basic table first. After you have positioned it in your
spreadsheet, click on the label (once to select it) and insert your cursor after the last letter. Hit “Return” and then type your
name.
7. Add a trendline to your chart. This will visually illustrate the line the correlation is attempting to fit to the data. To do this,
select your chart by clicking on it once. The “Chart” menu will appear above at the top of the page, in between the “Tools”
and “Window” menus. The Chart menu will not appear if you don’t select your chart. You will see the “Table” menu
instead and you won’t be able to create a trendline. With your chart selected, click on (or pull down) the “Chart” menu
and select “Add Trendline…” You want a linear trendline, which is usually the default setting. If “Linear” is already
highlighted, click “OK” and a trendline line will be added to your chart.
8. Print out a page containing your calculations and scatter plot (7 points).
9. Using the scatterplot and correlation, describe the association between the two variables in a sentence or two. (Answer this
question in the space provided on the Calculation and Answer Worksheet.)
10. Calculate the Pearson correlation coefficient by hand, using the definitional formulas provided with this assignment.
Please use the work sheet provided. You may use a calculator but please SHOW YOUR WORK. Make sure your
calculation matches the number calculated by Excel.
Don S. Christensen
Psyc 209
Shoreline Community College
Psychology Department
DATA SET 2
General Scenario: Dr. Rouke wants to examine the relation between arousal and performance on an athletic task: freethrow
shooting in basketball. Ten adults have agreed to participate in his study. Prior to shooting freethrows, each participant
completes a physiological test that measures feelings of arousal. Scores range from 0 (low arousal) to 9 (high arousal). After
this arousal test, each participant shoots 10 freethrows and the number of baskets made is recorded. The results from Dr.
Rouke’s study are listed below.
Participant Number
1
2
3
4
5
6
7
8
9
10
Arousal Score
6
2
5
9
2
3
1
8
7
4
Freethrow Performance
6
4
6
2
3
5
2
4
5
6
Directions for Data Set 2;
1.
2.
3.
4.
5.
6.
7.
8.
9.
Enter the data from the table above into an Excel spreadsheet. Label columns A, B, C with “Participant Number,”
“Arousal Score,” and “Freethrow Performance” respectively.
In the Participant Number column, in cells 14 through 18 respectively, type in the following terms: mean, median, mode,
variance, standard deviation, and Pearson r.
Use Excel to compute the mean, median, mode, variance, and standard deviation (round to two decimal places).
Use Excel to compute the Pearson correlation coefficient between arousal and freethrow performance (round to two
decimal places).
Use Excel to create a scatterplot for these two variables. Label the X-axis “Arousal” and the Y-axis “Freethrow
Performance,” and make sure that the data along each axis correspond to the correct variable (e.g., make sure that data
plotted on an axis matches your label for it).
Enter your name directly underneath the X-axis label (see Data Set 1 for instructions on how to do this).
Add a trendline to your chart (see Data Set 1 for instructions on how to do this).
Print out a page containing your calculations and scatter plot (7 points).
Based solely on the Pearson correlation coefficient, what would you conclude about the relation between arousal and
freethrow performance? Based on the scatterplot, what conclusion do you reach? Why does the Pearson statistic provide
an inaccurate picture in this particular example? (Answer this question in the space provided on the Calculation and
Answer Worksheet.)
Don S. Christensen
Psyc 209
Shoreline Community College
Psychology Department
Sample Pearson Correlation Coefficient Calculation
Legend
N = number of participants (in this example, N = 10 participants)
X = variable corresponding to the numbers in column X (e.g., “self-esteem” measured on 0-10 scale)
Y = variable corresponding to the numbers in column Y (e.g., “creativity” measured on a 0-10 scale)
Xi = any individual number in column X (e.g., 8 or 7 or 6….)
Yi = any individual number in column Y (e.g., 9 or 6 or 4…)
MX = the mean or average of variable X
MY = the mean or average of variable Y
SDX = the standard deviation of variable X
SDY = the standard deviation of variable Y
Pearson r = Covariance  (SDXSDY) (Steps 1-4 below will show you how to calculate the covariance)
Step 1: Calculate the Mean of X and the Mean of Y.
Mean of X = MX = 60/10 = 6; Mean of Y = MY = 50/10 = 5
Step 2: Calculate the variance and the standard deviation of X and Y.
Variance of X = 54/10 = 5.4
Standard Deviation of X = SDX = 2.32
Variance of Y = 64/10 = 6.4
Standard Deviation of Y = SDY = 2.53
Step 3: Calculate the Covariance of X and Y.
CovarianceXY = Average cross-product = (Sum of Column H)  N = 49/10 = 4.9
Step 4: Calculate the Pearson r.
Pearson r = Covariance  (SDXSDY) = 4.9  (2.322.5) = 0.83
A
Participant
B
X
(self-esteem)
C
Y
(creativity)
D
(Xi-MX)
E
(Xi-MX)2
F
(Yi-MY)
G
(Yi-MY)2
H
(Xi-MX)(YiM Y)
1
2
3
4
5
6
7
8
9
10
8
7
6
5
6
1
8
3
9
7
9
6
4
2
6
2
8
1
6
6
2
1
0
-1
0
-5
2
-3
3
1
4
1
0
1
0
25
4
9
9
1
4
1
-1
-3
1
-3
3
-4
1
1
16
1
1
9
1
9
9
16
1
1
24=8
11=1
0  -1 = 0
-1  -3 = 3
01=0
-5  -3 = 15
23=6
-3  -4 = 12
31=3
11=1
Sum
Average
60
6.0
50
5.0
0
0.0
54
5.4
0
0.0
64
6.4
49
4.9
Don S. Christensen
Psyc 209
Shoreline Community College
Psychology Department
Calculation and Answer Worksheet
Data Set 1:
9. Describe the association between the TV watching and happiness. Don’t just say that the correlation is positive or
negative. Explain what your correlation tells you about how these two variables are associated with each other. (2 points)
10. Calculate the Pearson r by hand. Enter numbers from Data Set #1 in the table below. See “Sample Pearson Correlation
Coefficient Calculation” on the previous page for additional instructions. (2 points)
A
Participant
B
X
C
Y
D
(Xi-MX)
E
(Xi-MX)2
F
(Yi-MY)
G
(Yi-MY)2
H
(Xi-MX)(Yi-MY)
Sum
Average
Show additional calculations below:
Pearson r = ________
Data Set 2:
9. Based solely on the Pearson correlation coefficient, what would you conclude about the relation between arousal and
freethrow performance? Based on the scatterplot, what conclusion do you reach? Why does the Pearson statistic provide
an inaccurate picture in this particular example? (2 points)