Download Frequencies and the normal distribution

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Bootstrapping (statistics) wikipedia , lookup

Transcript
Frequencies and the normal
distribution
CSC 152 (Blum)
1
Calculate the mean (average) and standard
deviation of the House –fly wing data
CSC 152 (Blum)
2
Calculate the sample size (count)
of the data
CSC 152 (Blum)
3
Calculate the minimum and maximum
of the data
CSC 152 (Blum)
4
Make a range of data starting at the Min plus
0.00001 and going to the Max by 0.1’s
CSC 152 (Blum)
5
Warning: Array formula
Array formulas have results
that span several cells
instead of just one.
They require:
1) Highlighting enough cells
for the complete answer
2) Clicking Ctrl-Shift-Enter
instead of just Enter.
If you attempt to edit part
of an array formula result,
problems occur. Use the
Esc key to get out of it.
CSC 152 (Blum)
6
Highlight cells D2:D21 (for the answer) and insert the
formula =FREQUENCY(A2:A101,C2:C21) and then hit
Ctrl-Shift-Enter
CSC 152 (Blum)
7
A frequency is the number of times something occurs – in
this case the values up to and including 3.6, then the
values between 3.6 (exclusive) and 3.7 (inclusive), etc.
CSC 152 (Blum)
8
Make a quick (unformatted, un-designed) XY
Scatter graph
CSC 152 (Blum)
9
Normal distribution
CSC 152 (Blum)
10
Use Excel’s formula for the
normal distribution
Notice that we have
used absolute
addressing for the
mean B$2 and
standard deviation
B$4. That means
when the formula is
copied elsewhere the
2 and 4 are held
fixed.
CSC 152 (Blum)
The 4th argument is
whether or not we
want to sum the
distribution
everything from
negative infinity up
to and including a
value – we said no.
11
Before comparing we need two more steps. The first is to divide the
frequencies by the sample size turning them effectively into
probabilities
Instead of saying the
value 4.2 occurred 7
times in a sample size
of 100, we say 4.2
occurred 0.07 or 7%
of the time.
Note the count
requires absolute
addressing.
CSC 152 (Blum)
12
The normal distribution has to be multiplied
by the Δx – the separation between our
values
We had gone up by 0.2’s
instead of by 0.1’s there
would be roughly half as
many Frequencies with
roughly twice the value
they have now.
CSC 152 (Blum)
13
Highlight columns C, F & G (only where
there’s data) and Insert an XY-Scatter chart.
CSC 152 (Blum)
14
Apply a Chart Layout (e.g. 1) under Design.
CSC 152 (Blum)
15
Change the title and axis labels
CSC 152 (Blum)
16
Format the axis to have a Minimum of 3
CSC 152 (Blum)
17
Right click on a data point and choose
Select Data
CSC 152 (Blum)
18
Select a data set, click Edit and give the
series a name.
CSC 152 (Blum)
19
Result – see legend
CSC 152 (Blum)
20
Right click on the second (normal dist.) series
and choose Change Series Chart Type. Choose
Smooth Lines.
CSC 152 (Blum)
21
Result
CSC 152 (Blum)
22
Histogram: highlight columns C & D and choose
Insert Column Chart
CSC 152 (Blum)
23
Right click and choose Select Data
CSC 152 (Blum)
24
Highlight Series 1 and click Remove
CSC 152 (Blum)
25
Highlight Series 2 and click Edit under
Horizontal Axis Labels
CSC 152 (Blum)
26
Then highlight the C column and click OK
CSC 152 (Blum)
27
Result so far
CSC 152 (Blum)
28
Choose a Layout (e.g. 7) and label axes
CSC 152 (Blum)
29
Add a title (if you haven’t got one already)
CSC 152 (Blum)
30
Right click on the columns and choose Format
Data Series. Choose the gap width to be 0%.
CSC 152 (Blum)
31
Choose a Solid Line and a Border Color.
CSC 152 (Blum)
32
Result
CSC 152 (Blum)
33