Download Chapter 1 - Reading Guide

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Bootstrapping (statistics) wikipedia , lookup

History of statistics wikipedia , lookup

Gibbs sampling wikipedia , lookup

Time series wikipedia , lookup

Categorical variable wikipedia , lookup

Transcript
TPS4e Chapter 1: Exploring Data
Chapter 1: Exploring Data
Key Vocabulary:














individual
variable
frequency table
relative frequency table
distribution
pie chart
bar graph
two-way table
marginal distributions
conditional distributions
side-by-side bar graph
association
dotplot
stemplot











histogram
SOCS
outlier
symmetric



IQR
five-number summary
x
spread
variability
median
quartiles
Q1, Q3






minimum
maximum
boxplot
resistant
standard deviation
variance
Data Analysis: Making Sense of Data (pp.2-6)
1. How is statistics defined?
2. Define data analysis?
3. Individuals are…
4. A variable (IT IS NOT X) is …
5. When you first meet a new data set, ask yourself:

Who…

What…

Why, When, Where and How…
6. Explain the difference between a categorical variable and a quantitative variable. Give an example of each.
7. Give an example of a categorical variable that has number values.
8. Define distribution:
Give an example of a distribution:
6. How should data be explored?
7. Answer the two questions for the Check Your Understanding on page 5:
8. Define inference.
9. What are the two primary ways to produce data?
1.1
Analyzing Categorical Data (pp.8-22)
1. A frequency table displays…
2. A relative frequency table displays…
3. In one sentence explain the difference between a frequency table and a relative frequency table?
4. What type of data are pie charts and bar graphs used for?
5. Pie Charts can only be used when?
2
6. Categories in a bar graph are represented by _______________________ and the bar heights give the
category _________________________________________.
7. What is a two-way table?
8. How is a two-way table setup?
9. Define marginal distribution.
10. Which is more informative when comparing: group counts or percents? Why?
11. Answer the two questions for the Check Your Understanding on page 14.
12. What is a conditional distribution? Give an example demonstrating how to calculate one set of
conditional distributions in a two-way table.
13. What is the purpose of using a segmented bar graph?
14. Answer Check Your Understanding on page 17.
3
15. Describe the four steps to organizing a statistical problem:

State…

Plan…

Do…

Conclude…
16. Explain what it meant by an association between two variables; Give an example.
17. What do you need to be cautious of when variables seem to have a strong association?
1.2
Analyzing Categorical Data (pp.27-42)
1. What is a dotplot? When would you use it? Draw an example.
2. [VERY IMPORTANT CONCEPT!!] When examining a distribution, you can describe the overall
pattern by its ( Make sure you understand how to compare distributions in context!!)
S___________
O__________
C________
S_________
3. Describe Shape
a) If a distribution is symmetric, what does it look like?
b) If a distribution is skewed to the right, what does it look like?
4
c) If a distribution is skewed to the left, what does it look like?
d) Describe and illustrate the following distributions:
i) Unimodal
ii) Bimodal
iii) Multimodal
4. Answer questions in Check Your Understanding on page 31.
5. How are a stemplot and a histogram similar?
6. When is it beneficial to split the stems on a stemplot?
7. When is it best to use a back-to-back stemplot?
8. How is the stemplot of a distribution related to its histogram?
5
9. Answer questions in Check Your Understanding on page 34.
10. What is a histogram?
11. List the three steps involved in making a histogram.
12. When is it better to use a histogram rather than a stemplot or dotplot?
13. What is meant by frequency in a histogram?
14. Why is it advantageous to use a relative frequency histogram instead of a frequency histogram?
15. What is the difference between a bar-graph and a histogram?
16. Define outlier.
6
17. Answer Check Your Understanding questions on pages 39.
18. Answer Check Your Understanding questions on pages 41.
1.3
Analyzing Categorical Data (pp.50-67)
Describe Center
1. In statistics, what are the most common measures of center?
2. Explain how to calculate by hand and find on the calculator the mean, x (Xbar).
3. What is the meaning of  (sigma)?
4. Explain the difference between x and  (mu).
5. Define resistant measure.
6. Explain why the mean is not a resistant measure of center.
7
7. What is the median of a distribution? Explain how to calculate by hand and find on the calculator.
8. Explain why the median is a resistant measure of center?
9. How does the shape of the distribution affect the mean and median?
10. In a symmetric distribution where are the mean and median in relation to each other? What about in a
distribution that is skewed?
11. Answer Check Your Understanding questions on page 55.
8
Describe Spread and Outliers
12. What is the range?
13. Is the range a resistant measure of spread? Explain.
14. What is the difference between “average” value and “typical” value?
15. How do you find first quartile Q1 and third quartile Q3 by hand and find on the calculator.
16. What is the Interquartile Range (IQR)?
17. Is the IQR and the quartiles a resistant measure of spread? Explain.
18. How is the IQR used to identify outliers?
19. What is the five-number summary of a distribution?
20. How much of the data falls between each quartile?
9
21. How much of the data falls between Q1 and Q3?
22. Describe a boxplot.
23. Explain how to use the five-number summary to make a boxplot.
24. Answer Check Your Understanding questions on page 61.
.
25. What does the standard deviation(S or Sx) measure?
b) How do we calculate it by hand and find on the calculator.
26. What is the relationship between variance (S2x or S2) and standard deviation (Sx or S)? Why do we
prefer to use standard deviation and NOT variance?
27. What are the 4 properties of the standard deviation explained on page 64?
10
28. When does standard deviation equal zero?
29. What are the units for the standard deviation of a distribution?
30. Is standard deviation resistant or nonresistant to extreme observations? Explain.
31. Answer Check Your Understanding questions on page 64.
32. How should one go about choosing measures of center and spread?
33. Use a five number summary when…
34. Use x and s when
11
12