Download Guided Notes

Document related concepts

Probability wikipedia , lookup

History of statistics wikipedia , lookup

Statistics wikipedia , lookup

Transcript
AMDG
Name
AP Exam Review: The Practice of Statistics (4th Edition) - Starnes, Yates, Moore
Chapter 1: Exploring Data
Key Vocabulary:













individual
variable
frequency table
relative frequency table
distribution
pie chart
bar graph
two-way table
marginal distributions
conditional distributions
side-by-side bar graph
association
dotplot












stemplot
histogram
SOCS
outlier
symmetric

x
spread
variability
median
quartiles
Q1, Q3








IQR
five-number summary
minimum
maximum
boxplot
resistant
standard deviation
variance
Data Analysis: Making Sense of Data (pp.2-6)
1. Individuals are…
2. A variable is …
3. When you first meet a new data set, ask yourself:

Who…

What…

Why, When, Where and How…
4. Explain the difference between a categorical variable and a quantitative variable. Give an example
of each.
5. Give an example of a categorical variable that has number values.
6. Define distribution:
AMDG
7. What are the four steps to exploring data?

Begin by….

Study relationships…

Start with a …

Then add…
8. Answer the two questions for the Check Your Understanding on page 5:
9. Define inference.
1.1 Analyzing Categorical Data (pp.8-22)
1. A frequency table displays…
2. A relative frequency table displays…
3. What type of data are pie charts and bar graphs used for?
4. Categories in a bar graph are represented by ___________ and the bar heights give the category
__________________.
5. What is a two-way table?
6. Define marginal distribution.
7. What are the two steps in examining a marginal distribution?
2
AMDG
8. Answer the two questions for the Check Your Understanding on page 14.
9. What is a conditional distribution? Give an example demonstrating how to calculate one set of
conditional distributions in a two-way table.
10. What is the purpose of using a segmented bar graph?
11. Answer question one for the Check Your Understanding on page 17.
12. Describe the four steps to organizing a statistical problem:


State…
Plan…

Do…

Conclude…
13. Explain what it meant by an association between two variables.
1.2 Analyzing Categorical Data (pp.27-42)
1. What is a dotplot? Draw an example.
2. When examining a distribution, you can describe the overall pattern by its
S_____
O_____
C_____
3. If a distribution is symmetric, what does it look like?
3
S_____
AMDG
4. If a distribution is skewed to the right, what does it look like?
5. If a distribution is skewed to the left, what does it look like?
6. Describe and illustrate the following distributions:
a. Unimodal
b. Bimodal
c. Multimodal
7. Answer questions 1-4 for the Check Your Understanding on page 31.
8. How are a stemplot and a histogram similar?
9. When is it beneficial to split the stems on a stemplot?
10. When is it best to use a back-to-back stemplot?
11. List the three steps involved in making a histogram.
12. Why is it advantageous to use a relative frequency histogram instead of a frequency histogram?
4
AMDG
13. Answer questions 2-4 for the Check Your Understanding on page 35.
1.3 Analyzing Categorical Data (pp.50-67)
1. What is the most common measure of center?
2. Explain how to calculate the mean, x .
3. What is the meaning of ?
4. Explain the difference between x and .
5. Define resistant measure.
6. Explain why the mean is not a resistant measure of center.
7. What is the median of a distribution? Explain how to find it.
8. Explain why the median is a resistant measure of center?
9. How does the shape of the distribution affect the mean and median?
10. What is the range?
11. Is the range a resistant measure of spread? Explain.
5
AMDG
12. How do you find first quartile Q1 and third quartile Q3?
13. What is the Interquartile Range (IQR)?
14. Is the IQR and the quartiles a resistant measure of spread? Explain.
15. How is the IQR used to identify outliers?
16. What is the five-number summary of a distribution?
17. Explain how to use the five-number summary to make a boxplot.
18. What does the standard deviation measure? How do we calculate it?
19. What is the relationship between variance and standard deviation?
20. What are the properties of the standard deviation as explained on page 64?
21. How should one go about choosing measures of center and spread?
6
AMDG
Chapter 2: Modeling Distributions of Data
Key Vocabulary:

percentiles

transform data

cumulative relative

mean of density curve
frequency graphs

standard deviation of
density curve

z-scores

transforming data

Normal curves

density curves

Normal distributions

median of density curve

68-95-99.7 rule

N( =__,  = ___)

standard Normal distribution

standard Normal table

Normal probability plot


mu


sigma
2.1 Describing Location in a Distribution (pp.84-103)
1. A percentile is…
2. Is there a difference between the 80th percentile and the top 80%? Explain.
3. Is there a difference between the 80th percentile and the lower 80%? Explain.
4. Refer to the “Cumulative Relative Frequency Graphs” section on page 86 to answer the following
questions:
a. Explain how to find the relative frequency column.
b. Explain how to find the cumulative frequency column.
c. Explain how to find the cumulative relative frequency column.
5. Explain how to make a cumulative relative frequency graph.
6. What can a cumulative relative frequency graph be used to describe?
7. Answer the four questions for the Check Your Understanding on page 89.
7
AMDG
8. Explain how to standardize a variable.
9. What information does a z – score provide?
10. Explain how to calculate and interpret a z- score.
11. What is the purpose of standardizing a variable?
12. Explain the effects of adding or subtracting a constant from each observation when transforming
data.
13. Explain the effects of multiplying or dividing by a constant from each observation when transforming
data.
14. Summarize the four steps for exploring quantitative data as outlined on page 99.
15. What is a density curve?
16. What does the area under a density curve represent?
17. Where is the median of a density curve located?
8
AMDG
18. Where is the mean of a density curve located?
19. Answer questions 1 and 2 for the Check Your Understanding on page 103.
2.2
Normal Distributions (pp.110-128)
1. How would you describe the shape of a Normal curve? Draw two examples.
2. Explain how the mean and the standard deviation are related to the Normal curve.
3. Define Normal distribution and Normal curve.
4. What is the abbreviation for a Normal distribution with a mean  and a standard deviation ?
5. Explain the 68-95-99.7 Rule. When does this rule apply?
6. Answer questions 1-3 for the Check Your Understanding on page 114.
7. What is the standard Normal distribution?
8. What information does the standard Normal table give?
9. How do you use the standard Normal table (Table A) to find the area under the standard Normal
curve to the left of a given z-value? Draw a sketch.
9
AMDG
10. How do you use Table A to find the area under the standard Normal curve to the right of a given
z-value? Draw a sketch.
11. How do you use Table A to find the area under the standard Normal curve between two given zvalues? Draw a sketch.
12. Summarize the steps on how to solve problems involving Normal distributions as outlined on
page 120.
13. When is it appropriate to use Table A “backwards”?
14. Describe two methods for assessing whether or not a distribution is approximately Normal.
15. What is a Normal probability plot?
16. How do you interpret a Normal probability plot?
17. When is it appropriate to use the NormalCDF and Inverse Normal functions on the calculator?
10
AMDG
Chapter 3: Describing Relationships
3.1
Scatterplots and Correlation (pp.142-156)
1. Why do we study the relationship between two quantitative variables?
2. What is the difference between a response variable and the explanatory variable?
3. How are response and explanatory variables related to dependent and independent variables?
4. When is it appropriate to use a scatterplot to display data?
5. A scatterplot shows the relationship between…
6. Which variable always appears on the horizontal axis of a scatterplot?
7. When examining a scatterplot, you can describe the overall pattern by its:
D_____
O_____
F_____
S_____
8. Explain the difference between a positive association and a negative association.
9. What is correlation r?
10. Answer the five questions for the Check Your Understanding on page 149.
11. What does correlation measure?
11
AMDG
12. Explain why two variables must both be quantitative in order to find the correlation between
them.
13. What is true about the relationship between two variables if the r-value is:
a. Near 0?
b. Near 1?
c. Near -1?
d. Exactly 1?
e. Exactly -1?
14. Is correlation resistant to extreme observations? Explain.
15. What do you need to know in order to interpret correlation?
3.2
Least-Squares Regression (pp.164-188)
1. What is a regression line?
2. In what way is a regression line a mathematical model?
3. What is the general form of a regression equation? Define each variable in the equation.
4. What is the difference between y and ŷ ?
12
AMDG
5. What is extrapolation and why is this dangerous?
6. Answer the four questions for the Check Your Understanding on page 167.
7. What is a residual? How do you interpret a residual?
8. What is a least-squares regression line?
9. What is the formula for the equation of the least-squares regression line?
10. The least-squares regression line always passes through the point ...
11. What is a residual plot? Sketch a graph of a residual plot.
12. If a least-squares regression line fits the data well, what two characteristics should the residual plot
exhibit?
13. What is the standard deviation of the residuals? How is it interpreted?
14. How is the coefficient of determination defined?
15. What is the formula for calculating the coefficient of determination?
13
AMDG
16. If r2 = 0.95, what can be concluded about the relationship between x and y?
______% of the variation in (response variable) is accounted for by the regression line.
17. When reporting a regression, should r or r2 be used describe the success of the regression? Explain.
18. Identify the slope, the y intercept, s and r2 on the computer output.
19. What are three limitations of correlation and regression?
20. What is an outlier?
21. What is an influential point?
22. Under what conditions does an outlier become an influential observation?
14
AMDG
23. What is a lurking variable?
24. Why does association not imply causation?
15
AMDG
16
AMDG
Chapter 4: Designing Studies
Key Vocabulary:















sample
population
sample survey
voluntary response sample
confounded
design
convenience sampling
biased
simple random sample
table of random digits
probability sample
stratified random sample
cluster sampling
inference
margin of error
















strata
undercoverage
nonresponse
response bias
sampling frame
systematic random sample
observational study
experimental
confounding
lurking variable
experimental units
subjects
random assignment
treatment
factor
level
17


placebo effect
single blind experiment


control group
completely randomized
experiment
randomized block design
matched pair design
statistically significant
replication
hidden bias
double-blind experiment
block design
data ethics








AMDG
4.1
Sampling and Surveys (pp.206-224)
1. Explain the difference between a population and a sample.
2. What is involved in planning a sample survey?
3. Why might convenience sampling be unreliable?
4. What is a biased study?
5. Why are voluntary response samples unreliable?
6. Define simple random sample (SRS).
7. What two properties of a table of random digits make it a good choice for creating a simple
random sample?
8. State the two steps in choosing an SRS:
9. What is the difference between sampling with replacement and sampling without replacement?
10. How can you account for this difference with and without replacement when using a table of
random digits or other random number generator?
11. How do you select a stratified random sample?
18
AMDG
12. What is cluster sampling?
13. What is inference?
14. What is a margin of error?
15. What is the benefit of a larger sample size?
16. A sampling frame is…
17. Give an example of undercoverage in a sample.
18. Give an example of nonresponse bias in a sample.
19. Give an example of response bias in a sample.
20. How can the wording of questions cause bias in a sample?
21. Answer the two questions for the Check Your Understanding on page 224.
19
AMDG
4.2
Experiments (pp.231-251)
1. Explain the differences between observational study and experiment.
2. A lurking variable is…
3. What problems can lurking variables cause?
4. Confounding occurs when…
5. Answer the four questions for the Check Your Understanding on page 233.
6. Explain the difference between experimental units and subjects.
7. Define treatment.
8. By studying the TV Advertising example on page 235, identify the factors and levels in the
experiment.
9. Explain why the example, Which Works Better: Online or In-Class SAT Preparation, is a bad
experiment.
10. What is random assignment?
11. What is a comparative experimental design?
12. In a completely randomized design…
20
AMDG
13. Does using chance to assign treatments in an experiment guarantee a completely randomized design?
Explain.
14. What is the significance of using a control group?
15. The basic principles of statistical design experiments are:
16. Define control, random assignment and replication in experimental design.
17. Describe the placebo effect.
18. What are the differences between a double-blind and single-blind experiement?
19. Define statistically significant.
20. What is a block?
21. What is a randomized block design?
22. When does randomization take place in a block design, and how does this differ to a completely
randomized design?
23. What is the goal of a matched pairs design?
24. When is it beneficial to use a blocked/paired design? How should we choose which variables to block
for?
21
AMDG
4.3
Using Studies Wisely (pp.261-267)
1. Name the two types of inferences that can be identified based on the design of a study.
2. Name the challenges of establishing causation.
3. What are the four criteria for establishing causation when we can’t do an experiment?
4. Briefly describe the basics of data ethics.
22
AMDG
Chapter 5: Probability: What are the chances?
Key Vocabulary:
 law of large numbers
 probability
 simulation
 two -way table
 sample space
 S = {H, T}


tree diagram
probability model
5.1








replacement
event
P(A)
complement AC
disjoint
mutually exclusive event
Venn diagram
union (or)






intersection (and)
conditional probability
independent events
general multiplication rule
general addition rule
multiplication rule
Randomness, Probability, and Simulation (pp.282-292)
1. What is the law of large numbers?
2. The probability of any outcome…
3. How do you interpret a probability?
4. Answer the two questions for the Check Your Understanding on page 286.
5. What are the two myths about randomness? Explain.
6. Define simulation.
7. Name and describe the four steps in performing a simulation:
8. What are some common errors when using a table of random digits?
23
AMDG
5.2
Probability Rules (pp.299-308)
1. In statistics, what is meant by the term sample space?
2. In statistics, what is meant the term probability model?
3. What is an event?
4. What is the P (A) if all outcomes in the sample space are equally likely?
5. Define the complement of an event. What is the complement rule?
6. Explain why the probability of any event is a number between 0 and 1.
7. What is the sum of the probabilities of all possible outcomes?
8. Describe the probability that an event does not occur?
9. When are two events considered disjoint or mutually exclusive?
10. What is the addition rule for mutually exclusive events?
11. What is the probability of two disjoint events?
12. Summarize the five basic probability rules as outlined on page 302.
13. Answer the three questions for Check Your Understanding on page 303.
24
AMDG
14. When is a two-way table helpful?
15. In statistics, what is meant by the word “or”?
16. When can a Venn diagram be helpful?
17. What is the general addition rule for two events?
18. What happens if the general addition rule is used for two mutually exclusive events?
19. What does the union of two or more events mean? Illustrate on a Venn diagram.
20. What does the intersection of two or more events mean? Illustrate on a Venn diagram.
5.3
Conditional Probabilty and Independence (pp.312-327)
1. What is conditional probablity? What is the notation for conditional probabilty?
2. Answer the two questions for the Check Your Understanding on page 314.
3. What are independent events?
4. What is the notation used for independent events?
25
AMDG
5. Answer the three questions for Check Your Understanding on page 317.
6. When is a tree diagram helpful?
7. State the general multiplication rule for any two events.
8. State the multiplication rule for independent events.
9. How is the general multiplication rule different than the multiplication rule for independent
events?
10. Explain the difference between mutually exclusive and independent.
11. State the formula for calculating conditional probabilities.
12. How is the conditional probability formula related to the general multiplication rule?
26
AMDG
Chapter 6: Random Variables
Key Vocabulary:








random variable
discrete random variable
probability distribution
mean of a random variable
variance of a random
variable
probability density curve
continuous random
variable
standard deviation











binomial setting
binomial random variable
binomial distribution
binomial coefficient
binomial probability
linear transformation
normal approximation
geometric setting
geometric distribution
geometric random variable
Normal approximation




geometric probability
factorial
expected value
standard deviation

X
Y

uniform distribution


6.1
Discrete and Continuous Random Variables (pp.341-352)
1. What is a random variable?
2. Define probability distribution.
3. What is a discrete random variable?
4. What are the two requirements for the probability distributions of discrete random variables?
5. If X is a discrete random variable, what information does the probability distribution of X
give?
6. In a probability histogram what does the height of each bar represent (assuming the width of
each bar is the same)?
7. In a probability histogram, what is the sum of the height of each bar?
8. What is the mean  X of a discrete random variable X?
9. How do you calculate the mean of a discrete random variable?
27
AMDG
10. Define expected value. What notation is used for expected value?
11. Does the expected value of a random variable have to equal one of the possible values of the
random variable? Explain.
12. Explain how to calculate the variance and standard deviation of a discrete random variable.
13. Explain the meaning of the standard deviation of a random variable X.
14. What is a continuous random variable and how is it displayed?
15. If X is a continuous random variable, how is the probability distribution of X described?
16. What is the area under a probability density curve equal to?
17. What is the difference between a discrete random variable and a continuous random
variable?
18. If X is a discrete random variable, do P  X  2  and P  X  2  have the same value?
Explain.
19. If X is a continuous random variable, do P  X  2  and P  X  2  have the same value?
Explain.
20. How is a Normal distribution related to probability distribution?
28
AMDG
6.2
Transforming and Combining Random Variables
(pp.358-375)
1. What is the effect on a random variable of multiplying or dividing by a constant?
2. How does multiplying by a constant effect the variance?
3. What is the effect on a random variable of adding or subtracting by a constant?
4. Define linear transformation.
5. What are the effects of a linear transformation on the mean and standard deviation?
6. Define the mean of the sum of random variables.
7. What are independent random variables?
8. Define the variance of the sum of independent random variables. What types of variables
does it apply to?
9. When can you add the variances of two random variables?
10. State the equation for the mean of the difference of random variables?
11. State the formula for the variance of the difference of random variables.
12. What happens if two independent Normal random variables are combined?
13. Suppose  X = 5 and Y = 10. According to the rules for means, what is  X Y ?
29
AMDG
14. Suppose  X = 2. According to the rules for means, what is 3 4 X ?
15. Suppose  X2 = 2 and  Y2 = 3 and X and Y are independent random variables. According to
the rules for variances, what is  X2 Y ? What is  X Y ?
16. Suppose  X2 = 4. According to the rules for variances, what is  32 4 X ? What is  3 4 X ?
6.3
Binomial and Geometric Random Variables (pp.382-401)
1. What is a binomial setting?
2. Describe the conditions of a binomial setting.
3. What is a binomial random variable and what are its possible values?
4. Define the parameters of a binomial distribution.
5. Explain the meaning of the binomial coefficient and state the formula.
6. Explain how to calculate binomial probabilities.
7. What commands on the calculator are used to calculate binomial probabilities?
8. Explain how to calculate the mean and standard deviation of a binomial random variable.
9. When can the binomial distribution be used to sample without replacement? Explain why this is an
issue.
10. What is a geometric setting?
30
AMDG
11. Describe the conditions for a geometric setting.
12. What is a geometric random variable and what are its possible values?
13. Describe the parameters of a geometric distribution.
14. What is the formula for geometric probability?
15. How is the mean of a geometric random variable calculated?
31
AMDG
32
AMDG
Chapter 7: Sampling Distributions
Key Vocabulary:













parameter
statistic
sampling variability
sampling distribution
population distribution
biased estimator
unbiased estimator
bias
variability
variability of a statisic
sample proportion
mean and standard deviation of sampling distributions
central limit theorem
7.1
What Is a Sampling Distribution? (pp.414-428)
1. Explain the difference between a parameter and a statistic?
2. What is sampling variability?
3. Explain the difference between  and x , and between p and p̂ ?
4. What is meant by the sampling distribution of a statistic?
5. What is population distribution?
6. What is the difference between the distribution of the population, the distribution of the
sample, and the sampling distribution of a sample statistic?
7. What is sampling variability?
8. Explain the difference between  and x , and between p and p̂ ?
9. What is meant by the sampling distribution of a statistic?
10. What is population distribution?
11. What is the difference between the distribution of the population, the distribution of the
sample, and the sampling distribution of a sample statistic?
33
AMDG
12. When is a statistic considered an unbiased estimator?
13. What is biased estimator?
14. How is the size of a sample related to the spread of the sampling distribution?
15. The variability of a statistic is…
16. Explain the difference between bias and variability.
7.2
Sample Proportions (pp.432-438)
1. What is the purpose of the sample proportion?
2. In an SRS of size n, what is true about the sampling distribution of p̂ when the sample size n
increases?
3. In an SRS of size n:
a. What is the mean of the sampling distribution of p̂ ?
b. What is the standard deviation of the sampling distribution of p̂ ?
4. What happens to the standard deviation of p̂ as the sample size n increases?
5. When does the formula
p 1  p 
apply to the standard deviation of p̂ ?
n
6. When the sample size n is large, the sampling distribution of p̂ is approximately Normal.
What test can you use to determine if the sample is large enough to assume that the sampling
distribution is approximately normal?
34
AMDG
7.3
Sample Means (pp.442-452)
1. What are the mean and standard deviation of the sampling distribution of the sample mean x ?
Describe the conditions for these formulas.
2. Explain how the behavior of the sample mean and standard deviation are similar to the sample
proportion.
3. The mean and standard deviation of a population are parameters.
What symbols are used to represent these parameters?
4. The mean and standard deviation of a sample are statistics.
What symbols are used to represent these statistics?
5. The shape of the distribution of the sample mean depends on …
6. Because averages are less variable than individual outcomes, what is true about the standard
deviation of the sampling distribution of x ?
7. What symbols are used to represent the mean and standard deviation of the sampling
distribution of x ?
8. What is the mean of the sampling distribution of x , if x is the mean of an SRS of size n
drawn from a large population with mean and standard deviation ?
9. What is the standard deviation of the sampling distribution of x , if x is the mean of an SRS
of size n drawn from a large population with mean and standard deviation ?
10. When should you use

n
to calculate the standard deviation of x ?
11. What is the Central Limit Theorem?
35
AMDG
36
AMDG
Chapter 8: Estimating with Confidence
Key Vocabulary:


















point estimator
point estimate
confidence interval
margin of error
interval
confidence level
random
normal
independent
four step process
level C confidence interval
degrees of freedom
standard error
one -sample z interval
t distribution
t-procedures
one-sample t interval
robust
8.1
Confidence Intervals: The Basics (pp.615-643)
10. A point estimator is a statistic that…
11. The value of the point estimator statistic is called a ____________________ and it is our
"best guess" at the value of the _____________________.
12. Summarize the facts about sampling distributions learned in chapter 7:
Shape
Center
Spread
37
AMDG
13. In statistics, what is meant by a 95% confidence interval?
14. A confidence interval takes the form of : “estimate  margin of error”
where:
estimate =
margin of error =
15. Define a level C confidence interval.
16. What information does the margin of error provide?
17. Sketch and label a 95% confidence interval for the standard normal curve.
18. In a sampling distribution of x , why is the interval of numbers between x  2s called a 95%
confidence interval?
19. Sketch and label a 90% confidence interval for the standard normal curve.
20. Interpret a Confidence level: "To say that we are 95% confident is shorthand for …..
21. Explain how to interpret a Confidence interval.
22. Does the confidence level tell us the chance that a particular confidence interval captures the
population parameter? If not, what does it tell us?
23. What does the critical value depend on?
24. Write the form for calculating a confidence interval as shown on page 478.
25. Why do we want high confidence and a small margin of error?
38
AMDG
26. Explain the two conditions when the margin of error gets smaller.
27. State the three conditions for constructing a confidence interval for p or  .
 Random

Normal

Independent
19. What are the two important reminders for constructing and interpreting confidence intervals?
8.2
Estimating a Population Proportion (pp.484-494)
1. In statistics, what is meant by a sample proportion: p̂ ?
2. Give the mean and standard deviation for the sampling distribution of p̂ ?
3. How does the standard deviation differ to to standard error for the sampling distribution of
p̂ ?
4. Describe the sampling distribution of a sample proportion p̂ as learned in section 7.2.
 Shape

Center
39
AMDG

Spread
5. Define standard error.
6. In general what is meant by the standard error of a statistic?
7. How do you calculate the standard error of p̂ ?
8. What is the formula for a one-sample z interval for a population proportion? Describe how to
construct a level C confidence interval for a population proportion.
9. Describe the four step process on how to contruct and interpret a confidence interval.
 State

Plan

Do

Conclude
10. What formula is used to determine the sample size necessary for a given margin of error?
11. What conditions must be met in order to use z procedures for inference about a proportion?
12. What does z* represent?
40
AMDG
13. What is the value of z* for a 95% confidence interval? Include a sketch.
14. What is the value of z* for a 90% confidence interval? Include a sketch.
15. What is the value of z* for a 99% confidence interval? Include a sketch.
8.3
Estimating a Population Mean (pp.499-515)
1. What is the formula for a one-sample z interval for a population mean? Describe how to construct
a level C confidence interval for a population mean.
2. What is the formula for the margin of error of the confidence interval for the population mean ?
3. How can you arrange to have both high confidence and a small margin of error?
4. Describe the three steps for choosing a sample size for a desired margin of error when estimating
.
5. What happens to the margin of error as z* gets smaller? Does this result in a higher or lower
confidence level?
6. What happens to the margin of error, as  gets smaller?
7. What happens to the margin of error, as n gets larger? By how many times must the sample size
n increase in order to cut the margin of error in half?
41
AMDG
8. The formula used to determine the sample size n that will yield a confidence interval for a
population mean with a specified margin of error m is z *
s
n
£ ME . Solve for n.
9. It is the size of the ________________ that determines the margin of error. The size of the
____________________ does not influence the sample size we need.
10. Complete the Check Your Undertanding on page 501.
11. How do you calculate the degrees of freedom for a t distribution?
12. What happens to the t distribution as the degrees of freedom increase?
13. How would you construct a t distribution?
14. Describe the differences between a standard normal distribution and a t distribution.
15. Describe the similarities between a standard normal distribution and a t distribution.
16. What is the formula for the standard deviation of the sampling distribution of the sample mean
x?
17. What is the standard error of the sample mean x ?
42
AMDG
28. Describe how to construct the one-sample t interval for a population mean?
29. Summarize the three conditions for inference about a population mean:
 Random

Normal

Independent
30. Inferences for proportions use ______ and inferences for means use _____.
31. What does it mean if an inference procedure is robust?
32. If the size of the SRS is less than 15, when can we use t procedures on the data?
33. If the size of the SRS is at least 15, when can we use t procedures on the data?
34. If the size of the SRS is at least 30, when can we use t procedures on the data?
43
AMDG
35. Summarize the details of the four step procedure for estimating p:
 State

Plan

Do

Conclude
44
AMDG
Chapter 9: Testing a Claim
Key Vocabulary:




















Significance test
Null Hypothesis
Alternative Hypotheses
One sided alternative
Two sided alternative
p-value
 level
significance level
one-sample z test
test statistic
one sample t test
paired data
four-step process
statistically significant
Type I Error
Type II Error
Power
Degrees of freedom
t-distribution
paired t procedures
9.1
Significance Tests: The Basics (pp.528-543)
20. What is a significance test?
21. What is the difference between a null and an alternative hypothesis? What notation is used for
each?
22. Explain the differences between one-sided and two-sided hypotheses. How can you decide which
one to use?
45
AMDG
23. What form does the null and alternative hypothesis take in significance testing?
24. Hypotheses always refer to a ___________, not to a ______________.
25. In statistics, what is meant by the P-value? What does a P-value measure?
26. If a P-value is small, what do we conclude about the null hypothesis?
27. If a P-value is large, what do we conclude about the null hypothesis?
28. What are common errors students make in their conclusions of P-values?
29. On what evidence would we reject the null hypothesis?
30. On what evidence would we accept the null hypothesis (ie. fail to reject the null hypothesis)?
31. What is meant by a significance level?
32. Explain what it means to say that data are statistically significant.
33. How small should the P-value be in order to claim that a result is statistically significant?
34. When using a fixed significance level to draw a conclusion in a statistical test what can be
concluded when the P value is   and ?
46
AMDG
35. What two circumstances guide us in choosing a level of significance?


36. What is a Type I Error?
37. What is a Type II Error ?
38. Which error is worse, Type I or Type II?
39. Complete the Check Your Understanding on page 539.
40. What is the relationship between the significance level  and the probability of Type I Error?
41. How can we reduce the probability of a Type I error?
42. What is meant by the power of a significance test?
43. What is the relationship between Power and Type II Error? Will you be expected to calculate the
power on the AP exam?
47
AMDG
44. What four factors affect the power of a test? Why does this matter?
45. Describe the three influences that must be verified before deciding on how many observations are
needed in a study.
 Significance Level
 Practical Importance
 Power
9.2
Tests about a Population Proportion (pp.549-561)
1. Summarize the three conditions that must be checked before carrying out significance tests:



2. State the general form of the “test statistic”.
3. What does the test statistic measure? Is this formula on the AP exam formula sheet?
4. Describe the four step process for signifigance tests. Explain what is required at each step.

State

Plan

Do

Conclude
48
AMDG
5. What test statistic is used when testing for a population proportion? Is this on the formula sheet?
6. Summarize the one-sample z test for a proportion and sketch the three possible H a ’s.

Choose…

To test…

Find…

Use this test…

If Normaility is not met
7. What happens when the data does not support H a ?
8. If asked to carry out a signifigance test and there is no  provided, what is recommended?
9. Can you use confidence intervals to decide between two hypotheses? What is the advantage to
using confidence intervals for this purpose?
10. Why don't we always use confidence intervals?
49
AMDG
9.3
Tests about a Population Mean (pp.565-585)
1. What are the three conditions for conducting a significance test for a population mean?
2. What test statistic do we use when testing a population mean? Is this formula on the AP exam
formula sheet?
3. How do you calculate p-values using the t-distributions?
4. What do you do if the degrees of freedom you need is not in table b?
5. How do you find p-values when carrying out a signifigance test about a population mean on
the calculator?
6. For a one-sample t- test for a population mean, state:
 H0

the three possible H a ’s (with small sketches to illustrate)

What is the t test statistic and how is it interpreted?

Under what conditions can this test be used…
7. In terms of rejecting the hypothesis H 0 , how is a significance test related to a confidence
interval on the same population?
50
AMDG
8. Use your calculator to find the p value (tcdf command) for the example Healthy Streams.
What is that p-value?
9. When using technology for the "DO" part of the four step process, what is recommended on
page 573?
10. Work through the Juicy Pineapple example on page 574. Use a calculator to find the exact Pvalue. Why is tcdf mulitplied by 2?
11. Why is the difference between using the calculator versus Table b when finding the p-value in
this example?
12. Do we have encough evidence to reject H 0 in the Juicy Pineapple example? Explain.
13. Read the Check Your Understanding on page 577 and answer questions 1 and 2.
14. What is paired data?
15. What information would lead us to apply a paired t-test to a study, and what would be the
statistic of interest?
51
AMDG
16. In the example, Is Caffeine Dependence Real, explain the difference in the "Do" procedures
for this example versus the Juicy Pineapple example.
17. Describe the four points to be aware of when interpreting signifigance tests.
52
AMDG
Chapter 10: Comparing Two Populations or Groups
Key Vocabulary:





difference between two
proportions
two sample z interval for proportions
two sample z test for difference between two proportions
two sample z statistic
two sample t statistic








pooled combined sample proportion
standard error
randomization distribution
paired t-test
two sample t test for means
two sample t interval for means
difference between two means
pooled two sample t statistic
10.1 Comparing Two Proportions (pp. 604-618)
1. Summarize the three properties of a sampling distribution of a sample proportion:
 Shape

Center

Spread
53
AMDG
2. What are the shape, center, and spread of the sampling distribution of pˆ1  pˆ 2 ? Provide the
formulas for the mean and standard deviation.
 Shape

Center

Spread
3. What conditions need to be met for the sampling distribution of pˆ1  pˆ 2 ?
4. Give the formula for the standard error when calculating a confidence interval for pˆ1  pˆ 2 , and
define each variable in the equation.
5. What is the confidence interval for pˆ1  pˆ 2 ?
6. What conditions must be met in order to use the Two-sample z Interval for a Difference between
Two Proportions?

Random

Normal

Independent
54
AMDG
8. Use the example, Teens and Adults on Social Networking Sites, to outline how to construct
and interpret a confidence interval for the difference between two proportions, p1  p2 .
9. State the null hypothesis for a two proportion significance test.
10. What does pˆ c represent, and how is it calculated?
11. Why do we pool the sample proportions?
12. Give the formula for the two-proportion z-statistic, and define each variable in the equation.
13. Is this on the formula sheet? What does the test statistic measure?
14. State and use diagrams to illustrate the three possible alternative hypotheses for a two
proportion z-test.
15. What are the conditions for conducting a two-sample z test for a difference between
proportions?
16. How are these different than the conditions for a one-sample z interval for p?
17. Describe the randomization distribution.
55
AMDG
18. What must you be careful about when defining parameters in experiments? How can this be
avoided?
19. Can you use your calculator for the Do step? Are there any drawbacks?
20. What are the calculator commands for the two-sample z test and interval for pˆ1  pˆ 2 ?
10.2 Comparing Two Means (pp.627-648)
1. Summarize the three properties of a sampling distribution of a sample mean:
 Shape

Center

Spread
2. What are the shape, center, and spread of the sampling distribution of x1  x2 ? Give the
formula for the mean and standard deviation.
 Shape

Center

Spread
56
AMDG
3. What are the conditions for the sampling distribution of x1  x2 ?
4. Give the formula for the two-sample t-statistic, and define each variable in the equation.
5. Is this on the formula sheet? What does it measure?
6. What is the standard error of x1  x2 ? Is this on the formula sheet?
7. What distribution does the two-sample t statistic have?
8. Why do we use a t statistic rather than a z statistic?
9. Without using technology, how do you estimate the degrees of freedom when using twosample t-procedures?
10. How do you calculate the confidence interval for 1  2 ?
11. In a two-sample t interval problem, what conditions must be met for comparing two means?
12. What are the conditions for conducting a two-sample t test for 1  2 ?
13. Draw a sketch of the three possible scenarios for the alternative hypothesis.
57
AMDG
14. Describe the Normal Condition when using the two sample t procedures.
15. What calculator commands are used for a two-sample t test and interval for 1  2
?
16. How do you proceed when using two-sample t procedures to check the Normal Condition in
the following cases:

Sample size less than 15

Sample size at least 15

Large samples
17. In a two-sample problem, must/should the two sample sizes be equal?
18. When doing two-sample t procedures, should we pool the data to estimate a common standard
deviation? Is there any benefit? Are there any risks?
58
AMDG
Chapter 11: Inference for Distributions of Categorical Data
Key Vocabulary:













one way table
chi-square test for goodness of fit
chi-square statistic
expected count
observed count
chi square distribution
degrees of freedom
chi-square distribution
components of chi-square
cell counts
r x c table
chi square test for homogeneity
chi square test for association/independence
11.1 Chi-Square Goodness of Fit Test (pp.678-690)
1. What is a one-way table?
2. What is a chi-square goodness-of-fit test?
3. What is the difference between the notation X2 and 2?
4. State the general form for the null hypotheses for a 2 goodness of fit test.
5. State the general form for the alternative hypotheses for a 2 goodness of fit test.
6. How do you calculate the expected counts for a chi-square goodness-of-fit test? How should you
round the answer for the expected counts?
7. What is the shape of a chi-square distribution? What happens to the shape as the degrees of
freedom increases? (Illustrate with a diagram)
8. Describe the center and spread of the chi-square distributions.
59
AMDG
9. What is the chi-square test statistic? Is it on the formula sheet? What does it measure?
10. How many degrees of freedom does the chi-square distribution have?
11. What is the rule of thumb for all expected counts in a chi-square goodness of fit test?
12. What conditions must be met in order to use the goodness of fit test?
12. How do you calculate p-values using chi-square distributions?
14. Can you use your calculator to conduct a chi-square goodness-of-fit test? If yes, what are the
calculator commands?
15.What is meant by a component of chi-square?
16. What does the largest component of chi-square signify?
17.Why is it necessary to perform follow-up analysis to a chi-square test?
60
AMDG
11.2 Inference for Relationships (pp.696-721)
1. What is the hypothesis for a test of homogeneity?
2. Describe the complications with multiple comparisons? How are they overcome?
3. Explain how to calculate the expected counts for a test that compares the distribution of a
categorical variable in multiple groups or populations.
4. Write the formula for the Chi-square test statistic? Is this on the AP Exam formula sheet?
5. What does the Chi-square test statistic measure?
6. What information is contained in a two-way table for a Chi-square test?
7. How many degrees of freedom does a chi-square test for a two-way table with r rows and c
columns have?
8. What requirements must be checked before carrying out a Chi-square test for Homogeneity?
9. State the null and alternative hypothesis for the Chi-square test for Homogeneity?
10. Can you use your calculators to do a Chi-square test of homogeneity? If yes, what are the
calculator commands?
61
AMDG
11. Summarize how to carry out a Chi-square Test for Homogeneity of Populations:
12. Explain how and when to conduct a follow-up analysis for a test of homogeneity?
13. What does it mean if two variables have an association?
14. What does it mean if two variables are independent?
15. State the null and alternative hypotheses for a Chi-square test for Association/Independence.
16. How is a test of association/independence different than a test of homogeneity?
17. How do you calculate expected counts for a test of association/independence?
62
AMDG
18. Summarize how to carry out a Chi-square Test for Association/Independence:
19. What are the conditions for a test of association/independence?
20. When should you use a chi-square test and when should you use a two-sample z test?
63
AMDG
64
AMDG
Chapter 12: More about Regression
Key Vocabulary:










sample regression line
true regression line
t interval for slope
standard error of slope
t test for slope
standardized test statistic
standard error
exponential model
power model
logarithmic model
12.1 Inference for Linear Regression (pp.739-757)
1. What is the difference between a sample regression line and population (true) regression
line?
2. Explain the sampling distribution of b?
3. Give the equation for the true regression line, and state what each component of the equation
represents.
4. Summarize the conditions for regression inference:

L

I

N

E

R
65
AMDG
5. Explain how to check the conditions for regression inference:

L

I

N

E

R
6. Record the formula for the standard error of the slope? Define the variables.
7. What is the formula for the t-interval of the slope of a least-squares regression line? Is this on
the AP exam formula sheet?
8. What is the formula for the t-test for the slope of the population regression line? Is this on the
AP exam formula sheet?
9. Describe the distribution of the standardized test statistic
b
.
SE b
10. What is the formula for constructing a confidence interval for a slope?
11. What calculator commands are used to get the value of t*?
12. Can you use your calculator to conduct a test and confidence interval for the slope?
66
AMDG
12.2 Transforming to Achieve Linearity (pp.765-783)
1. What does it mean to transform data?
2. What is a power model?
3. Give three examples of power models?
4.
Aside from power transformations, how can you linearize an association that follows a
power model in the form y = axp?
5.
Describe a logarithmic model. Give two examples.
6.
Describe an exponential model. Give two examples.
7.
Describe the two methods used to linearize a relationship that follows an exponential
model.
8.
Show how to use logarithms to transform the data given by y = axp to produce a linear
relationship.
9.
The big idea using logarithms to transform data is that "if a variable grows
__________________, its __________________ grow linearly."
10.
Describe how to achieve linearity from a power model as explained on page 777.
11.
After using a logarithm transformation, what does the scatter plot of the data show?
67