* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download File
Dempster–Shafer theory wikipedia , lookup
Indeterminism wikipedia , lookup
Probability box wikipedia , lookup
Inductive probability wikipedia , lookup
Infinite monkey theorem wikipedia , lookup
Birthday problem wikipedia , lookup
Boy or Girl paradox wikipedia , lookup
Ars Conjectandi wikipedia , lookup
Risk aversion (psychology) wikipedia , lookup
+ Chapter 5: Probability: What are the Chances? Section 5.1 Randomness, Probability, and Simulation The Practice of Statistics, 4th edition – For AP* STARNES, YATES, MOORE + Section 5.1 Randomness, Probability, and Simulation Learning Objectives After this section, you should be able to… DESCRIBE the idea of probability DESCRIBE myths about randomness Definition: The probability of any outcome of a chance process is a number between 0 (never occurs) and 1(always occurs) that describes the proportion of times the outcome would occur in a very long series of repetitions. + Check Your Understanding Example 1 Probability is a measure of how likely an outcome is to occur. Match one of the probabilities that follow with each statement. Be prepared to defend your answer. {0, 0.01, 0.3, 0.6, 0.99, 1} a) This outcome is impossible. It can never occur. b) This outcome is certain. It will occur on every trial. c) This outcome is very unlikely, but it will ocurr once in a while in a long sequence of trials. d) This outcome will occur more often than not. + Solution 0 – never occur 1 – always occur 0.01 – approximately 1% 0.6 – approximately 60% will occur more than not occur, whereas .99 means it happens ALMOST every time. + Example 2 4 students are studying together. While they are getting a snack in the kitchen, someone’s younger brother comes in and mixes up all the textbooks. Each student takes a book at random. The graphs below show the short-run and long-run behavior of the proportion of trials in which there are no matches when four students choose a book at random. The blue line is the correct probability of 0.375. As you can see, in the first 20 trials, there is quite a bit of variability. However, after 500 trials, the proportion of times there was no match is quite close to the actual value. Idea of Probability The law of large numbers says that if we observe more and more repetitions of any chance process, the proportion of times that a specific outcome occurs approaches a single value. Randomness, Probability, and Simulation Chance behavior is unpredictable in the short run, but has a regular and predictable pattern in the long run. + The + Example 3 How much should a company charge for an extended warranty for a specific type of cell phone? Suppose that 5% of these cell phones under warranty will be returned, and the cost to replace the phone is $150. What is the minimum amount the company should charge for the extended warranty? + Solution If the company knew which phones would go bad, it could charge $150 for these phones and $0 for the rest. However, since the company can’t know which phones will be returned but knows that about 1 in every 20 will be returned, they should charge at least 150/20 = $7.50 for the extended warranty. + Myths about Randomness The idea about probability is that randomness is predictable in the long run. Unfortunately, our intuition about randomness tries to tell us that random phenomena should also be predication in the short run. When they aren’t, we look for some explanation other than chance variation. + Example 4 123456654321 or 154524336126 Randomness, Probability, and Simulation Roll a die 12 times and record the result of each roll. Which of the following outcomes is more probable? + 4 Solution These outcomes are both equally (un)likely, even though the first set of rolls has a more noticeable pattern. Randomness, Probability, and Simulation Example Example 5 +Suppose that a basketball announcer suggests a certain player is streaky. That is, the announcer believes that if the player makes a shot, then the player is more likely to make his next shot. As evidence, the announcer points to a recent game where the player took 30 shots and had a streak of 7 made shots in a row. Is this convincing evidence of streakiness, or could it have occurred simply by chance? Assuming that this player makes 50% of his shots and the results of a shot don’t depend on previous shots, how likely is it for the player to have a streak of 7 or more made shots in a row? Using your calculator, let 1 represent a made shot and 0 represent a missed shot. Calculate a random integer (either 0 or 1) 30 times, writing down the outcome after each calculation. Record the length of the longest streak of made shots. Combine your results with those of your classmates. In what proportion of the trials did the player have a streak of at least 7 in a row? + STATE How likely is it for the player to have a streak of 7 or more shots in a row? + PLAN Use the random number generator on the calculator to generate 30 random shots. 0 represents a missed shot. 1 represents a shot made. Record the outcome of each trial. Record the longest streaks on a dotplot. + DO Perform this twice. _________________________________________________________________________________ 2 3 4 5 6 7 8 9 10 + CONCLUDE The player had a streak of 7 or more made shots in ____ of the ____ simulated games (_____%). Randomness, Probability, and Simulation In casinos, there is often a large display next to every roulette table showing the outcomes of the last several spins of the wheel. Since the results of previous spins reveal nothing about the results of future spins, why do the casinos pay for these displays? + Example 6 + Solution Because many players use the previous results to determine what bets to make, even though it won’t help them win. And as long as the players keep making bets, the casino keeps making money. + Section 5.1 Randomness, Probability, and Simulation Learning Objectives After this section, you should be able to… DESIGN and PERFORM simulations + Simulation Performing a Simulation State: What is the question of interest about some chance process? Plan: Describe how to use a chance device to imitate one repetition of the process. Explain clearly how to identify the outcomes of the chance process and what variable to measure. Do: Perform many repetitions of the simulation. Conclude: Use the results of your simulation to answer the question of interest. We can use physical devices, random numbers (e.g. Table D), and technology to perform simulations. Randomness, Probability, and Simulation The imitation of chance behavior, based on a model that accurately reflects the situation, is called a simulation. + IMPORTANT When making conclusions, be careful to say that there is either convincing evidence (or there isn’t convincing evidence) to support a particular claim. DO NOT say that a claim is definitely true or that the evidence proves that a claim is incorrect. + Example 1 Suppose I want to choose a simple random sample of size 6 from a group of 60 seniors and 30 juniors. To do this, I write each person’s name on an equally-sized piece of paper and mix the papers in a large grocery bag. Just as I am about to select the first name, a thoughtful student suggests that I should stratify by class. I agree, and we decide it would be appropriate to select 4 seniors and 2 juniors. However, because I have already mixed up the names, I don’t want to have to separate them all again. Instead, I will select names one at a time from the bag until I get 4 seniors and 2 juniors. Design and carry out a simulation using Table D to estimate the probability that you must draw 10 or more names to get 4 seniors and 2 juniors. + STATE What is the probability that it takes 10 or more selections to get 4 seniors and 2 juniors? + PLAN Using pairs of digits from Table D, we’ll label the 60 seniors 01– 60 and the 30 juniors 61–90. Numbers 00 and 91–99 will be skipped. Moving left to right across a row, we’ll look at pairs of digits until we have 4 different labels from 01–60 and 2 different labels from 61–90. Then we will count how many different labels from 01–90 we looked at. + DO Here is an example of one repetition, using line 101 from Table D: 19 (senior) 22 (senior) 39 (senior) 50 (senior) 34 (senior) 05 (senior) 75 (junior) 62 (junior) In this trial, it took exactly 8 selections to get at least 4 seniors and at least 2 juniors. Here are the results ofDot50 Plottrials: Collection 2 6 7 8 9 10 11 NumSe l 12 13 14 + CONCLUDE In the simulation, 11 of the 50 trials required 10 or more selections to get 4 seniors and 2 juniors, so the probability that it takes 10 or more selections is approximately 0.22. + Example 2 At their annual picnic, 18 students in the mathematics/statistics department at a university decide to play a softball game. Twelve of the 18 students are math majors and 6 are stats majors. To divide into two teams of 9, one of the professors put all the players’ names into a hat and drew out 9 players to form one team, with the remaining 9 players forming the other team. The players were surprised when one team was made up entirely of math majors. Is it possible that the names weren’t adequately mixed in the hat, or could this have happened by chance? Design and carry out a simulation to help answer this question. + STATE What is the probability that when randomly assigning 12 Math majors and 6 Stats majors to two teams that there will be one team with all Math majors? + PLAN Using pairs of digits from Table D, we’ll label the 12 math majors 01–12 and the 6 stats majors 13–18. Number 00 will be skipped. Moving left to right across a row, we’ll look at pairs of digits until we have a team of 9.Then we will count how many different labels from 01–18 we looked at. + DO Here is an example of one trial: Team A: MMMSMMMMM (8 Math majors) Team B: MSMMSSSSM (4 Math majors) Since the team with the most Math majors had 8, we will record the value 8 for this trial. Here are the results of 30 trials: + CONCLUDE Since only 1 trial in 30 resulted in a team with all Math majors, the probability is only approximately 0.033. Since getting a team of all Math majors is unlikely, we can conclude that the names were probably not shuffled very well in the hat. + Golden Ticket Parking Lottery + Example: Read the example on page 290. What is the probability that a fair lottery would result in two winners from the AP Statistics class? Reading across row 139 in Table Students Labels D, look at pairs of digits until you AP Statistics Class 01-28 see two different labels from 0195. Record whether or not both Other 29-95 winners are members of the AP Skip numbers from 96-00 Statistics Class. 55 | 58 89 | 94 04 | 70 70 | 84 10|98|43 56 | 35 69 | 34 48 | 39 45 | 17 X|X X|X ✓|X X|X ✓|Sk|X X|X X|X X|X X|✓ No No No No No No No No No 19 | 12 97|51|32 58 | 13 04 | 84 51 | 44 72 | 32 18 | 19 ✓|✓ Sk|X|X X|✓ ✓|X X|X X|X ✓|✓ X|Sk|X Sk|✓|✓ Yes No No No No No Yes No Yes 40|00|36 00|24|28 Based on 18 repetitions of our simulation, both winners came from the AP Statistics class 3 times, so the probability is estimated as 16.67%. NASCAR Cards and Cereal Boxes + Example: Read the example on page 291. What is the probability that it will take 23 or more boxes to get a full set of 5 NASCAR collectible cards? Driver Label Jeff Gordon 1 Dale Earnhardt, Jr. 2 Tony Stewart 3 Danica Patrick 4 Jimmie Johnson 5 Use randInt(1,5) to simulate buying one box of cereal and looking at which card is inside. Keep pressing Enter until we get all five of the labels from 1 to 5. Record the number of boxes we had to open. 3 5 2 1 5 2 3 5 4 9 boxes 4 3 5 3 5 1 1 1 5 3 1 5 4 5 2 15 boxes 5 5 5 2 4 1 2 1 5 3 10 boxes We never had to buy more than 22 boxes to get the full set of cards in 50 repetitions of our simulation. Our estimate of the probability that it takes 23 or more boxes to get a full set is roughly 0. + Section 5.1 Randomness, Probability, and Simulation Summary In this section, we learned that… A chance process has outcomes that we cannot predict but have a regular distribution in many distributions. The law of large numbers says the proportion of times that a particular outcome occurs in many repetitions will approach a single number. The long-term relative frequency of a chance outcome is its probability between 0 (never occurs) and 1 (always occurs). Short-run regularity and the law of averages are myths of probability. A simulation is an imitation of chance behavior. + Looking Ahead… In the next Section… We’ll learn how to calculate probabilities using probability rules. We’ll learn about Probability models Basic rules of probability Two-way tables and probability Venn diagrams and probability