Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Stats 95 • Experimental Design – Experimental Design & Lady Tasting Tea – Type I and Type II Errors – Null Hypothesis an Research Hypothesis Lady Tasting Tea • How would you design the experiment? • What task would you give her? • What would be the Independent Variable? Dependent variable? Control condition • How many could she guess right by chance? • What if she can taste the difference, but she makes mistakes? • Do you know for certain she can? Do you know for certain she cannot? Hypotheses • H0 Null Hypothesis: there is nothing going on, Straw Man, the probability of guessing Tea in Milk is equal to guessing Milk in Tea • H1 Research Hypothesis: something is going on, probability of correct identification is not equal to guessing between Milk in Tea and Tea in Milk. Hits (response “yes” on signal trial) Probability density Criterion N Say “no” S+N Say “yes” Internal response Correct rejects (response “no” on no-signal trial) Probability density Criterion N Say “no” S+N Say “yes” Internal response Misses (response “no” on signal trial) Probability density Criterion N Say “no” S+N Say “yes” Internal response False Alarms (response “yes” on no-signal trial) Probability density Criterion N Say “no” S+N Say “yes” Internal response “What Cold Possibly Go Worng?”: Type I and Type II Errors Reality Perception YES (Signal + Noise) YES Hit (Signal + Noise) NO (Noise) Miss (Type II) False Negative NO (Noise) False Alarm (Type I) False Positive Correct Rejection “What Cold Possibly Go Worng?”: Type I and Type II Errors REALITY of PREGNANCY YES NO HIT (Pregnant & “+” on test) FALSE ALARM (Type I) Also called False Positive (Not Pregnant & “+”) TEST RESULTS “PREGNANT” (Reject Null) “NOT PREGNANT” (Fail to reject the Null) Miss (Type II) Also called False Negative (Pregnant & “-”) Correct Rejection (Not Pregnant & “-”) The End Statistics in Correlations & Experiments • Correlations measure Relationship – Strength and direction of relatioship • Experiments measure the Differences – Statistical significance of the difference Correlation: Measuring Relationship • Sir Francis Galton (Uncle to Darwin – Development of behavioral statistics – Father of Eugenics – Science of fingerprints as unique – Retrospective IQ of 200 – Drove himself mad just to prove you could do it – Invented the pocket 2.3 The Science of Explanation • Measuring correlation – more-more/less-less – more-less/less-more • Correlation coefficient – – – – measure of direction & strength r = 1 r = -1 r = 0 13 Correlation • What does correlation coefficient mean? 14 2.3 The Science of Explanation • Experiment—2 critical features • (1) Manipulation – independent variable – dependent variable—measured – Control Group Condition (or Variable) – Experimental Group Condition (or Variable) • (2) Randomization - controls for a 3rd variable (you know exists but are not interested in) – versus self-selection 17 Dependent Variables Without Demand Charcteristics • DVs that aren’t subject to biased responses • Examples: – Is a painting in a museum popular? • There will be increased wear on the carpet near it. – Did a dental flossing lecture work? • Students will have cleaner teeth the next day. – Did a safer sex intervention for commercial sex workers work? • There will be more condoms discarded in the park they work in. Variation in IV Causes Variation in DV 1. Cause → Effect: whenever IV occurs, outcome DV should result. Safe sex intervention Condoms in Park 2. Cause absent → Effect absent No SS intervention no condoms 3. Cause variation → Effect variation More or better interventions more condoms in park Experimental & Control Groups • Experimental Condition: Cause is valid – E.g., drug, alcohol • Control Condition: cause • Essence of experiment is to is invalid control conditions beforehand – Placebo, juice The Science of Observation • Validity—able to draw accurate inferences – construct validity: e.g., describing what intelligence is and is not, “construct” refers to the “theory” – predictive validity: over time you find X predicts Y • Reliability—same result each time? - Test/Re-Test - Parallel - Inter-Item 21 Statistical Significance • A finding is statistically significant if the data differ from what we would expect from chance alone, if there were, in fact, no actual difference. • They may not be significant in the sense of big, important differences, but they occurred with a probability below the critical cutoff value, usually a z-score or p < .05 • Reject or Fail to Reject the NULL Hypothesis Graphing Frequency Discrete: Histogram Continuous: Frequency Polygon Stem-and-Leaf: Exam 1 & 3 Selection of ranges & bins like Histogram, but usually simpler. These plots represent the scores on an exam given to two different sections for the same course.