Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Taylor's law wikipedia , lookup
Psychometrics wikipedia , lookup
Foundations of statistics wikipedia , lookup
Statistical inference wikipedia , lookup
History of statistics wikipedia , lookup
Omnibus test wikipedia , lookup
Analysis of variance wikipedia , lookup
Resampling (statistics) wikipedia , lookup
Laboratory for Interdisciplinary Statistical Analysis Anne Ryan [email protected] Virginia Tech Laboratory for Interdisciplinary Statistical Analysis 1948: The Statistical Laboratory was founded as a division of the Virginia Agricultural Experiment Station to help agronomists design experiments and calculate sums of squares. www.lisa.stat.vt.edu Laboratory for Interdisciplinary Statistical Analysis 1949: Based on the success of the Statistical Laboratory, the Department of Statistics at Virginia Polytechnic Institute (VPI) was founded—the 3rd oldest statistics department in the United States. www.lisa.stat.vt.edu Laboratory for Interdisciplinary Statistical Analysis 1973: The Statistical Laboratory was re-formed as the Statistical Consulting Center to assist with statistical analyses in every college of Virginia Polytechnic Institute & State University (VPI&SU). www.lisa.stat.vt.edu Laboratory for Interdisciplinary Statistical Analysis 2007: The Graduate Student Assembly led a movement to save statistical consulting and collaboration from death by budget cuts, ensuring that graduate students could receive help with their research. The College of Science, Provost, Vice President of Research, Graduate School, and six additional colleges agreed that researchers should be able to receive free statistical consulting and collaboration. www.lisa.stat.vt.edu Laboratory for Interdisciplinary Statistical Analysis 2008: The Statistical Consulting Center was re-organized as the Laboratory for Interdisciplinary Statistical Analysis (LISA) to collaborate with researchers across the Virginia Tech (VT) campuses. www.lisa.stat.vt.edu Laboratory for Interdisciplinary Statistical Analysis Year 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 Clients Hours 299 293 321 304 274 211 171 190 895 719 1124 1368 1938 2220 2192 1775 495 541 965 2184 3093 4420 Established in 2008 www.lisa.stat.vt.edu Laboratory for Interdisciplinary Statistical Analysis 1200 900 600 1368 1938 2220 2192 1775 495 541 965 2184 3093 4420 300 299 293 321 304 274 211 171 190 895 719 1124 0 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 Clients Hours Clients per year Year 2000 2002 2004 2006 2008 Year www.lisa.stat.vt.edu 2010 Laboratory for Interdisciplinary Statistical Analysis 5000 4000 3000 2000 1368 1938 2220 2192 1775 495 541 965 2184 3093 4420 1000 299 293 321 304 274 211 171 190 895 719 1124 0 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 Clients Hours Hours per year Year 2000 2002 2004 2006 2008 Year www.lisa.stat.vt.edu 2010 Laboratory for Interdisciplinary Statistical Analysis LISA helps VT researchers benefit from the use of Statistics Experimental Design • Data Analysis • Interpreting Results Grant Proposals • Software (R, SAS, JMP, SPSS...) Our goal is to improve the quality of research and the use of statistics at Virginia Tech. www.lisa.stat.vt.edu 10 Laboratory for Interdisciplinary Statistical Analysis LISA helps VT researchers benefit from the use of Statistics Collaboration LISA statisticians meet with faculty, staff, and graduate students to understand their research and think of ways to help them using statistics. www.lisa.stat.vt.edu 11 Laboratory for Interdisciplinary Statistical Analysis LISA helps VT researchers benefit from the use of Statistics Collaboration Walk-In Consulting Every day from 1-3PM clients get answers to their (quick) questions about using statistics in their research. www.lisa.stat.vt.edu 12 Laboratory for Interdisciplinary Statistical Analysis LISA helps VT researchers benefit from the use of Statistics Collaboration Walk-In Consulting Short Courses Short Courses are designed to teach graduate students how to apply statistics in their research. www.lisa.stat.vt.edu 13 Laboratory for Interdisciplinary Statistical Analysis LISA helps VT researchers benefit from the use of Statistics Collaboration Walk-In Consulting Short Courses All services are FREE for VT researchers. We assist with research—not class projects or homework. www.lisa.stat.vt.edu 14 How can LISA help? • Formulate research question. • Screen data for integrity and unusual observations. • Implement graphical techniques to showcase the data – what is the story? • Develop and implement an analysis plan to address research question. • Help interpret results. • Communicate! Help with writing the report or giving the talk. • Identify future research directions. Laboratory for Interdisciplinary Statistical Analysis To request a collaboration meeting go to www.lisa.stat.vt.edu www.lisa.stat.vt.edu Laboratory for Interdisciplinary Statistical Analysis To request a collaboration meeting go to www.lisa.stat.vt.edu 1. Sign in to the website using your VT PID and password. 2. Enter your information (email address, college, etc.) 3. Describe your project (project title, research goals, specific research questions, if you have already collected data, special requests, etc.) 4. Wait 0-3 days, then contact the LISA collaborators assigned to your project to schedule an initial meeting. www.lisa.stat.vt.edu Laboratory for Interdisciplinary Statistical Analysis www.lisa.stat.vt.edu Laboratory for Interdisciplinary Statistical Analysis • • Introduction to R R is a free software environment for statistical computing and graphics. Download: http://www.r-project.org/ Topics Covered: • • • Data objects in R, loops, import/export datasets, data manipulation Graphing Basic Analyses: T-tests, Regression, ANOVA www.lisa.stat.vt.edu Laboratory for Interdisciplinary Statistical Analysis • • Linear Regression & Structural Equation Monitoring Linear regression is used to model the relationship between a continuous response and a continuous predictor. SEM is a modeling technique that investigates causal relationships among variables. • Time –related latent variables, modification indices and critical ratio in exploratory analyses, and computation of implied moments, factor score weights, total effects, and indirect effects. www.lisa.stat.vt.edu Laboratory for Interdisciplinary Statistical Analysis Generalized Linear Models • • • • Modeling technique for situations where the errors are not necessarily normal. Can handle situations where you have binary responses, counts, etc. Uses a link function to relate the response to the linear model. Cover: Basic statistical concepts of GLM and how it relates to regression using normal errors. www.lisa.stat.vt.edu Laboratory for Interdisciplinary Statistical Analysis • Mixed Models and Random Effects Mixed Model: A statistical model that has both random effects and fixed effects. • • Fixed Effect: Levels of the factor are predetermined. Random Effect: Levels of the factor were chosen at random. The primary focus of the course will be to identify scenarios where a mixed model approach will be appropriate. The concepts will be explained almost wholly through examples in SAS or in R. www.lisa.stat.vt.edu Anne Ryan 23 Defense: Represent the accused (defendant) Prosecution: Hold the “Burden of Proof”—obligation What’s the Assumed Conclusion? to shift the assumed conclusion from an oppositional opinion to one’s own position through evidence ANSWER: The accused is innocent until proven guilty. •Prosecution must convince the judge/jury that the defendant is guilty beyond a reasonable doubt 24 Burden of Proof—Obligation to shift the conclusion using evidence Hypothesis Test Accept the status quo (what is believed before) until the data suggests otherwise Trial Innocent until proven guilty 25 Decision Criteria Hypothesis Test Occurs by chance less than 100α% of the time (ex: 5%) Trial Evidence has to convincing beyond a reasonable 26 Hypothesis Test: Procedure for examining a claim about the value of a parameter ◦ i.e. 𝜇 = 3.5 Hypothesis tests are very methodical with several key pieces. 27 1. Test 2. Assumptions 3. Hypotheses 4. Mechanics 5. Conclusion 28 State the name of the testing method to be used It is important to not be off track in the very beginning Hypothesis Tests we will Perform: ◦ ◦ ◦ ◦ One Sample t test for μ Two sample t test for μ Paired t test ANOVA 29 List all the assumptions required for your test to be valid. All tests have assumptions Even if assumptions are not met you should still comment on how this affects your results. 30 State the hypothesis of interest There are two hypotheses ◦ Null Hypothesis: Denoted H 0 ◦ Alternative Hypothesis: Denoted H1 or H a Examples of possible hypotheses: H 0 : 13 vs. H a : 13 31 For hypothesis testing there are three popular versions of testing ◦ Left Tailed Hypothesis Test ◦ Right Tailed Hypothesis Test ◦ Two Tailed or Two Sided Hypothesis Test 32 1. Left Tailed Hypothesis Test: Researchers are only interested in whether the true value is below the hypothesized value. e.g— H 0 : 0 vs. H a : 0 2. Right Tailed Hypothesis Test: Researchers are only interested in whether the True Value is above the hypothesized value. e.g.– 𝐻0 : 𝜇 ≤ 𝜇0 𝑣𝑠. 𝐻𝑎 : 𝜇 > 𝜇0 33 3. Two Tailed or Two Sided Hypothesis Test: The researcher is interested in looking above and below they hypothesized value. H 0 : 0 vs. H a : 0 34 Three Requirements for Stating Hypotheses: 1. Two complementary hypotheses. and or and 2. A parameter about which the test is to be based e.g.—μ 3. Hypothesized Value for parameter Denoted 𝜇0 but generally takes on numeric values in practice 35 Computational Part of the Test What is part of the Mechanics step? ◦ ◦ ◦ ◦ Stating the Significance Level Finding the Rejection Rule Computing the Test Statistic Computing the p-value 36 Significance Level: Here we choose a value to use as the significance level, which is the level at which we are willing to start rejecting the null hypothesis. Denoted by α Default value is α=.05, use α=.05 unless otherwise noted! 37 Rejection Rule: State our criteria for rejecting the null hypothesis. ◦ “Reject the null hypothesis if p-value<.05”. p-value: The probability of obtaining a point estimate as “extreme” as the current value where the definition of “extreme” is taken from the alternative hypotheses assuming the null hypothesis is true. 38 Test Statistic: Compute the test statistic, which is usually a standardization of your point estimate. Translates your point estimate, a statistic, to follow a known distribution so that is can be used for a test. 39 p-value: After computing the test statistic, now you can compute the p-value. Use software to compute p-values. 40 Conclusion: Last step of the hypothesis test just like it is the last step when computing confidence intervals. Conclusions should always include: ◦ Decision: reject or fail to reject ◦ Linkage: why you made the decision (interpret pvalue) ◦ Context: what your decision means in context of the problem. 41 Note: Your decision can only be one of two choices: 1. Reject H 0 --data gives strong indication that H a is more likely 2. Fail to Reject H 0 --data gives no strong indication that H a is more likely When conducting hypothesis tests, we assume that H 0 is true, therefore the decision CAN NOT be to accept the null hypothesis 42 43 Used to test whether the population mean is different from a specified value. Example: Is the mean height of 12 year old girls greater than 60 inches? http://office.microsoft.com/en-us/images 44 The population mean is not equal to a specified value. Null Hypothesis, H0: μ = μ0 Alternative Hypothesis: Ha: μ ≠ μ0 The population mean is greater than a specified value. H0: μ = μ0 Ha: μ > μ0 The population mean is less than a specified value. H0: μ = μ0 Ha: μ < μ0 45 The sample is random. The population from which the sample is drawn is either normal or the sample size is large. 46 Step 3: Calculate the test statistic: y 0 t s/ n Where n s y i 1 y 2 i n 1 Step 4: Calculate the p-value based on the appropriate alternative hypothesis. Step 5: Write a conclusion. 47 A researcher would like to know whether the mean sepal width of a variety of irises is different from 3.5 cm. Use 𝛼 = 0.05. The researcher randomly selects 50 irises and measures the sepal width. Step 1: Hypotheses H0: μ = 3.5 cm Ha: μ ≠ 3.5 cm http://en.wikipedia.org/wiki/Iris_flower_data_set 48 Steps 2-4: JMP Demonstration Analyze Distribution Y, Columns: Sepal Width Normal Quantile Plot Test Mean Specify Hypothesized Mean: 3.5 49 Step 5 Conclusion: Fail to reject 𝐻𝑜 since the p-value=0.1854 is greater than 0.05. There is significant sample evidence to indicate that the mean sepal width is not different from 3.5 cm. 50 51 Two sample t-tests are used to determine whether the population mean of one group is equal to, larger than or smaller than the population mean of another group. Example: Is the mean cholesterol of people taking drug A lower than the mean cholesterol of people taking drug B? 52 The population means of the two groups are not equal. H0: μ1 = μ2 Ha: μ1 ≠ μ2 The population mean of group 1 is greater than the population mean of group 2. H0: μ1 = μ2 Ha: μ1 > μ2 The population mean of group 1 is less than the population mean of group 2. H0: μ1 = μ2 Ha: μ1 < μ2 53 The two samples are random and independent. The populations from which the samples are drawn are either normal or the sample sizes are large. The populations have the same standard deviation. 54 Step 3: Calculate the test statistic y1 y2 t 1 1 sp n1 n2 (n1 1) s12 (n2 1) s22 where s p n1 n2 2 Step 4: Calculate the appropriate p-value. Step 5: Write a Conclusion. 55 A researcher would like to know whether the mean sepal width of setosa irises is different from the mean sepal width of versicolor irises. The researcher randomly selects 50 setosa irises and 50 versicolor irises and measures their sepal widths. Step 1 Hypotheses: H0: μsetosa = μversicolor Ha: μsetosa ≠ μversicolor http://en.wikipedia.org/ wiki/Iris_flower_data_set http://en.wikipedia.org/ wiki/Iris_versicolor 56 Steps 2-4: JMP Demonstration: Analyze Fit Y By X Y, Response: Sepal Width X, Factor: Species Means/ANOVA/Pooled t Normal Quantile Plot Plot Actual by Quantile 57 -2.33 -1.64 -1.28 -0.67 0.0 setosa 0.67 1.281.64 2.33 0.98 0.9 0.8 0.5 0.2 0.1 0.02 versicolor Normal Quantile Step 5 Conclusion: There is strong evidence (p-value < 0.0001) that the mean sepal widths for the two varieties are different. 58 59 The paired t-test is used to compare the population means of two groups when the samples are dependent. Example: A researcher would like to determine if background noise causes people to take longer to complete math problems. The researcher gives 20 subjects two math tests one with complete silence and one with background noise and records the time each subject takes to complete each test. 60 The population mean difference is not equal to zero. H0: μdifference = 0 Ha: μdifference ≠ 0 The population mean difference is greater than zero. H0: μdifference = 0 Ha: μdifference > 0 The population mean difference is less than a zero. H0: μdifference = 0 Ha: μdifference < 0 61 The sample is random. The data is matched pairs. The differences have a normal distribution or the sample size is large. 62 Step 3: Calculate the test Statistic: d 0 t sd / n Where d bar is the mean of the differences and sd is the standard deviations of the differences. Step 4: Calculate the p-value. Step 5: Write a conclusion. 63 A researcher would like to determine whether a fitness program increases flexibility. The researcher measures the flexibility (in inches) of 12 randomly selected participants before and after the fitness program. Step 1: Formulate a Hypothesis H0: μAfter - Before = 0 Ha: μ After - Before > 0 http://office.microsoft.com/en-us/images 64 Steps 2-4: JMP Analysis: Create a new column of After – Before Analyze Distribution Y, Columns: After – Before Normal Quantile Plot Test Mean Specify Hypothesized Mean: 0 65 Step 5 Conclusion: There is not evidence that the fitness program increases flexibility. 66 67 ANOVA is used to determine whether three or more populations have different distributions. A B C Medical Treatment 68 The first step is to use the ANOVA F test to determine if there are any significant differences among the population means. If the ANOVA F test shows that the population means are not all the same, then follow up tests can be performed to see which pairs of population means differ. 69 yij i ij Where yij is the response of the jth trial on the ith factor level i is the mean of the ith group ij ~ N (0, 2 ) i 1,, r j 1, , ni In other words, for each group the observed value is the group mean plus some random variation. 70 Step 1: We test whether there is a difference in the population means. H 0 : 1 2 r H a : The i are not all equal. 71 The samples are random and independent of each other. The populations are normally distributed. The populations all have the same standard deviations. The ANOVA F test is robust to the assumptions of normality and equal standard deviations. 72 C A B C A B Medical Treatment Compare the variation within the samples to the variation between the samples. 73 F Variation between Groups MSG Variation within Groups MSE Variation within groups small compared with variation between groups → Large F Variation within groups large compared with variation between groups → Small F 74 The mean square for groups, MSG, measures the variability of the sample averages. SSG stands for sums of squares groups. SSG MSG r -1 n1 ( y1 y ) 2 n 2 ( y2 y ) 2 n r ( y1 y ) 2 r -1 75 Mean square error, MSE, measures the variability within the groups. SSE stands for sums of squares error. SSE n-r (n 1 - 1)s12 (n 2 - 1)s 22 (n r - 1)s 2r n-r Where MSE ni si (y j 1 ij yi ) ni 1 76 Step 4: Calculate the p-value. Step 5: Write a conclusion. 77 A researcher would like to determine if three drugs provide the same relief from pain. 60 patients are randomly assigned to a treatment (20 people in each treatment). Step 1: Formulate the Hypotheses H0: μDrug A = μDrug B = μDrug C Ha : The μi are not all equal. http://office.microsoft.com/en-us/images 78 JMP demonstration Analyze Fit Y By X Y, Response: Pain X, Factor: Drug Normal Quantile Plot Plot Actual by Quantile Means/ANOVA 79 -2.33 -1.64 -1.28 -0.67 75 0.0 0.67 1.281.64 2.33 Drug B Drug Drug C A 65 60 0.98 0.9 Drug C 0.8 Drug B Drug 0.5 Drug A 0.2 50 0.1 55 0.02 Pain 70 Normal Quantile Step 5 Conclusion: There is strong evidence that the drugs are not all the same. 80 The p-value of the overall F test indicates that the level of pain is not the same for patients taking drugs A, B and C. We would like to know which pairs of treatments are different. One method is to use Tukey’s HSD (honestly significant differences). 81 Tukey’s test simultaneously tests H 0 : i i ' H a : i i ' for all pairs of factor levels. Tukey’s HSD controls the overall type I error. JMP demonstration Oneway Analysis of Pain By Drug Compare Means All Pairs, Tukey HSD 82 Level Drug C Drug C Drug B - Level Drug A Drug B Drug A Difference 5.850000 3.600000 2.250000 Std Err Dif 1.677665 1.677665 1.677665 Lower CL 1.81283 -0.43717 -1.78717 Upper CL 9.887173 7.637173 6.287173 p-Value 0.0027* 0.0897 0.3786 The JMP output shows that drugs A and C are significantly different. 83 84 We are interested in the effect of two categorical factors on the response. We are interested in whether either of the two factors have an effect on the response and whether there is an interaction effect. ◦ An interaction effect means that the effect on the response of one factor depends on the level of the other factor. 85 No Interaction Interaction Low High Dosage Drug A Drug B Improvement Improvement Drug A Drug B Low High Dosage 86 yijk i j ( ) ij ijk Where yijk is the response of the kth trial on the ith factor A level and the jth factor B level is the overall mean i is the main effect of the ith level of factor A j is the main effect of the jth level of factor B ( ) ij is the interactio n effect of the ith level of factor A and the jth level of factor B ijk ~ N (0, 2 ) i 1, , a j 1, , b k 1,..., nij 87 We would like to determine the effect of two alloys (low, high) and three cooling temperatures (low, medium, high) on the strength of a wire. JMP demonstration Analyze Fit Model Y: Strength Highlight Alloy and Temp and click Macros Factorial to Degree Run Model http://office.microsoft.com/en-us/images 88 Conclusion: There is strong evidence of an interaction between alloy and temperature. 89 The one sample t-test allows us to test whether the population mean of a group is equal to a specified value. The two-sample t-test and paired t-test allow us to determine if the population means of two groups are different. ANOVA allows us to determine whether the population means of several groups are different. 90 For information about using SAS, SPSS and R to do ANOVA: http://www.ats.ucla.edu/stat/sas/topics/anova .htm http://www.ats.ucla.edu/stat/spss/topics/anov a.htm http://www.ats.ucla.edu/stat/r/sk/books_pra. htm 91 Fisher’s Irises Data (used in one sample and two sample t-test examples). Flexibility data (paired t-test example): Michael Sullivan III. Statistics Informed Decisions Using Data. Upper Saddle River, New Jersey: Pearson Education, 2004: 602. 92 Special thanks to Jennifer Kensler for course materials and help with JMP! 93