Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
1/30/2015 TheBigPicture STAT250 Dr.KariLockMorgan DescribingData: OneQuantitativeVariable Population Sample SECTIONS2.2,2.3 • Onequantitativevariable(2.2,2.3) Statistics:UnlockingthePowerofData Sampling Statistical Inference Lock5 Descriptive Statistics Statistics:UnlockingthePowerofData Lock5 Obesity Trends* Among U.S. Adults BRFSS, 1990, 2000, 2010 DescriptiveStatistics (*BMI 30, or about 30 lbs. overweight for 5’4” person) Inordertomakesenseofdata,weneedwaysto summarize andvisualize it 2000 1990 Summarizingandvisualizingvariablesand relationshipsbetweentwovariablesisoftenknown asdescriptivestatistics(alsoknownasexploratory dataanalysis) 2010 Typeofsummarystatisticsandvisualization methodsdependonthetypeofvariable(s)being analyzed(categoricalorquantitative) Today:Onequantitativevariable Statistics:UnlockingthePowerofData No Data Lock5 ObesityinAmerica <10% 10%–14% 15%–19% 20%–24% 25%–29% ≥30% Source: Behavioral Risk Factor Surveillance System, CDC. BehavioralRiskFactorSurveillanceSystem ObesityisaHUGEprobleminAmerica We’llexplorethiswithtwodifferenttypesof data,bothcollectedbytheCDC: Proportionofadultswhoareobeseineachstate BMIforarandomsampleofAmericans http://www.cdc.gov/obesity/data/table‐adults.html Statistics:UnlockingthePowerofData Lock5 Statistics:UnlockingthePowerofData Lock5 1 1/30/2015 ObesitybyState Dotplot Inadotplot,eachcaseisrepresentedby adotanddotsarestacked. Easywaytoseeeachcase Minitab: Graph -> Dotplot -> One Y -> Simple Statistics:UnlockingthePowerofData Lock5 Statistics:UnlockingthePowerofData Histogram Lock5 Shape Theheightoftheeachbarcorrespondstothe numberofcaseswithinthatrangeofthevariable Long right tail 5stateswith obesityrate between 33.25and 33.75 Symmetric Right‐Skewed Left‐Skewed Minitab: Graph -> Histogram -> Simple Statistics:UnlockingthePowerofData Lock5 NationalHealthandNutrition ExaminationSurvey Statistics:UnlockingthePowerofData Statistics:UnlockingthePowerofData Lock5 BMIofAmericans Lock5 Statistics:UnlockingthePowerofData Lock5 2 1/30/2015 BMIofAmericans Notation ThedistributionofBMIforAmericanadultsis a) Symmetric b) Left‐skewed c) Right‐skewed Thesamplesize,thenumberofcasesinthe sample,isdenotedbyn Weoftenletx ory standforanyvariable,andx1 ,x2 ,…,xn representthen valuesofthevariablex x1 =32.4,x2 =28.4,x3 =26.8,… Statistics:UnlockingthePowerofData Lock5 Mean Statistics:UnlockingthePowerofData Lock5 Mean Themean oraverageofthedatavaluesis ⋯ Theaverageobesityrateacrossthe50statesisµ=28.606. ∑ TheaverageBMIforAmericansinthissampleis ̅ 24.887. Samplemean: ̅ Populationmean: (“mu”) Minitab: Stat -> Basic Statistics -> Display Descriptive Statistics Statistics:UnlockingthePowerofData Lock5 Median Statistics:UnlockingthePowerofData Lock5 MeasuresofCenter Forsymmetricdistributions,themeanandthe Themedian,m,isthemiddlevaluewhenthe dataareordered. Ifthereareanevennumberofvalues,the medianistheaverageofthetwomiddlevalues. medianwillbeaboutthesame Forskeweddistributions,themeanwillbe morepulledtowardsthedirectionofskewness Themediansplitsthedatainhalf. Minitab: Stat -> Basic Statistics -> Display Descriptive Statistics Statistics:UnlockingthePowerofData Lock5 Statistics:UnlockingthePowerofData Lock5 3 1/30/2015 MeasuresofCenter Skewness andCenter m=24.163 Adistributionisleft‐skewed.Whichmeasureof centerwouldyouexpecttobehigher? Meanis“pulled” =24.887 inthedirection ofskewness Statistics:UnlockingthePowerofData a) Mean b) Median Lock5 Statistics:UnlockingthePowerofData Lock5 Outliers Outlier Anoutlier isanobservedvaluethat isnotablydistinctfromtheother valuesinadataset. Moreinfohere Statistics:UnlockingthePowerofData Lock5 Resistance Whenusingstatisticsthatarenotresistantto outliers,stopandthinkaboutwhetherthe outlierisamistake Themedianisresistantwhilethemeanisnot. With Outlier WithoutOutlier Statistics:UnlockingthePowerofData Lock5 Outliers Astatisticisresistant ifitis relativelyunaffectedbyextreme values. Mean 105.22 102.56 Statistics:UnlockingthePowerofData Median 101.0 100.5 Lock5 Ifnot,youhavetodecidewhethertheoutlier ispartofyourpopulationofinterestornot Usually,foroutliersthatarenotamistake,it’s besttoruntheanalysistwice,oncewiththe outlier(s)andoncewithout,toseehowmuch theoutlier(s)areaffectingtheresults Statistics:UnlockingthePowerofData Lock5 4 1/30/2015 StandardDeviation StandardDeviation Thestandarddeviation fora quantitativevariablemeasuresthe spreadofthedata Thestandarddeviationgivesaroughestimate ∑ ̅ ofthetypicaldistanceofadatavaluesfrom themean Thelargerthestandarddeviation,themore 2 variabilitythereisinthedataandthemore spreadoutthedataare 1 Samplestandarddeviation:s Populationstandarddeviation: (“sigma”) Minitab: Stat -> Basic Statistics -> Display Descriptive Statistics Statistics:UnlockingthePowerofData Lock5 Statistics:UnlockingthePowerofData 95%Rule 150 -5 0 5 150 -10 10 15 Forapopulation,95%ofthedatawillbebetween s4 µ– 2 andµ+2 0 50 Frequency Ifadistributionofdataisapproximately symmetricandbell‐shaped,about95% ofthedatashouldfallwithintwo standarddeviationsofthemean. s 1 0 50 Frequency StandardDeviation -15 Lock5 Forasample,95%ofthedatawillbebetween -15 -10 -5 0 5 10 ̅ 15 2 and ̅ 2 Bothofthesedistributionsarebell‐shaped Statistics:UnlockingthePowerofData Lock5 The95%Rule Statistics:UnlockingthePowerofData Lock5 95%Rule Giveanintervalthatwilllikelycontain95%of obesityratesofstates. Statistics:UnlockingthePowerofData Lock5 Statistics:UnlockingthePowerofData Lock5 5 1/30/2015 95%Rule Couldweusethesamemethodtogetan intervalthatwillcontain95%ofBMIsof Americanadults? 150 s 1 0 50 Frequency The95%Rule a) Yes b) No -2 -1 0 1 2 3 150 s4 0 50 Frequency -3 -15 -10 -5 0 5 10 15 StatKey Statistics:UnlockingthePowerofData Lock5 Thestandard deviationforhoursof sleeppernightis closestto Statistics:UnlockingthePowerofData Lock5 ToDo The95%Rule a) b) c) d) e) Statistics:UnlockingthePowerofData ReadSections2.2and2.3 DoHomework2.2(dueFriday,2/6) ½ 1 2 4 Ihavenoidea Lock5 Statistics:UnlockingthePowerofData Lock5 6