Download Syllabus - KSU Web Home

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Psychometrics wikipedia , lookup

Bootstrapping (statistics) wikipedia , lookup

Taylor's law wikipedia , lookup

Foundations of statistics wikipedia , lookup

Gibbs sampling wikipedia , lookup

Time series wikipedia , lookup

History of statistics wikipedia , lookup

Student's t-test wikipedia , lookup

Resampling (statistics) wikipedia , lookup

Misuse of statistics wikipedia , lookup

Transcript
____________________________________________________________________________________
Syllabus for
ECON 2300, BUSINESS STATISTICS
Fridays, 8 am – 1:45 pm, BB-372, Summer 2005
Professor: Penny Verhoeven
e-mail: [email protected]
Office: BB-324
Office Phone: (770) 423-6537
Office Hours: M 9-10:30 am
W 2-3:30 pm
F 2-4 pm
____________________________________________________________________________________
Course Description from Current Catalog
ECON 2300. Business Statistics. 3-0-3.
Prerequisite: MATH 1101.
An introduction to descriptive and inferential statistics with an emphasis on business applications.
Topics covered include data summarization, probability distributions, sampling methods, confidence
intervals, hypothesis testing, online data sources, and ethics in research. Small case studies are used
to illustrate statistical applications within business settings.
Course Objectives
1. Convey methods for organizing, summarizing, and presenting data;
2. Convey methods for drawing conclusions about entire sets (populations) of data when only portions
(samples) of those data sets are examined, as well as the rationale behind those methods; and
3. Illustrate the role of data analysis in supporting the decision-making process in a variety of business
contexts.
4. Promote ethical conduct in research.
Course Material Available at prof’s website http://ksuweb.kennesaw.edu/~pverhoev :
 this Syllabus
 Unit One, Unit Two, and Unit Three packets, which contain: information to reduce note-taking in
class, answers to all homework problems, practice problems for tests; and formula sheets
 Online Data Sources: Search Guidelines
 Optional (for interested parties): Excel Instructions for performing various analyses using Excel.
Required Text: Essentials of Statistics for Business and Economics (3rd ed.), by D.R. Anderson, D.J.
Sweeney, and T.A. Williams, South-Western, 2003.
Required Ethics in Research Readings (will be distributed in class):
1) “Ethical Guidelines for Statistical Practice,” Committee on Professional Ethics of the American
Statistical Association (ASA), 1999. URL:
www.amstat.org/profession/index.cfm?fuseaction=ethicalstatistics.
2) Ethics case (for in-class group discussion) entitled “Ethics of Data Quality.” ASA Case Study #3.
URL: www.tcnj.edu/~ethcstat.
3) The following sections of On Being a Scientist: Responsible Conduct in Research, National
Academy of Sciences (NAS), Washington, D.C.: National Academy Press, 1995 (URL:
www.nap.edu/readingroom/books/obas/): (1) “Misconduct in Science;” and (2) “Responding to
Violations of Ethical Standards,” which contains the ethics case (for in-class group discussion)
entitled “A Career in the Balance.”
2
Other material required: calculator
Last day to withdraw without academic penalty: Wednesday, June 29. Current KSU policy regarding
withdrawals is in effect.
Tutoring: available in BB-292 (once determined, a tutoring schedule will be posted outside the door); also,
you can post questions about homework problems on WebCT at the ECON 2300/Common site and
tutors will respond to your questions
Determination of Grade in Course
 There will be three tests, weighted evenly.
 Relationship between average test score and letter grade: [90,100) = A, [80,90) = B, [70,80) = C,
[60,70) = D, below 60 = F.
 Only under compelling circumstances may a test be taken at other than the scheduled time.
Receiving a Grade of Incomplete: Current KSU policies for receiving (and subsequently removing) a
grade of incomplete are in effect. As stated on pages 42-43 of the KSU 2004-2005 Undergraduate
Catalog, an incomplete “…will be awarded only when the student has done satisfactory work up to the
last two weeks of the semester, but for non-academic reasons beyond his/her control is unable to meet the
full requirements of the course.”
Class Attendance and Homework
To maximize your learning, it is essential that you attend every class and complete all of your homework
assignments. If you must miss a class, you should--prior to approaching the professor should questions
about that day's coverage arise--get the notes from a classmate, study the notes and associated text
material, and work on (i.e., at least attempt) the homework problems assigned that day.
Notice regarding class enrollment: Students must exclusively attend the course section in which they are
officially enrolled. Your grade will be based on your performance in that course section. It is your
responsibility to check your registration form and confirm that you are attending the course and section in
which you are enrolled. There will be no adjustments made to class rolls once the Drop-Add period ends.
Academic Integrity
All work on the tests is to be your independent work.
Every KSU student is responsible for upholding the provisions of the Student Code of Conduct, as
published in the Undergraduate and Graduate Catalogs. Section II of the Student Code of Conduct
addresses the University's policy on academic honesty, including provisions regarding plagiarism and
cheating, unauthorized access to University materials, misrepresentation/falsification of University
records or academic work, malicious removal, retention, or destruction of library materials,
malicious/intentional misuse of computer facilities and/or services, and misuse of student identification
cards. Incidents of alleged academic misconduct will be handled through the established procedures
of the University Judiciary Program, which includes either an "informal" resolution by a faculty
member, resulting in a grade adjustment, or a formal hearing procedure, which may subject a student
to the Code of Conduct's minimum one semester suspension requirement.
Posting of Grades: You may access your course grade via Owl Express at www.kennesaw.edu by 4 pm on
Saturday, July 30. By departmental policy, no grades will be given over the phone. Your graded third
test may be picked up during the Fall 2005 term; alternatively, if you submit a stamped, self-addressed
envelope with your third test, your graded test will be mailed to you.
3
COURSE OUTLINE
_______________________________________________________________________________________
Friday
Class
Topics and associated readings (from Anderson et al text unless otherwise specified),
Homework for ensuing week (homework is from the EXERCISES sections of the Anderson
et al text unless otherwise indicated)
_______________________________________________________________________________________
June 3
Topics: Types of Data; Frequency, Relative Frequency, and Percent Frequency Distributions;
Summary Measures; Probability Concepts; Discrete Probability Distributions
Readings: Chapter 1, Sections 2.1-2.2, Sections 3.1-3.3, Sections 4.1-4.4, Sections 5.1-5.3
Homework: Ch.1: 5, 6, 14, 22. Ch. 2: 9 [in part c, construct the bar graph portraying the frequency
distribution; in part d, construct the pie chart portraying the percent frequency distribution], 18e [construct the
histogram portraying the percent frequency distribution], 19 a,b [also portray the frequency distribution as a
histogram], 20, 21b [also portray the relative frequency distribution as a histogram], 37 [in part c, construct
the bar graph portraying the frequency distribution]. Ch. 3: 5a-c, 12, 22, 25, 26. Ch. 4: 10a-d, 18,
19, 25, 28, 32, 34[assume that the blood type of one spouse is independent of that of the
other spouse], 35, 37. Ch. 5: 17, 21[treat the given probability distributions as estimates of the true
probability distributions, and thus treat your answers as estimates].
◄ In preparation for the next class, read Ethics in Research Readings 1) and 2).
June 10
Topics: Online Data Sources; Ethics in Research; the Binomial Distribution; Continuous
Probability Distributions; the Normal Distribution
Readings: Ethics in Research Readings 1) and 2), Section 5.4, p. 227 (first half), Section 6.2
Homework: Take-home problem on online data sources for Test #1 (due next class).
Ch. 5: 28, 31, 34, 59. Ch. 3: 32[apply the Empirical Rule], 33[apply the Empirical Rule], 34. Ch. 6:
9b-c[apply the Empirical Rule], 12, 13, 20, 22[in part b, change, in the question, “top 20%” to “top 10%”],
25, 37.
June 17
Review for Test #1 (8-10 am). Test #1 (on objectives 1-11) begins at 11 am. Take-home
Test #1 problem on online data sources due by 11 am.
◄ Bring to the next class a copy of the Unit Two packet, available at prof’s website
June 24
Topics: Sampling Methods; Point Estimates; Sampling Distribution of the Mean; Central
Limit Theorem; Interval Estimation of a Population Mean; Student’s t distribution; Sampling
Distribution of the Proportion; Interval Estimation of a Population Proportion; Determining
an appropriate sample size
Readings: Sections 7.1-7.3, 7.4-7.7, 8.1-8.4, Case Problem 3 in Chapter 8
Homework: Ch. 7: 3, 6, 13, 15, 16, 17. Ch. 8: 10, 11(x = 6.34 , s = 2.16),
12(x = 3.8 , s = 2.3), 17, 19(x = 6.5 , s = .5), 21, 27, 29, 36, 41, 51, 54, 56, 59a, 60.
◄ In preparation for the next class, read Ethics in Research Reading 3).
4
_____________________________________________________________________________________________________
Friday
Class
Topics and associated readings (from Anderson et al text unless otherwise specified),
Homework for ensuing week (homework is from the EXERCISES sections of the Anderson
et al text unless otherwise indicated)
_______________________________________________________________________________________
July 1
Topics: Online Data Sources; Interval Estimation of the Difference between Two Population
Means; Interval Estimation of the Difference between Two Population Proportions; Testing
Hypotheses about a Population Mean; Ethics in Research; Testing Hypotheses about a
Population Proportion; Type I and Type II Errors
Readings: Sections 10.1, 11.1, Case Problem 1 in Chapter 10 (questions 3-5),
Sections 9.1-9.6, Case Problem 2 in Chapter 9, Ethics in Research Reading 3)
Homework*: Take-home problem on online data sources for Test #2 (due next class). Ch.
10: 5, 6[note: for Miami, x = 6.34 , s = 2.16; for LA, x = 6.72, s = 2.37], 7a,c. Ch. 11: 3, 8b, 31b.
Ch. 9: 5, 6, 7, 8, 16, 17, 19, 25, 30, 38[assume normally distributed expenditures by
Corning households], 41[assume normally distributed driving distances], 42[assume
normally distributed planting times], 48, 50, 59, 66, 68[is there evidence that the proportion is now
below .47?].
*note: For all hypothesis testing problems in this course: (1) test the given or implied null
hypothesis using the p-value approach to hypothesis testing, and (2) if the authors specify a
significance level , determine whether the null hypothesis would be rejected at that
significance level.
July 8
Review for Test #2 (8-10 am). Test #2 (on objectives 5, 12-21) begins at 11 am. Takehome Test #2 problem on online data sources due by 11 am.
◄ Bring to the next class a copy of the Unit Three packet, available at prof’s website
July 15
Topics: Chi-square test of independence; Covariance and Correlation Coefficient; Simple
Linear Regression
Readings: Sections 11.3, 3.5, 12.1-12.5, 12.8
Homework: Ch. 11: 21, 22, 40. Ch. 3: 50, 52, 69b. Refer to the Regression Homework in
the Unit Three packet and work problems 1a-f, 2a-f, and 3a-f.
July 22
Topics: Simple Linear Regression; Discussion of Multiple Linear Regression; Review for
Test #3
Readings: Section 12.6
Homework: Refer to the Regression Homework in the Unit Three packet and work
problems 1g-j, 2g-j, 3g-j, and 4a-j.
July 29
Test #3 (on objectives 22-29) from 8:30-10:30 am (time reserved for final exam)
5
LEARNING OBJECTIVES FOR ECON 2300
At the end of this course, the student will be able to:
1.
Distinguish between, and both recognize and provide examples of:
(a) populations and samples
(b) parameters and statistics
(c) descriptive statistics and inferential statistics
(d) qualitative data and quantitative data
(e) cross-sectional data and time series data
(f) discrete variables and continuous variables
2.
Identify what each of the following symbols represents, and recognize each symbol’s referent in a
verbal passage:
(a) N
(e) n
(i) p
(b) 
(f)x
(j)p
(c) ²
(g) s²
(d) 
(h) s
3.
Summarize a set of quantitative data by:
(a) constructing a frequency distribution and portraying the frequency distribution in the form of a
histogram
(b) constructing a relative frequency distribution and portraying the relative frequency distribution in
the form of a histogram
(c) constructing a percent frequency distribution and portraying the percent frequency distribution in
the form of a histogram
(d) determining the mean, median, mode, variance, standard deviation, and coefficient of variation of
the data set
4.
Summarize a set of qualitative data by:
(a) constructing a frequency distribution and portraying the frequency distribution in the form of a bar
chart or Pareto diagram or pie chart
(b) constructing a relative frequency distribution and portraying the relative frequency distribution in
the form of a bar chart of Pareto diagram or pie chart
(c) constructing a percent frequency distribution and portraying the percent frequency distribution in
the form of a bar chart or Pareto diagram or pie chart
5.
Obtain requested data from the following websites: www.bea.gov, www.bls.gov, www.census.gov,
www.sec.gov, www.stlouisfed.org, http://unstats.un.org
6.
With respect to probability concepts:
(a) distinguish between the classical, relative frequency, and subjective methods of assigning
probabilities to experimental outcomes
(b) define what it means for two events to be independent, and identify whether or not two events are
independent
(c) define what it means for two events to be mutually exclusive
(d) apply basic probability identities (rules) to calculate probabilities
7.
Determine the expected value (mean), variance, and standard deviation of a discrete random variable
from its probability distribution.
6
8.
Answer a probability question about a variable having a binomial distribution.
9.
Answer a probability question about a variable having a normal distribution.
10.
Identify four characteristics true of all normal distributions.
11.
Apply the Empirical Rule to answer questions about, or describe, a quantitative data set whose
distribution is bell-shaped (thus normal or approximately normal) and for which the mean and
standard deviation are known.
12.
Suggest four reasons why, to obtain information about populations, one might draw samples from the
populations rather than examining the populations in their entirety.
13.
Define what is meant by a simple random sample of size n, and draw such a sample from a given
population.
14.
With respect to sampling distributions:
(a) state the Central Limit Theorem
(b) graph one population distribution and three associated sampling distributions of the mean that
exemplify the Central Limit Theorem
15.
Define what is meant by a C% confidence interval for a population parameter.
16.
Identify four characteristics true of all t-distributions.
17.
Obtain an interval estimate for:
(a) the mean of a population (excluding situations where a small sample is drawn from a nonnormally distributed population)
(b) the proportion of a population falling in a certain category (excluding situations where a small
sample is drawn)
(c) the difference between the means of two populations (excluding situations where small samples
are drawn)
(d) the difference between the proportions of two populations falling in a certain category (excluding
situations where small samples are drawn)
18.
Determine how many entities should be sampled to obtain an interval estimate--for a mean or
proportion--having a given level of confidence and a given margin of error.
19.
Test--using the p-value approach--a hypothesis about:
(a) the mean of a population (excluding situations where a small sample is drawn from a nonnormally distributed population); or
(b) the proportion of a population falling in a certain category (excluding situations where
a small sample is drawn)
20.
In the context of hypothesis testing:
(a) distinguish between the significance level  and the p-value p
(b) based on the p-value, determine whether the null hypothesis would be rejected at a given
significance level 
7
21.
Distinguish, in the context of hypothesis testing, between a Type I error and Type II error, and, for a
particular null hypothesis, suggest one negative consequence of making a Type I error and one
negative consequence of making a Type II error.
22.
Given a scenario about real (or potential) ethical misconduct in statistical practice:
(a) identify an ethical guideline of the American Statistical Association that is (or may be) being
violated, and explain how that guideline is (or may be) being violated
(b) provide pros and cons for two proposed courses of action by a party involved in the scenario.
23.
Perform a chi-square test of independence.
24.
With respect to the covariance and the (Pearson product-moment) correlation coefficient:
(a) describe what each measures
(b) calculate each measure given a sample of n (x,y) data points or a population of N (x,y) data points
(c) given a sample of (x,y) data points depicted in a scatter diagram, provide a reasonable estimate of
the sample correlation coefficient
(d) given a description of two variables, identify which type of correlation between the variables is
expected: positive, negative, or none.
25.
Describe three potential purposes of regression analysis, and differentiate between simple and
multiple regression analysis.
26.
With respect to the (classical normal) simple linear regression model:
(a) express (symbolically) the model
(b) describe the meaning of the residual term in the model
(c) state the model assumptions
(d) distinguish between the population regression equation and sample regression equation
27.
Describe, in the context of regression analysis, the least squares criterion for arriving at the “bestfitting” sample regression equation.
28.
Given an Excel printout containing a sample of (x,y) data points, a scatter diagram of the data points,
and the results of applying Excel’s REGRESSION tool:
(a) write down the sample regression equation
(b) graph (superimposed on the scatter diagram) the sample regression equation, and label two points
on that line
(c) locate (in the Excel output) and interpret (in the context of the problem) se, the standard error of
the estimate
(d) refer to the residual plot (in the Excel output) and make a judgment as to whether any (and, if so,
which) of the model assumptions appear to be
(e) verbalize what each of SST, SSR, and SSE represent in the context of the problem, and locate (in
the Excel output) and interpret (in the context of the problem) r2
(f) test for a linear relationship between the dependent and independent variables
(g) obtain a point estimate for the mean value of the dependent variable across all entities having a
specified value for the independent variable
(h) obtain a point estimate for the value of the dependent variable for a single entity having a
specified value for the independent variable
(i) interpret the sample regression coefficient (slope of the sample regression line) in the context of
the problem
(j) suggest two additional independent variables which might enable one to better predict or estimate
the value of the dependent variable.
29. Identify 4 precautions associated with regression analysis.