Download 1 Learning Team Reflection: Final Reflection Learning Team

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Bootstrapping (statistics) wikipedia , lookup

Taylor's law wikipedia , lookup

Foundations of statistics wikipedia , lookup

Psychometrics wikipedia , lookup

Omnibus test wikipedia , lookup

Resampling (statistics) wikipedia , lookup

Misuse of statistics wikipedia , lookup

Student's t-test wikipedia , lookup

Transcript
Learning Team Reflection: Final Reflection
Learning Team Reflection: Final Reflection
QNT/351
University of Phoenix
1
Learning Team Reflection: Final Reflection
2
Hypothesis Testing Strategy
Throughout this course our team has performed different descriptive and inferential statistical
analyses in reference to the prices and sizes of homes. Raw data for these analyses has come
from the Real Estate dataset, which contains 105 entries. The paper for this week will performed
a one-sample test of hypothesis against a hypothesized value. The sample used in the analysis
corresponds to the prices of homes with sizes greater than 2200 sq. ft. Our hypothesis is that the
mean price of these homes is greater than $213,600. The test is done with the five-step
hypothesis testing methodology.
Research Question and Hypothesis
Do larger homes command higher prices? This is the research question our team will answer
through a hypothesis test. We will use a one-sample test of hypothesis to examine the research
question. Looking at the descriptive statistics results from prior weeks we know that the median
home price according to the Real Estate dataset is $213,600, and the median size is 2200 sq. ft.
Our hypothesis will be that the average price of homes larger than 2200 sq. ft. is greater than
$213,600. In other words, we are hypothesizing the mean price of homes larger than the median
home is greater than the median home price.
Verbally, the null and alternate hypotheses are:
Ho : the mean price of homes larger than 2200 sq. ft. is lower than or equal to $213,600.
H1 : the mean price of homes larger than 2200 sq. ft. is greater than $213,600.
Numerically, these hypotheses can be written as:
Ho : μ ≤ 213,600
H1 : μ > 213,600
Learning Team Reflection: Final Reflection
3
Hypothesis Testing
The hypothesis test follows the 5-step methodology. The first step consists in stating the null and
alternate hypotheses. This was already done above.
The second step consists in selecting a level of significance. The test will be done with a
significance level of α = 0.05, which is a typical value used in business-related hypothesis
testing. The level of significance is important because it bounds the acceptable Type I error (the
error of having a false rejection) of the test. Here we are saying that the Type I error has at most
a 5% of happening.
The third step corresponds to identifying the appropriate test statistic. This is done based on the
size of the sample. Samples with size n≥30 use the z-test, while samples with size n<30 use the ttest. This is because when the sample size is at least 30 we can assume the sample standard
deviation closely approximates the population’s standard deviation according to the Central
Limit theorem. In our case there are 45 homes in the dataset with size greater than 2200 sq. ft.
Hence, n = 45 and we use a z-test statistic given by:
where
is the mean of the sample and σ its standard deviation.
Since the sample size is relatively large, we can be confident of having a small Type II
error (the error of having a false acceptance).
During step 4 the research formulates a decision rule taking the level of significance and
the test statistic into account. With an α=0.05 and a right-tailed z-test the critical value is z =
1.645. The decision rule is: reject Ho if z > 1.645. The decision rule is sketched in Figure 1.
Learning Team Reflection: Final Reflection
Figure 1. Sketch of the critical value and rejection region.
Finally, in step 5 the sample data is used to find the sample mean and standard deviation. The
test statistic is evaluated and compared against the decision rule and a decision is made. The
following is the output generated by MegaStat for this test:
Hypothesis Test: Mean vs. Hypothesized Value
213.6000
hypothesized value
235.8333
mean Data
49.9459
std. dev.
7.4455
std. error
45
n
4
Learning Team Reflection: Final Reflection
2.99
z
.0014
p-value (one-tailed, upper)
221.2404
confidence interval 95.% lower
250.4262
confidence interval 95.% upper
14.5929
margin of error
As indicated by MegaStat, the test statistic is z = 2.99, which falls inside the rejection region
with a level of significance of 5%. As we can see, the p-value is a very small 0.0014. This
indicates that there is less than 0.15% probability of having a Type I error.
Why use a one-sample test method
In this research we used the one sample method to perform our test. A one-sample t-test helps
determine if the population mean is equal to the hypothesis test value. If the difference between
the sample mean and the test mean is larger than the sample mean, then it probably is not equal
to the test value or the test mean. This is an ideal situation to use a one sample test because we
have continuous data available from a single random sample. This test assumes that the
population is normally distributed. It is very important that data gathering is done properly and
the right audience is targeted. Bias data and influences can cause an impact to the final results
and so the end result might not be a true reflection on the findings.
5
Learning Team Reflection: Final Reflection
6
For example to test if smoking causes cancer, one has to make sure that research is performed on
people who smoke and got cancer and also people who don’t smoke and got cancer. Only
looking at one group will not provide a true and complete reflection of the situation.
Result and Decision
Complete the step five of the test and determine which of the following statements are true:

H0: the mean price of homes larger than 2200 sq. ft. is lower than or equal to $213,600.

H1: the mean price of homes larger than 2200 sq. ft. is greater than $213,600.
Based on the result we will determine if the business can categorize the data range in two
equal parts or not. Business will be able to determine if the policy: “of agent should focus on
showing houses up to 2300 square foot to the buyer with a budget of $213,600 or less” can be
implemented with less than 5% variance.
Conclusion
With our team performing different descriptive and inferential statistical analyses in reference to
the prices and sizes of homes, raw data for these analyses has come from the Real Estate dataset,
which contains 105 entries. We have explained why we chose to perform a one-sample test of
hypothesis against a hypothesized value and why the sample used in the analysis corresponds to
the prices of homes with sizes greater than 2200 sq. ft. Our hypothesis was that the mean price of
these homes is greater than $213,600. The test was done with the five-step hypothesis testing
methodology.
Learning Team Reflection: Final Reflection
References
Doane, D. P., & Seward, L. E. (2007). Applied Statistics in Business and Economics. Boston,
MA: McGraw-Hill/Irwin.
7