Download ID_4566_Biostatistics- Hypothesis test_English_sem_4

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Predictive analytics wikipedia , lookup

Data analysis wikipedia , lookup

Regression analysis wikipedia , lookup

Least squares wikipedia , lookup

Generalized linear model wikipedia , lookup

Data assimilation wikipedia , lookup

Taylor's law wikipedia , lookup

Bootstrapping (statistics) wikipedia , lookup

Transcript
1.
A.
B.
C. *
D.
E.
2.
A.
B.
C. *
D.
E.
3.
A.
B. *
C.
D.
E.
4.
A. *
B.
C.
D.
E.
5.
A. *
B.
C.
D.
E.
6.
A.
B.
C.
D. *
E.
7.
A.
B.
C.
?Which one of the following variables that are examples of categorical variables:
Number of episodes of disease in a patient over a year
Serum bilirubin level
Patient gender (M/F)
Haemoglobin level
Reduction in blood pressure following antihypertensive treatment
Which one of the following variables that are examples of categorical variables:
Blood pressure
Serum bilirubin level
Severity of haemophilia (mild/moderate/severe)
Haemoglobin level
Reduction in blood pressure following antihypertensive treatment
Which one of the following variables which are measured on a nominal scale.
Height in cm.
Ethnic group (white/black/asian).
Social class (I/II/III-N/III-M/IV/V).
Age categorised as young, middle-aged or old.
Age in years.
Which one of the following statements which you believe to be true. A data file in ASCII format:
Is the same as a text file.
Can only contain letters and not digits.
Has to be typed using a wordprocessing package.
Must have each data value separated only by a space.
Can only contain digits and not letters.
Which one of the following statements which you believe to be true. A data file in ASCII format:
Can be saved by a spreadsheet package.
Can only contain letters and not digits.
Has to be typed using a wordprocessing package.
Must have each data value separated only by a space.
Can only contain digits and not letters.
Often the same data can be collected in many different ways. Select the variable below that would
allow you to most accurately figure the age at the time of an interview.
Age at interview categorised into four groups: < 20 years, 21-29 years, 30-39 years, > 40 years
Age at interview categorised into six groups: 15-19 years, 20-24 years, 25-29 years, 30-34 years,
35-39 years, > 40 years
Age at interview recorded in years (continuous)
Date of birth and date of interview
Correct answer did not listed there
Often the same data can be collected in many different ways. Select the group of variables below that
would allow you to most accurately find out the side effects of a particular experimental drug.
1) Number of side effects in first month. 2) Number of side effects in second month. 3) Number of
side effects in third month etc.
1) Date of nausea/vomiting/sickness. 2) Date of dizziness. 3) Date of headache. 4) Date of sore
throat etc.
1) Number of episodes of nausea/vomiting/sickness. 2) Number of episodes of dizziness. 3) Number
of episodes of headache. 4) Number of episodes of sore throat etc.
D. *
E.
8.
A.
B.
C.
D.
E. *
9.
A.
B.
C.
D. *
E.
10.
A.
B.
C.
D. *
E.
11.
A.
B.
C.
D. *
E.
12.
A.
B.
C.
D. *
E.
13.
A.
B.
C.
D. *
E.
14.
1) Date and type of first side effect. 2) Date and type of second side effect. 3) Date and type of third
side effect etc.
Correct answer did not listed there
Which one of the following statements which you believe to be true. Checking your data for errors
can be done prior analysis by the way:
Ensures that the results of the statistical analysis are valid.
Ensures that your data cannot contain any transcription errors.
Is usually a waste of time as they have little effect on the results.
Cannot be done if the data set contains missing values.
Can be done by typing the data in twice and making the appropriate comparisons.
The following are a list of dates of birth of participants in a study of middle-aged women carried out
in 1999. Select all of the dates which you suspect may be erroneous.
01/07/53
21/09/57
16/05/62
19/05/80
02/12/55
The following are a list of dates of birth of participants in a study of middle-aged women carried out
in 1999. Select all of the dates which you suspect may be erroneous.
21/04/56
29/02/48
30/04/62
06/01/22
02/12/54
Which one of the following statements about the histogram which you believe to be true:
Can be used instead of a pie chart to display categorical data.
Contains contiguous bars, with the height of each bar being proportional to the frequency of the
observations in the range specified by the bar.
Is used to show the relationship between two variables.
Can be used to display either a frequency or a relative frequency distribution.
Can be used instead of a bar chart to display continuous data.
Which one of the following statements about the bar which you believe to be true:
Can also be called a histogram.
Is used to show the relationship between two variables.
Should be drawn without gaps between the bars.
Contains contiguous bars, with the height of each bar being proportional to the frequency of the
observations in the range specified by the bar.
Can only be used to display data which have a symmetrical distribution.
Which one of the following statements about the bar which you believe to be true:
Can also be called a histogram.
Is used to show the relationship between two variables.
Should be drawn without gaps between the bars.
Contains separate bars, with the length of each bar being proportional to the relevant frequency or
relative frequency.
Can only be used to display data which have a symmetrical distribution.
Which one of the following type(s) of figures that would be appropriate for illustrating the
distribution of heights of children in a class.
A.
B.
C. *
D.
E.
15.
A.
B.
C. *
D.
E.
16.
A.
B.
C.
D. *
E.
17.
A. *
B.
C.
D.
E.
18.
A. *
B.
C.
D.
E.
19.
A.
B.
C.
D. *
E.
20.
A.
B.
C.
D. *
E.
21.
Bar chart
Pie chart
Histogram
Scatter plot
Segmented bar chart
Which one of the following type(s) of figures that would be appropriate for illustrating the
distribution of heights of children in a class.
Clustered bar chart
Pie chart
Stem-and-leaf plot
Scatter plot
Segmented bar chart
Which one of the following type(s) of figures that would be appropriate for illustrating the
relationship between height and weight among individuals in a study.
Bar chart
Pie chart
Histogram
Scatter plot
Box-plot
Which one of the following type(s) of figures that would be appropriate for illustrating the
distribution of blood groups in a sample of adults.
Bar chart
Pie chart
Histogram
Scatter plot
Box-plot
Which one of the following type(s) of figures that would be appropriate for illustrating the
relationship between gender and blood group in a sample of adults.
Clustered bar chart
Pie chart
Histogram
Scatter plot
Stem-and-leaf plot
?Which one of the following statements which you believe to be true. The Normal distribution:
Is the distribution of a variable measured on healthy individuals.
Is skewed to the right.
Has a mean of zero and a standard deviation of one.
Has its mean equal to its median.
Has about 95% of its observations contained within the limits defined by (mean ± standard
deviation).
Which one of the following statements which you believe to be true. The Normal distribution:
Is the distribution of a variable measured on healthy individuals.
Is skewed to the right.
Has a mean of zero and a standard deviation of one.
Is a continuous probability distribution.
Has about 95% of its observations contained within the limits defined by (mean ± standard
deviation).
Which one of the following variables that are likely to follow a Normal distribution.
A.
B.
C. *
D.
E.
22.
A.
B.
C. *
D.
E.
23.
A.
B.
C.
D.
E. *
24.
A. *
B.
C.
D.
E.
25.
A.
B.
C.
D.
E. *
26.
A.
B. *
C.
D.
E.
27.
A. *
B.
C.
D.
E.
28.
A. *
B.
C.
The number of hospital attendances in a year in a sample of adults from the general population.
Survival times following a heart transplant.
Heights of individuals in the population.
The ages of first year medical students.
Shoe sizes of 11 year olds in a mixed school class.
Which one of the following statements which you believe to be true. The Binomial distribution:
Is the distribution of a continuous random variable.
Is always symmetrical.
Is used for making inferences about proportions.
Can be used to approximate the Normal distribution in certain circumstances.
Has a mean of zero and a standard deviation of one.
Which one of the following statements which you believe to be true. The t-distribution:
Is particularly useful for analyzing categorical data.
Is skewed to the right.
Always has a mean of zero and a standard deviation of one.
Becomes less like the Normal distribution as the sample size increases.
Is characterised by the degrees of freedom.
Which distribution is a ratio of two variances likely to follow?
F-distribution
Normal distribution
Poisson distribution
Lognormal distribution
Binomial distribution
Which distribution is the proportion of individuals with a disease who are successfully treated with a
new drug likely to follow?
F-distribution
Normal distribution
Poisson distribution
Lognormal distribution
Binomial distribution
Which distribution is the weight of doctors in a hospital likely to follow?
F-distribution
Normal distribution
Poisson distribution
Lognormal distribution
Binomial distribution
Which one of the following statements about sampling which you believe to be true.
A sample statistic is a point estimate of a population parameter.
Sampling error arises when we transcribe data incorrectly.
Random sampling implies a haphazard approach to the data analysis.
A sample data alvays have a Normal Ditribution
The inferential process involves drawing conclusions about the sample.
Which one of the following statements about sampling which you believe to be true.
For a given data set, the standard deviation is always greater than the standard error of the mean.
Sampling error arises when we transcribe data incorrectly.
Random sampling implies a haphazard approach to the data analysis.
D.
E.
29.
A. *
B.
C.
D.
E.
30.
A.
B. *
C.
D.
E.
31.
A.
B. *
C.
D.
E.
32.
A.
B.
C.
D. *
E.
33.
A.
B. *
C.
D.
E.
34.
A.
B. *
C.
D.
E.
35.
A sample data alvays have a t-distribution Ditribution
The inferential process involves drawing conclusions about the sample.
Which one of the following statements which you believe to be true. The standard error (denoted as
m) of the mean:
Provides a measure of the precision of the sample mean as an estimate of the population mean.
Can only be estimated if we take repeated samples from the population.
Will increase in value as the sample size increases.
Depends only on the size of the sample.
Can not be greatest then 1.0
Which one of the following statements which you believe to be true. A histogram:
Can be used instead of a pie chart to display categorical data.
It is similar (visually) to a bar chart but there are no gaps between the bars.
Contains contiguous bars, with the height of each bar being proportional to the frequency of the
observations in the range specified by the bar.
Is used to show the relationship between two variables.
Must have the symmetrical shape.
Which one of the following statements which you believe to be true. A histogram:
Can be used instead of a pie chart to display categorical data.
Can be used to display either a frequency or a relative frequency distribution.
Contains contiguous bars, with the height of each bar being proportional to the frequency of the
observations in the range specified by the bar.
Is used to show the relationship between two variables.
Must have the symmetrical shape.
?Hypothesis testing. How posible to interprete the P-value
The probability that the null hypothesis is true.
The probability that the alternative hypothesis is true.
The probability of obtaining the observed or more extreme results if the alternative hypothesis is
true.
The probability of obtaining the observed results or results which are more extreme if the null
hypothesis is true.
Always less than 0.05.
Which one of the following statements which you believe to be true. The one-sample t-test is
appropriate when:
Our aim is to compare the mean of a variable in one group of individuals to that of another.
Our aim is to compare the mean of a variable in a group of individuals to a particular value.
The variable of interest is categorical.
The assumptions underlying the sign test are not satisfied.
Always less than 0.05.
Which one of the following statements which you believe to be true. The one-sample t-test is
appropriate when:
Our aim is to compare the mean of a variable in one group of individuals to that of another.
The variable of interest is Normally distributed.
The variable of interest is categorical.
The assumptions underlying the sign test are not satisfied.
Always less than 0.05.
Which one of the following statements which you believe to be true. If the 95% confidence interval
for the mean of a variable of interest obtained from a sample for values contains a hypothesized
value, µ1, of the mean:
A.
B.
C.
D. *
E.
36.
A.
B.
C. *
D.
E.
37.
A.
B. *
C.
D.
E.
We are 95% certain that the sample mean lies within the interval
We are 95% certain that the true mean in the population equals µ1
We are 95% certain that the sample mean equals the population mean.
There is a 5% chance that the population mean lies outside this interval.
We can reject the null hypothesis that the true population mean equals m1 at the 5% level of
significance.
Which one of the following statements which you believe to be true. The paired t-test is appropriate
when:
The variable of interest is binary.
We want to compare two numerical variables when each is measured on every individual in the
sample.
The differences between the pairs of observations are Normally distributed.
We wish to test the null hypothesis that the mean of the differences between the pairs of
observations in the sample is equal to zero.
We wish to test the null hypothesis that the median of the differences between the pairs of
observations in the population is equal to zero.
In a study to assess the relationship between the concentration of alcohol in urine (UAC) and in blood
(BAC), measured in mg%, the following simple linear regression equation was derived: BAC = -5.6
+ 0.811 x UAC. Which one of the following statements which you believe to be true.
BAC and UAC are the estimated regression coefficients of the model.
BAC is the dependent variable.
On average, UAC increases by 0.811mg% for every 1.0 mg% increase in BAC.
At a urine concentration of 250 mg%, the predicted BAC is 297.15 mg%.
The UAC that will lead to a predicted BAC of 250 mg% is 330 mg%.