Download Pages 455 through 461 Melissa

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

History of statistics wikipedia , lookup

Psychometrics wikipedia , lookup

Quantitative trait locus wikipedia , lookup

Foundations of statistics wikipedia , lookup

Population genetics wikipedia , lookup

Resampling (statistics) wikipedia , lookup

Student's t-test wikipedia , lookup

Misuse of statistics wikipedia , lookup

Transcript
There are three outcomes that will result when STR interpretation is performed. There is
a match, no match, and inconclusive. The only result that requires statistics is a match. Statistics
are mathematically calculated to determine the estimate of a random match probability of that
sample occurring between two unrelated people. The estimate requires knowledge and testing of
allele frequency from different racial/ethnic samples. There is not one universal standard when it
comes to a statistical solution, however, there are methods that are more frequently utilized, and
those tests are recommended by NRCII and DAB.
Probability is the number of times something actually occurs divided by the amount of
times it has the opportunity to happen. It is viewed on a continuum between zero and one.
Probabilities are described mathematically with symbols. There are three laws of probability
1. Probabilities can take place in the range of zero to one
2. Events can be mutually exclusive, meaning that if any one of a particular set of events
has occurred, then none of the others has occurred.
3. When two events are independent of one another, their probabilities can be multiplied
with one another.
Likelihood ratios are comparisons made between the Null Hypothesis and the Alternative
Hypothesis. The Null Hypothesis can be described as a hypothesis indicating that a difference
does not exist. The Alternative Hypothesis suggests that a difference does exist.
There are six steps involved in making a statistical analysis of a hypothesis, they are as
follows;
1. formulate two competing hypothesis
2. select appropriate model that identifies the test statistic
3. determine level of significance (measure of risk)
4. collect a sample and calculate an estimate of the test statistic
5. define the region of rejection
6. select the appropriate hypothesis
There are two types of errors that can occur in hypothesis testing. Type I involves
rejecting the hypothesis when in fact it is really true. Type II Error involves accepting the
hypothesis when in fact it is really false.
The Chi Square test is known as the “goodness to fit” test because it is the sum of the
squared value between the observed results and the expected results divided by the expected
value. The results of this are charted and compared to determine the deviation from the expected
normal values. The differences the numbers are known as the degrees of freedom.
Using the confidence intervals will help in determining the precision of a point estimate.
They are usually measured on a 95% confidence interval, which is the sample average give, or
take two standard deviations. The confidence interval is a function of frequency of the
individuals in population and the observation.
Randomization tests are to help ask questions when collecting data that comes out
differently and to confirm the validity of the data. There are two different ways to retest the
samples, bootstrapping and jack-knifing. Bootstrapping is a computer simulation where the
sample being retested is a replacement of the original observation. Jack-knifing involves
creating a new sample that is one less than the original observation.
The study of inherited variation is known as population genetics, this is the observation
with different population groups in genotype frequencies and alleles. The alleles have two
copies of autosomal genes, one from the mother or egg and one from the father or sperm. When
there is a difference between the two alleles there are known as heterozygous. For example A
from mom and a from dad creating Aa. When two alleles come together and they are identical
they are known as homozygous. For example B from mom and B from dad creating BB. There
is variability within the alleles, which helps determine the stability of that allele being passed on
to the next generation. Forces in the population genetic are mutations, migration, natural
selection, and random genetic drift that affect gene frequency of alleles present in the population.
Laws of Mendelian Genetics:
1. Law of segregation - two members of a gene pair segredate from each other during sex
cell formation
2. Law of independent assortment - different segregating gene pairs behave independently.
Both laws used in Hardy-Weinberg equilibrium (HWE). Using a Punnett Square, each
allele frequency can be calculated based on either the observed characteristics of offspring or the
original characteristics of the parents. The relationship between the number of alleles and the
number of possible genotypes can be expressed by n(n + 1)/2.
Comparisons between two or more sets of alleles may be performed with R×C
contingency tables, which involve showing the response of one variable as a function of another
variable.
To understand the numbers used in population data publication can be used to relative
each STR maker which has been typed, the measures of the evaluated DNA makers include
Power Discrimination(PD), the power of exclusion(PE), the a priori chance of exclusion(CE),
the polymorphism information content (PIC), and a marker’s heterozygousity(H).
The “best” markers typically use STR markers, European laboratories largely using SGM
Plus kit that amplifies 10 STR loci and North American laboratories use 13 core CODIS loci.
The eight STR is common between European and North America STR testing sets: FGA,
TH01, VWA, D3S1358, D8S1179, D16S539, D18S51, and D21S11.these provide a useful
measure for evaluating the discriminating power of the DNA profiling system.
The most common type can be calculated by considering a sample type that is
heterozygous at all loci processing the two most common alleles at each locus.
Ch 20
A population DNA database is used to find common alleles present in the population
under consideration and is often done by getting healthy, unidentified samples from a blood bank
or local hospital, ideally of unrelated individuals of known ethnicity. PCR amplification is done
on the extracted samples and genotyped at the STR loci of interest, and the allele frequency is
found. 100-120 individuals per locus per population is sufficient for robust likelihood
calculations.
Institutions require approval from the Institutional Review Board (IRB) when testing a
subject. The board needs consent by those being tested or the testing is completely anonymous
and the genetic information cannot be linked to the subject.
When STR genotypes are generated, the sample data is evaluated to ensure that the
database is working for human tests. Many computer programs are now available to test the
independence of statistical tests on genetic data. When testing various tests for independence a
computer program from Ranajit Chakraborty’s group from the University of Cincinnati has a
system called DNATYPE. The system checks for errors, accuracy, and also searches for
duplicates of alleles. Program H in DNATYPE examines that the distribution of genotypes are in
frequencies of Hardy-Weinberg equilibrium.
HWE predicts the stability of alleles; the purpose is to test if alleles within a locus are
independent. Frequencies should not change if the locus is genetically stable, however; natural
populations may violate the HWE and change over time. Other purpose of HWE test is for
indications of excess homozygote; this can happen due to allelic dropout. Allele frequencies
might generate expected genotype frequencies when observed. When observed and the expected
value is similar it is said that those alleles in the genetic locus value are stable, therefore “in
equilibrium.”
Three important reasons for departure from the Hardy-Weinberg equilibrium are (1)
parents might be related to one another leading to inbreeding, (2) population structure and (3)
selection since surviving genotypes reproduce at different rates.
For random matching probabilities the loci needs to be independent. Linkage Equilibrium occurs
when STR loci is transferred independently during DNA segment meiosis.
Random match probabilities are equated respectively with the DNA profile of interest.
This test population is tested based on alleles frequencies. Deviations from independence are rare
and found in the 13 CODIS STRs because they are located on different chromosomes except for
CSF1PO and D5S818. If independence can be shown between alleles and loci, we can use the
product rule to show random match probabilities.