* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download Restriction Fragment Length Polymorphisms (RFLPs)
DNA damage theory of aging wikipedia , lookup
Nucleic acid analogue wikipedia , lookup
United Kingdom National DNA Database wikipedia , lookup
Genomic imprinting wikipedia , lookup
DNA vaccination wikipedia , lookup
Cancer epigenetics wikipedia , lookup
Frameshift mutation wikipedia , lookup
Epigenetics of neurodegenerative diseases wikipedia , lookup
Gel electrophoresis of nucleic acids wikipedia , lookup
Nucleic acid double helix wikipedia , lookup
Human genetic variation wikipedia , lookup
Hardy–Weinberg principle wikipedia , lookup
Metagenomics wikipedia , lookup
DNA supercoil wikipedia , lookup
Genetic drift wikipedia , lookup
Human genome wikipedia , lookup
Genetic engineering wikipedia , lookup
Zinc finger nuclease wikipedia , lookup
Nutriepigenomics wikipedia , lookup
Bisulfite sequencing wikipedia , lookup
Genealogical DNA test wikipedia , lookup
Molecular cloning wikipedia , lookup
Genome evolution wikipedia , lookup
Extrachromosomal DNA wikipedia , lookup
Deoxyribozyme wikipedia , lookup
Genome (book) wikipedia , lookup
Vectors in gene therapy wikipedia , lookup
Epigenomics wikipedia , lookup
Public health genomics wikipedia , lookup
Quantitative trait locus wikipedia , lookup
No-SCAR (Scarless Cas9 Assisted Recombineering) Genome Editing wikipedia , lookup
Cell-free fetal DNA wikipedia , lookup
Polymorphism (biology) wikipedia , lookup
Population genetics wikipedia , lookup
Cre-Lox recombination wikipedia , lookup
Non-coding DNA wikipedia , lookup
Therapeutic gene modulation wikipedia , lookup
SNP genotyping wikipedia , lookup
Designer baby wikipedia , lookup
Molecular Inversion Probe wikipedia , lookup
Point mutation wikipedia , lookup
Genomic library wikipedia , lookup
Dominance (genetics) wikipedia , lookup
History of genetic engineering wikipedia , lookup
Genome editing wikipedia , lookup
Site-specific recombinase technology wikipedia , lookup
Artificial gene synthesis wikipedia , lookup
Microsatellite wikipedia , lookup
Lecture 30-32: 2007 Biology 207; Section B2; Good Restriction Fragment Length Polymorphisms (RFLPs) -1 Readings: Griffiths et al: 7th Edition: Ch. 12 pp. 384-386; Ch.13 pp404-407 8th Edition: pp. 364-366 Assigned Problems: 8th Ch. 11: 32, 34, 38-39 7th Ch. 12: 21 Ch. 13: 4, 5, 9, 10, 12-14 Concepts: What are RFLPs and how do they act like genetic marker loci ? 1. Some mutations alter the base pair sequence of restriction enzyme sites while others alter the distance between adjacent sites. Such alterations result in restriction fragment length polymorphisms (RFLP). 2. RFLPs represent allelic forms of a locus and behave as alleles in inheritance. 3. The location of RFLP can be mapped genetically and thereby used to map chromosomes. Restriction Fragments in Large Genomes The last few lectures have outlined the cloning of genes and other DNA fragments from large genomes. Now move to the analysis of such sequences. Steps: Chromosome ----------------------|--|--------------O---/\ / \ / \ Cosmid clone /E E\ ____|--------|--|--------|___ /\ / \ / \ Plasmid subclone / \ ]------------------[ E E Each of the chromosomes, the cosmid clone and the plasmid subclone have the restriction fragment. One can isolate genomic DNA from an organism and digest it with a restriction enzyme. Enzyme cleaves the double stranded DNA at each sequence recognized by the enzyme. This digested DNA can be separated on an agarose gel. Digest with same enzyme (e.g. EcoRI) Do Southern Transfer - DNA transferred to membrane. Southern Transfer: Probed with 32P labeled unique insert DNA - hybridized to: 1. sub cloned insert in plasmid + insert - not to vector 2. cloned insert in cosmid + insert - not to vector or other fragments 3. restriction fragment present on chromosome in genomic DNA - not to any other fragments (unique DNA probe) Autoradiogram of Southern blotted gel 1 Lecture 30-32: 2007 Homolog gene a+ Biology 207; Section B2; Good E E E 1 ---|--------|------------|--------|----------|----- Homolog gene a2 ---|--------|------------|-------------------|----E 5Kb E Alleles |-probe--| Southern Blot. Notes: 1. Hybridization intensity should be proportional to the amount of insert DNA (to which the probe can hybridize). 2. Thus, using a labeled probe and genomic DNA, one can identify the size of a restriction fragment at a particular site in the genome. 3. Diploid organisms have homologous chromosomes, thus the genome has two similar genes (sequences) at each locus -> 2 alleles. These two alleles may be the same or different sized fragments. Hypothetical Example: In this example: loss of middle Eco site is due to an ancestral mutation that changed the GAATTC sequence on homolog 1 to an unrecognized sequence (e.g. TAATTC) on homolog 2 -> (no cleavage). Or visa versa. The difference in the DNA sequence between the homologs makes the RFLP on each homolog an allelic form. Allele#1 = has the Eco site Allele#2 = lacks the Eco site We can see the effect when doing a genomic southern blot with the 2Kb Eco probe one can detect this allelic difference. 2 Lecture 30-32: 2007 Biology 207; Section B2; Good The "phenotype" of the allele is the size of the restriction fragment that hybridizes to the probe. One can detect both homologs (both alleles) on a genomic southern blot, thus one sees the phenotype for each allele. The phenotype is co-dominant - determine both alleles (no dominant or recessive here). The inheritance of these alleles is exactly like any other. Just as a+ or a- would be passed on, so would the 2Kb or 5Kb restriction fragment size as a phenotype (that directly reveals the genotype). Parents Ratio 2Kb/2Kb x 5Kb/5Kb | F1 2Kb/5Kb | F2: 2Kb/2Kb:2Kb/5Kb:5Kb/5Kb 1: 2: 1 2Kb and 5Kb alleles represent two different forms, or alleles at this "locus". This "locus" is the middle Eco site, where the DNA sequence differs between the homologous. Polymorphic locus – multiple alleles at a locus exist in the population Remember a diploid individual can only have two allelic forms, one for each homolog The advantages of RFLP’s include: Unlimited number of markers Co-dominant No developmental component No environmental factors involved in expression Degrees of Polymorphism Two forms in the population - dimorphic But some loci have many forms in a population - e.g. many different sized fragments - considered polymorphic in the population Distribution of Polymorphisms: Within a Genome: Some sequences are not polymorphic (monomorphic) - - essentially everyone in the population has the same sized fragment. - - (e.g. sequence encoding a protein - selection) Other sites are highly polymorphic, - many size fragments (alleles) in the population. (e.g. regions between genes - little/no selection against mutations) RFLP can be mapped genetically From previous example: Have two loci (gene a and the Eco site) that have allelic forms Do standard genetic cross: P1 a+ e+ X aea+ e+ ae- 3 Lecture 30-32: 2007 F1 a+ a- e+ e- Biology 207; Section B2; Good X aa- ee- test cross Score progeny for a+ or a- (assume a+ is dominant to a-) Score progeny for + or - for Eco site by southern blot of each progeny. test for 2Kb or 5Kb or both If a gene and Eco site are close to each other then Recombinant Frequency will be low (RF=low) -> few crossovers between a and Eco site. If a gene and Eco site are distant from each other then Recombinant Frequency will be high (RF approach 50%) -> many crossovers between a and Eco site. By extension RFLP can be mapped relative to each other by using two different probes and scoring appropriately for the respective phenotypes (fragment sizes). Restriction Fragment Length Polymorphisms (RFLPs) - 2 Locating Genes in Large Genomes Using RFLPs Readings: Griffiths et al: 8th Edition: pp. 394-402 7th Edition: Ch. 3 pp97; Ch 13 pp 406-408; 425-427; Ch. 14 pp 436-442 Concepts: How are RFLPs used to find different loci ? 1. RFLPs can be used as alleles in pedigree analysis. 2. RFLPs can be directly associated with the sequence changes that cause a normal gene to be a mutant allele (e.g. sickle-cell anemia). 3. In most cases an RFLP is used only as a nearby genetic marker to find linkage with a phenotype such as an inherited disease. 4. RFLP are used in "DNA fingerprinting". RFLPs in Pedigree analysis - (Human applications) Advantages of RFLP analysis 1. RFLP can be found at almost every location (probe) in a genome as long as many different restriction enzymes are tried. Thus any randomly chosen unique DNA probe can usually serve as an RFLP marker locus probe. 2. RFLP analysis requires a small sample of DNA. Blood sample is usually enough to do many tests. Can culture white blood cells if more is needed. 3. RFLP analysis can be done before any disease symptoms appear (or expected to appear). It may even be done before birth. RFLPs can be directly associated with the base pair mutation: Rare situation, but demonstrates the concept. e.g. sickle-cell anemia - Fig 11-24 (8th) 13-29.(7th). 4 Lecture 30-32: 2007 Biology 207; Section B2; Good Sickle-cell anemia is an recessive, inherited genetic disease that causes round red blood cells to become sickled in shape. The mutation is a DNA sequence change that alters the normal glutamine to a valine amino acid in the b-globin polypeptide chain (GAG to GTG codon). This change results in the loss of a DdeI restriction site ( Dde I = Mst II) . The sickle-cell hemoglobin gene lacks the Dde I site. This provides absolute linkage between the RFLP and a mutant gene (the mutation itself). (Not the usual situation.) RFLP linkage with an inherited disease -Fig 12-10 (8th) 13-3 (7th) RFLPs can be used as marker loci to find linkage with a disease gene locus. Probe P detects Morph 1 (two small bands) or Morph 2 (one large band - the sum of the two small bands). Examine a pedigree of an affected individual- disease D is due to a dominant mutation. Dd - affected has morph 1, 2 (heterozygote mother) dd - unaffected has morph 2 only – homozygote (father) ->(classic test cross) 5 Lecture 30-32: 2007 Biology 207; Section B2; Good Note: We can not say that because morph 1 is in Dd that morph 1 = D Each pedigree must be treated independently. If linked the "linkage phase" could be either way. (Coupled versus Repulsion) (D is coupled to Morph 1 and in repulsion with morph 2). Similar disease mutations could arise on chromosomes with different morphs (RFLP patterns)-> different phase. Furthermore, similar diseases (phenotype) can arise from mutations in different gene loci (on different chromosomes - unlinked). Cross: Dd x dd gives 8 progeny Progeny: - each has one allele as morph 2 (Dad) and either morph 1 or 2 from affected Mother. This is a small sample - normally use many families and combine data. But one can conclude: If it were unlinked then D would have 50% morph 1 50% morph 2. So: D/d disease locus appears linked but not absolutely D phase morph 1 and d phase morph 2 ------D---------------------morph 1----------o-------------d---------------------morph 2----------o-------One can explain the single discordance as a cross-over between the D/d locus and the RFLP locus. This was a simple example. Real mapping of human disease genes: Figure 12-9 (8th) 14-1a, b (7th) 6 Lecture 30-32: 2007 Biology 207; Section B2; Good On a large scale (many RFLP and many progeny) it has been possible to find linkage with most Human disease genes. Currently this is the method of choice for mapping a human genetic disease. One can test 100-1000 of RFLP sites on various chromosomes to find linkage. First find weak linkage with a RFLP probe, then refine the location with other RFLP sites nearby. "Zeroin" on genetic disease gene. Then use RFLP probe as probes to start chromosomal walk to clone the gene. There are "statistical flukes" – mislead investigators. RFLP, VNTRs, and DNA fingerprinting RFLP can arise due to VNTR's VNTR are variable number tandem repeat. First example found in myoglobin gene. Short sequence of 33 base pairs (other examples vary from 15-100 bp) is repeated a variable number of times. Direct repeat - highly polymorphic - many allele morphs Figure 14-4 (7th) Use this repeat sequence as a DNA probe to genomic southern blot. Several loci for such a probe. Many morphs at each locus. Effect is such that each individual has a unique band pattern - a DNA fingerprint (Identical twins are identical). DNA fingerprint Fig 14-3 (7th) - of Blood stain matches a suspect. Current debate of what a "match" is. The technique is much more powerful at excluding people. If the banding pattern doesn't "match" then the samples must be different (e.g. from different individuals). Twins?? 7 Lecture 30-32: 2007 Biology 207; Section B2; Good Simple VNTR Clone one locus out and use it to identify a single specific locus probe where there is only one VNTR present at the locus. High degree of polymorphism permits one to follow many alleles through a pedigree. Example in Fig 12-11 (8th) 14-5 (7th) and 12-12 (8th) 14-6(7th) 4 morphs - can follow each through to children. This provides a more detailed analysis than is possible with typical RFLP analysis. Microsatellites Microsatellites are small repeated regions of DNA which usually have a 2 or 3 bp repeated sequence which is 10-50 repeats in length. more frequent and polymorphic than VNTRs repeating units of 2, 3, 4 nucleotides: TGTGTG CAACAACAA or AAATAAATAAAT often has many alleles present in the population (heterozygosity freq-70%) for example 10-30 AC dinucleotide repeats. Uses PCR to visualize For forensics. A number of markers are used, up to 15. Each locus can have a number of alleles. 16S-212 - 212, 215, 218, 221 etc. Examples 4 alleles at this locus, each in equal frequency, with a total of 10 loci. (0.25)10 For other loci, can use frequency of alleles times each other. This approach most easily rules out who DNA it was not. 8 Lecture 30-32: 2007 Biology 207; Section B2; Good Probabilities, Gene Frequencies and Forensics While a single individual can only have two alleles at any one locus, in a population there can be many alleles, and these alleles can occur at different frequencies in different populations. The classic example is the ABO blood group system. There are three alleles in the population, A, B and O or i (a null allele). The frequency of any genotype will be the frequency of the two alleles in the population multiplied, (assuming random mating, etc). A Japanese AB would be (0.279*0.172 = 0.048 or 5% of the population). 9