* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download positionalCloning15
Segmental Duplication on the Human Y Chromosome wikipedia , lookup
Gene nomenclature wikipedia , lookup
Ridge (biology) wikipedia , lookup
Epigenetics of diabetes Type 2 wikipedia , lookup
Human genetic variation wikipedia , lookup
Point mutation wikipedia , lookup
Oncogenomics wikipedia , lookup
Genetic engineering wikipedia , lookup
Genomic imprinting wikipedia , lookup
Copy-number variation wikipedia , lookup
Gene expression programming wikipedia , lookup
History of genetic engineering wikipedia , lookup
Transposable element wikipedia , lookup
Non-coding DNA wikipedia , lookup
Gene desert wikipedia , lookup
Metagenomics wikipedia , lookup
Gene expression profiling wikipedia , lookup
Therapeutic gene modulation wikipedia , lookup
Microevolution wikipedia , lookup
Genome (book) wikipedia , lookup
Quantitative trait locus wikipedia , lookup
Public health genomics wikipedia , lookup
Designer baby wikipedia , lookup
Whole genome sequencing wikipedia , lookup
No-SCAR (Scarless Cas9 Assisted Recombineering) Genome Editing wikipedia , lookup
Pathogenomics wikipedia , lookup
Minimal genome wikipedia , lookup
Human genome wikipedia , lookup
Helitron (biology) wikipedia , lookup
Site-specific recombinase technology wikipedia , lookup
Genomic library wikipedia , lookup
Artificial gene synthesis wikipedia , lookup
Human Genome Project wikipedia , lookup
Positional cloning: the rest of the story X a a a a a a a a http://faculty.ithaca.edu/iwoods/docs/wh Today: So you have a map location … now what? Mapped Mutant Cloned Gene Mapping: Ultimate Goal Map Distance = # of recombinants # of meioses X Screen MANY markers on FEW meioses LOW resolution = Potentially HIGH distance Great for “Which Marker is Linked?” =0 Screen NEARBY markers on MANY (1000’s) meioses HIGH resolution = Potentially ZERO distance Great for “Where is the Mutation?” High-Resolution Mapping Basic strategies: • more markers: Refine boundaries - SSLPs – likely polymorphic, no sequence needed - SNPs – require sequence data • more mutants: Increase resolution One fancy strategy: • NextGen sequencing of pooled WT and pooled mutants => RNA SEQ => focus on exons “Homozygosity Mapping”: Define homozygous region in mutants Find the actual mutation? How to be sure . . . ? Generate more SNPs = more markers to map on more mutants Data so far: Mutant with defects in slow muscle specification Initial Mapping: Out of 16 meioses: 1 recombinants: Z3057, Z4999, Z7109 0 recombinants: Z8693, Z11119 4 recombinants: Z13936 From mutant map position to cloned gene • Refining the map location with high-resolution mapping • Trolling for candidate genes • Testing candidates From mutant map position to cloned gene • Refining the map location with high-resolution mapping • Trolling for candidate genes • Testing candidates 2013 What’s near Z15270? http://www.ncbi.nlm.nih.gov/nucleotide Goal: obtain sequence so we can localize it to Genome NCBI Nucleotide Query NCBI Nucleotide Query Sequence Search at Ensembl Genome Browser Start close and move out both ways Sequence Search at Ensembl Genome Browser Start close and move out both ways Sequence Search at Ensembl Genome Browser Find More Markers To Test . . . Find More Polymorphisms Find More Markers To Test . . . Additional validated Polymorphisms Simple Repeats: UCSC genome browser Designing PCR primers http://bioinfo.ut.ee/primer3-0.4.0/primer3/ Where do we go from here? Can get sequences and test each of these Not all will be useful “Informative” = polymorphic = PCR amplicons of different lengths from WT and mutants Markers you’ve seen already Testing for informative SSLPs “Informative” = polymorphic = PCR amplicons of different lengths from WT and mutants More fish = refine the map More fish (i.e. embryos / larvae) = more recombinants = higher resolving power a a a a a a a a Narrowing the critical interval 5/1156 Z15270 More fish = more better 7/1156 Z11119 Z15270 Z11119 Defining the BOUNDARIES in the genome Now what? • Identify more markers and do more high-res mapping Key point = continually refine boundaries by recombination • Look in genome for potential candidates What’s nearby in genome? . . . a [very good] MODEL of reality No luck in genome sequence? (rare) misassembly or gaps • conserved synteny with other fish • Physical map: BAC clones • genetic or RH maps Now what? • Identify more markers and do more high-res mapping Key point = continually refine boundaries by recombination • Look in genome for potential candidates What’s nearby in genome? . . . a [very good] MODEL of reality No luck in genome sequence? (rare) misassembly or gaps • conserved synteny with other fish • Physical map: BAC clones • genetic or RH maps What’s nearby in the genome? http://www.ensembl.org/Danio_rerio/ Good candidate? calca at ZFIN calca expression motor neuron expression Mutant = lack slow muscle fibers what if . . . A secreted signal from motor neurons to developing muscle?! calca expression: RNA-SEQ calca expression: RNA-SEQ What’s known about calca? http://www.ncbi.nlm.nih.gov/gene What’s known about calca? Cool new biology: it’s a secreted peptide with a novel role in directing slow muscle specification! Alert Cell, Science, and Nature! How to test if this is the right gene? Is calca the right gene? High resolution mapping - no recombinants between mutation and gene in lots of meioses Phenocopy with new mutant (or MO injection) or noncomplementation with another allele Rescue with mRNA injection Find mutation in coding sequence Picking the right strategy often is determined by balance of . . . - Available Resources - Number of Candidates These are often determined by size of candidate interval Now what? Test potential candidates: • Turn the candidate into a new map marker - could it be the right gene? - even if not, can it narrow your interval? How to turn it into a map marker? What’s a good candidate? Now what? Test potential candidates: • Turn the candidate into a new map marker - could it be the right gene? - even if not, can it narrow your interval? How to turn it into a map marker? What’s a good candidate? Single nucleotide polymorphisms Forward Forward 200 bp A Reverse G 60 bp, 140bp Reverse SNPs = ~ 1 / 250 bp in genome Generating map markers from ESTs/Genes/other sequences • Find or design primers for PCR (from gDNA) • Sequence PCR product on WT and mut • Find RE polymorphism • or use your huge list of markers from nextGen sequencing pooled WT and pooled mutant. which regions are differentially homozygous? Obtaining gDNA from cDNA sequence: exporting from genome http://genome.ucsc.edu/ Obtaining gDNA from cDNA sequence: exporting from genome BLAT Result Good vs. Questionable Regions Good vs. Questionable Regions Beware of shotgun (non-BAC, i.e. large clone) assembly Safe Sailing (mostly) Here there be Monsters Obtaining gDNA from cDNA sequence: exporting from genome Obtaining gDNA from cDNA sequence: exporting from genome Designing PCR primers http://bioinfo.ut.ee/primer3-0.4.0/primer3/ PCR primers Amplify from WT and mut, sequence . . . Locating a SNP to map . . . run on your mapping panel - still a candidate? (0 recombinants) - narrow the candidate interval? Identifying a restriction enzyme to map your SNP http://helix.wustl.edu/dcaps/dcaps.html dCAPS results Striking the right balance in positional cloning Follow-up: Map? Or Biology? Mapping: lots of fish, lots of PCR, lots of gels should always give you an unambiguous answer Functional: Sequencing => often done concomitantly with mapping mRNA rescue, CRISPR allele, Morpholinos => time, money Ambiguous, easy to make up lots of stories Mapping: Ultimate Goal Map Distance = # of recombinants # of meioses X Screen MANY markers on FEW meioses LOW resolution = Potentially HIGH distance Great for “Which Marker is Linked?” =0 Screen NEARBY markers on MANY (1000’s) meioses HIGH resolution = Potentially ZERO distance Great for “Where is the Mutation?” Mapping can do it all! What if ZF genome turns out to be a dead end (RARE!)? • Check other fish genomes - more candidate genes? - fix a gap in the ZF data • RNA-SEQ or HMFSeq? • Start a chromosome walk - iterative BAC screening What if ZF genome turns out to be a dead end? • Check other fish genomes Pufferfish (Tetraodon, Fugu) - smaller, more compact genome - good for getting enhancer regions Tetraodon calca region More Candidates to test: find and map zebrafish orthologs Today: So you have a map location … now what? Mapped Mutant Cloned Gene Tomorrow’s bioinformatics practical: 0) Virtual Positional Cloning 1) Navigate Genome browsers for information related to expression, Loss-of-function, Rescue 2) Zebrafish orthologs of your favorite human genes Identification of enhancer elements Transgenic Lines 3) BLAST on your own computer, and blast parsing via Python script 4) From transcriptome profiling, identify genes, download upstream sequences, visualize overrepresented motifs Today: So you have a map location … now what? Mapped Mutant Cloned Gene Tomorrow’s bioinformatics practical: 0) Virtual Positional Cloning A review of what we did today, with some extra stuff Today: So you have a map location … now what? Mapped Mutant Cloned Gene Tomorrow’s bioinformatics practical: 1) Navigate Genome browsers for information related to expression, Loss-of-function, Rescue Your favorite Zebrafish Gene => sequence / exon-intron boundaries, conservation expression morpholino design obtaining mRNA clones for rescue Today: So you have a map location … now what? Mapped Mutant Cloned Gene Tomorrow’s bioinformatics practical: 2) Zebrafish orthologs of your favorite human genes Identification of enhancer elements Transgenic Lines Human gene ZF ortholog location in genome putative promoter / enhancer => conservation of noncoding DNA from other fish Today: So you have a map location … now what? Mapped Mutant Cloned Gene Tomorrow’s bioinformatics practical: 3) BLAST on your own computer, and blast parsing via Python script All human proteins associated with HH signaling Identification of ALL putative ZF orthologs of these proteins via local BLAST Parse BLAST to get top result and genome location for each ZF protein Determine if genome location matches map position of mutant Today: So you have a map location … now what? Mapped Mutant Cloned Gene Tomorrow’s bioinformatics practical: 4) From transcriptome profiling, identify genes, download upstream sequences, visualize overrepresented motifs List of short unidentified sequence Assign to Ensembl ID via BLAST and parsing Download 5’UTR and 2k upstream sequences for batch of Ensembl ID’s Search through these for enriched motifs Visualize locations of enriched motifs Tomorrow’s Informatics Practical http://faculty.ithaca.edu/iwoods/docs/wh/