* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download PDF file
Survey
Document related concepts
Genomic library wikipedia , lookup
Medical genetics wikipedia , lookup
Whole genome sequencing wikipedia , lookup
Genetically modified crops wikipedia , lookup
History of genetic engineering wikipedia , lookup
Population genetics wikipedia , lookup
Genetic testing wikipedia , lookup
SNP genotyping wikipedia , lookup
Molecular Inversion Probe wikipedia , lookup
Microevolution wikipedia , lookup
Heritability of IQ wikipedia , lookup
Genetic engineering wikipedia , lookup
Genome-wide association study wikipedia , lookup
Genome (book) wikipedia , lookup
Human genetic variation wikipedia , lookup
Selective breeding wikipedia , lookup
Transcript
03/02/2015 Introduction Application of genomics to selective breeding of Atlantic salmon • Salmonid aquaculture = large & expanding industry • Atlantic salmon major farmed species Global salmonid production (tonnes) Source; FAO Ross Houston PAG XXIII, San Diego, 10th Jan 2015 Introduction Introduction • Infectious diseases are a serious problem for salmon farming worldwide • Viral diseases: • Control strategies can include vaccination, drugs, management, & selective breeding • E.g. Infectious Salmon Anaemia (ISA), Infectious Pancreatic Necrosis (IPN), Pancreas Disease (PD) • Bacterial diseases: • E.g. Furunculosis, Salmon Rickettsial Syndrome (SRS) • Ectoparasites: • E.g. Sea lice (L. salmonis & C. rogercresseyi), Amoebic Gill Disease (AGD) Introduction Introduction • Control strategies can include vaccination, drugs, management, & selective breeding • Viral diseases: • Control strategies can include vaccination, drugs, management, & selective breeding • Viral diseases: • E.g. Infectious Salmon Anaemia (ISA), Infectious Pancreatic Necrosis (IPN), Pancreas Disease (PD) • Bacterial diseases: • E.g. Furunculosis, Salmon Rickettsial Syndrome (SRS) • Ectoparasites: • E.g. Sea lice (L. salmonis & C. rogercresseyi), Amoebic Gill Disease (AGD) • E.g. Infectious Salmon Anaemia (ISA), Infectious Pancreatic Necrosis (IPN), Pancreas Disease (PD) • Bacterial diseases: • E.g. Furunculosis, Salmon Rickettsial Syndrome (SRS) • Ectoparasites: • E.g. Sea lice (L. salmonis & C. rogercresseyi), Amoebic Gill Disease (AGD) Is host resistance to these diseases heritable??? 1 03/02/2015 Introduction Introduction • Control strategies can include vaccination, drugs, management, & selective breeding • Viral diseases: • E.g. Infectious Salmon Anaemia (ISA), Infectious Pancreatic Necrosis (IPN), Pancreas Disease (PD) • Bacterial diseases: • E.g. Furunculosis, Salmon Rickettsial Syndrome (SRS) • Ectoparasites: Yáñez, Houston & Newman (2014) Frontiers in Livestock Genomics • E.g. Sea lice (L. salmonis & C. rogercresseyi), Amoebic Gill Disease (AGD) Published evidence for host genetic variation in resistance Selective Breeding for Resistance Selective Breeding for Resistance • Selection for resistance in salmon is achievable… • …but there are limitations: • High fecundity and external fertilisation helps • Breeding value of candidates typically estimated from siblings x Parents Offspring selection candidates • Only half of the genetic variation is between-family variation • Disease challenge experiments each year are undesirable • Genomics tools can help: x Parents Calculate breeding values Offspring Disease Challenge Offspring selection candidates Subsequent generation Calculate breeding values Subsequent generation Selective Breeding for Resistance Selective Breeding for Resistance • …but there are limitations: • …but there are limitations: • Only half of the genetic variation is between-family variation • Disease challenge experiments each year are undesirable • Genomics tools can help: x Parents Offspring selection candidates Subsequent generation Offspring Disease Challenge • Only half of the genetic variation is between-family variation • Disease challenge experiments each year are undesirable • Genomics tools can help: x Parents Calculate breeding values Offspring Disease Challenge Offspring selection candidates Take sample of DNA Calculate (genomic) breeding values Genotype for genetic markers Subsequent generation 2 03/02/2015 Selective Breeding for Resistance Introduction Major QTL common in farmed fish? • Farmed salmon are close to wild ancestors ~ 10 generations of selective breeding (fewer for disease resistance) • New disease pressures in farmed environment • Major effect QTL more common than in terrestrial species? 50-86 % Phenotypic variance 49 – 65 % Phenotypic variance 50 % Phenotypic variance 25-30 % Phenotypic variance Introduction IPN Resistance • Genetic architecture of resistance important for application of genomics in breeding programmes • Three case studies of genetic resistance in salmon: • Endemic birnavirus • Mortality: i. Infectious Pancreatic Necrosis (IPN) virus ii. Pancreas Disease (PD) virus iii. Sea lice (L. salmonis) • Post-Smolts (after seawater transfer) • Family mortalities differ (range: 0 – 100%) • Fry (freshwater stage) IPNV-infected salmon fry • Selection for resistance is possible • But only between families using trait data alone • Aim: Find & utilise genetic markers to predict resistance to IPN virus Sea water cages IPN Resistance IPN Resistance • Collected mortalities and survivors from ‘natural’ IPN virus outbreaks in seawater • Markers to assign to family and establish pedigree • Single locus explains almost all genetic variation in resistance in freshwater and seawater heritability ~ 0.4 • Genome scan for resistance loci in salmon genome: • Stage 1: QTL detection • Sparse markers, sire segregation • Stage 2: QTL confirmation & positioning • Denser markers, dam segragation Major QTL detected: Same region detected by Norwegian group F M Houston et al. (2008) Genetics & (2010) Heredity Moen et al. (2009) BMC Genomics Salmon have very limited recombination in males 3 03/02/2015 IPN Resistance IPN Resistance • Large effect of QTL on mortality level of fry • Next stages: i. SNP markers to accurately predict QTL genotype in commercial salmon stocks ii. Understanding of underlying biological mechanisms Sire haplotype Dam haplotype R S R 0% 2% S 1% 63% IPN mortality levels in fish carrying alternative QTL alleles (n = 341) Houston et al. (2010) Heredity IPN Resistance IPN Resistance RAD Sequencing RAD Sequencing • DNA sequencing technology is transforming animal breeding • SNP Discovery and Genotyping by RAD Sequencing • Reference genome sequences for A. salmon and rainbow trout • Tools for high density SNP genotypes: Genotyping by sequencing SNP arrays • Illumina sequencing of pooled, barcoded samples • Applied to RR and SS homozygotes from IPNV-challenged families • Compared SNP genotypes for concordance with QTL Restriction enzyme cut sites Genomic DNA RAD Seq Whole Genome Seq Adapted from Etter et al (2009) single Illumina sequencing read Davidson et al. (2010) Genome Biology, Berthelot et al (2014) Nature Comms IPN Resistance IPN Resistance • 22K SNPs → 50 linked SNPs → 10 screened → 2 associated • Association between SNP and IPN mortality in test (n~4000) and validation (n~5000) populations of salmon fry • Microarray & RNA-Seq comparison of RR vs SS host response to infection in resistant vs susceptible fish Population 1 day post-infection Gene expression susceptible fry Mortality Rate (SE) 7 days 20 days Interferons & other cytokines RAD01 SNP Genotype RR RS SS TEST 0.10 (0.03) 0.17 (0.01) 0.60 (0.01) VALIDATION 0.11 (0.01) 0.25 (0.01) 0.63 (0.01) → Commercial application: SNP panel used as genetic test to select resistant broodstock Houston et al. (2012) BMC Genomics Gene expression resistant fry • Resistance mechanism early in host-pathogen interaction • Mapping early DE genes → positional / functional candidates 4 03/02/2015 PD Resistance • Near monogenic host resistance to a virus is atypical • Pancreas Disease (salmonid alphavirus) • Resistance to other diseases likely to be controlled by a mix of QTL and a polygenic component http://aqua.merck-animal-health.com/binaries/Alphaviruses%20189x200_tcm56-34357.jpg PD Resistance • Morbidity & mortality at post-smolt stage • Causative agent – Salmonid alphavirus • Six subtypes, geographic specificity Alphavirus PD Resistance PD Resistance • Fry challenge model • Sampled survivors and mortalities from challenge • Genotyped for SNP markers, assigned to family • Estimated h2 • Naïve fry (Marine Harvest) challenged with water from tank of IP-injected parr • • • • Total n = 3,949 150 full-sib families Sire half-sib structure Mortality ~ 60% overall PD Resistance • Model: Yijk = μ + Sirei + Damij + eijk • Observed, LOGIT, PROBIT scales Analysis Method Observed binary scale Heritability (±SE) 0.34 (0.05) Underlying liability scale 0.55 Probit-link scale 0.54 (0.07) Logit-link scale 0.46 (0.06) PD Resistance F • QTL mapping using sparse SNP panel • Two-stage mapping strategy as with IPN M • Chr 3 QTL confirmed in separate population (a) ‘Intermediate’ genetic architecture (i) Chr 3 PD resistance QTL mapped in population of postsmolts (Baranski et al) (ii) Position of fry & post-smolt QTL on chromosome co-incide Gonen et al., Submitted 5 03/02/2015 PD Resistance Sea Lice Resistance • Summary: • Pancreas disease resistance is highly heritable (h 2 ~ 0.5) • The genetic architecture is ‘intermediate’ • A QTL on Chr 3 affects resistance in fry & post-smolts • SNPs associated with QTL applied in selective breeding • What about more polygenic traits? • QTL-targeted marker selection replaced by genomic selection • New genomic tools required to genotype thousands of animals Create High Density SNP chip • Generation and application of SNP chip plausible within typical 3 year timescale of scientific project SNP chip development & application SNP chip development & application • Illumina sequencing of diverse populations • Validation and utility of Affymetrix SNP chip • Combination of RAD-Seq, RR-Seq and RNA-Seq • SNP discovery and testing • Millions of candidates to ~132 K verified, polymorphic assays • Sequencing haploid fish to remove paralogous variation SNP array is informative in farmed Scottish, farmed Norwegian and wild European Atlantic salmon populations 1. Can be used to detect population structure based on genomic similarity 2. Y-specific probes accurately predict phenotypic sex of juvenilles Houston et al. (2014), BMC Genomics Sea Lice Resistance Sea Lice Resistance • Sea lice challenge • Quantitative Genetic parameters • Pedigreed population (Landcatch) challenged with L. salmonis larvae 96 larvae per fish • 725 fish sampled 7 days post infection Individual lice counts using stereo microscope Weight & length measurements and fin clip sample • Genotyping and QC • Samples genotyped using Axiom 132 K SNP array • QC filtering of SNPs and individuals (Mendel error, MAF, etc.) 111 K SNPs retained 29 sires, 61 dams, 534 offspring i. ii. Animal model in ASReml used to estimate h2 of lice count • Sex: fixed effect, animal (pedigree): random effect • Body weight as a covariate Genomic (IBS) relationship matrix calculated using Genabel • Equivalent model with G matrix replacing A matrix • Results h2 (SE) Mean lice count Standard Deviation pedigree genomic 25.6 12.4 0.24 (0.08) 0.20 (0.07) Consistent h2 estimates using pedigree and genomic matrix 6 03/02/2015 Sea Lice Resistance Sea Lice Resistance • Genome-wide association analysis • The most significant SNP • Located on distal end of chromosome 3, P~10-5 • Derived from RNA-Seq • Occurs in a gene for which the protein product is known to be involved in the host response of the epidermis to sea lice infection • Genabel mmscore & ASReml -log P Value SNP Genotype AA (n=15) AB (n=94) BB (n=537) Lice Count (SE) 36.4 (3.4) 29.8 (1.3) 24.3 (0.6) • Resistance allele is common in this population U Atlantic salmon Chromosome Number Sea Lice Resistance Overall Summary • Summary: • We created a publicly-available high-density SNP array • Can detect population structure & predict phenotypic sex • h2 estimated at 0.24 (pedigree) and 0.20 (genomic) • The genetic architecture is polygenic • Genomic prediction & selection for resistance underway • Host resistance to infectious diseases has a genetic component Genetic Architecture of Resistance Overall Summary Sea lice Polygenic: No evidence for ‘major’ QTL • PD and IPN viruses ~ 50 % variation • Sea lice ~ 25 % variation • But the genetic architecture of resistance varies…. • Host resistance to infectious diseases has a genetic component • PD and IPN viruses ~ 50 % observed variation genetic • Sea lice ~ 25 % observed variation genetic Pancreas Disease Infectious Pancreatic Necrosis Intermediate: ~30 % genetic variation explained by 3 QTL • But the genetic architecture of resistance varies…. • Genomic tools have a major role for salmon disease research and application • Past success for IPN virus & significant progress for other diseases • Working with aquaculture industry is key • Application of genomics is tailored to genetic architecture of resistance Nearly Monogenic: ~80 % genetic variation explained by singe QTL 7 03/02/2015 Genotyping Strategies Acknowledgements RAD Sequencing Main collaborators: Steve Bishop Chris Haley Alan Archibald Serap Gonen Natalie Lowe Karim Gharbi Richard Talbot Tim Cezard John Davey John Taggart Michaël Bekaert James Bron Brendan McAndrew Ashie Norris Derrick Guy Alastair Hamilton Alan Tinch Jose Mota-Velasco David Verner-Jeffreys Richard Paley Fiona Brew • Rough comparison of SNP chip & genotyping by sequencing De Novo SNP Chip RAD-Seq Time to set up 1 – 2 years 1 – 6 months Up front cost High Low Cost per sample $50 - 100 $10 - 30 SNP density 30 – 1000 K 1 – 20 K Bioinformatics requirement Medium High Populationspecificity Variable High Accuracy of data Very High High Suitable sample size Large Small - Medium 8