Download PDF file

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

RNA-Seq wikipedia , lookup

Genomic library wikipedia , lookup

Medical genetics wikipedia , lookup

Whole genome sequencing wikipedia , lookup

Genetically modified crops wikipedia , lookup

History of genetic engineering wikipedia , lookup

Population genetics wikipedia , lookup

Genetic testing wikipedia , lookup

SNP genotyping wikipedia , lookup

Molecular Inversion Probe wikipedia , lookup

Genomics wikipedia , lookup

Microevolution wikipedia , lookup

Heritability of IQ wikipedia , lookup

Genetic engineering wikipedia , lookup

Genome-wide association study wikipedia , lookup

Genome (book) wikipedia , lookup

Human genetic variation wikipedia , lookup

Tag SNP wikipedia , lookup

Selective breeding wikipedia , lookup

Quantitative trait locus wikipedia , lookup

Public health genomics wikipedia , lookup

Transcript
03/02/2015
Introduction
Application of genomics to selective
breeding of Atlantic salmon
• Salmonid aquaculture = large & expanding industry
• Atlantic salmon major farmed species
Global salmonid production (tonnes)
Source; FAO
Ross Houston
PAG XXIII, San Diego, 10th Jan 2015
Introduction
Introduction
• Infectious diseases are a serious problem for
salmon farming worldwide
• Viral diseases:
• Control strategies can include vaccination,
drugs, management, & selective breeding
• E.g. Infectious Salmon Anaemia (ISA), Infectious
Pancreatic Necrosis (IPN), Pancreas Disease (PD)
• Bacterial diseases:
• E.g. Furunculosis, Salmon Rickettsial Syndrome (SRS)
• Ectoparasites:
• E.g. Sea lice (L. salmonis & C. rogercresseyi),
Amoebic Gill Disease (AGD)
Introduction
Introduction
• Control strategies can include vaccination,
drugs, management, & selective breeding
• Viral diseases:
• Control strategies can include vaccination,
drugs, management, & selective breeding
• Viral diseases:
• E.g. Infectious Salmon Anaemia (ISA), Infectious
Pancreatic Necrosis (IPN), Pancreas Disease (PD)
• Bacterial diseases:
• E.g. Furunculosis, Salmon Rickettsial Syndrome
(SRS)
• Ectoparasites:
• E.g. Sea lice (L. salmonis & C. rogercresseyi),
Amoebic Gill Disease (AGD)
• E.g. Infectious Salmon Anaemia (ISA), Infectious
Pancreatic Necrosis (IPN), Pancreas Disease (PD)
• Bacterial diseases:
• E.g. Furunculosis, Salmon Rickettsial Syndrome
(SRS)
• Ectoparasites:
• E.g. Sea lice (L. salmonis & C. rogercresseyi),
Amoebic Gill Disease (AGD)
Is host resistance to these diseases heritable???
1
03/02/2015
Introduction
Introduction
• Control strategies can include vaccination,
drugs, management, & selective breeding
• Viral diseases:
• E.g. Infectious Salmon Anaemia (ISA), Infectious
Pancreatic Necrosis (IPN), Pancreas Disease (PD)
• Bacterial diseases:
• E.g. Furunculosis, Salmon Rickettsial Syndrome
(SRS)
• Ectoparasites:
Yáñez, Houston &
Newman (2014)
Frontiers in Livestock
Genomics
• E.g. Sea lice (L. salmonis & C. rogercresseyi),
Amoebic Gill Disease (AGD)
Published evidence for host genetic variation in resistance
Selective Breeding for Resistance
Selective Breeding for Resistance
• Selection for resistance in salmon is achievable…
• …but there are limitations:
• High fecundity and external fertilisation helps
• Breeding value of candidates typically estimated from siblings
x
Parents
Offspring
selection
candidates
• Only half of the genetic variation is between-family variation
• Disease challenge experiments each year are undesirable
• Genomics tools can help:
x
Parents
Calculate breeding
values
Offspring
Disease
Challenge
Offspring
selection
candidates
Subsequent generation
Calculate breeding
values
Subsequent generation
Selective Breeding for Resistance
Selective Breeding for Resistance
• …but there are limitations:
• …but there are limitations:
• Only half of the genetic variation is between-family variation
• Disease challenge experiments each year are undesirable
• Genomics tools can help:
x
Parents
Offspring
selection
candidates
Subsequent generation
Offspring
Disease
Challenge
• Only half of the genetic variation is between-family variation
• Disease challenge experiments each year are undesirable
• Genomics tools can help:
x
Parents
Calculate breeding
values
Offspring
Disease
Challenge
Offspring
selection
candidates
Take sample of DNA
Calculate
(genomic)
breeding values
Genotype for genetic
markers
Subsequent generation
2
03/02/2015
Selective Breeding for Resistance
Introduction
Major QTL common in farmed fish?
• Farmed salmon are close to wild ancestors
 ~ 10 generations of selective breeding (fewer for disease resistance)
• New disease pressures in farmed environment
• Major effect QTL more common than in terrestrial species?
50-86 % Phenotypic variance
49 – 65 % Phenotypic variance
50 % Phenotypic variance
25-30 % Phenotypic variance
Introduction
IPN Resistance
• Genetic architecture of resistance important for application
of genomics in breeding programmes
• Three case studies of genetic resistance in salmon:
• Endemic birnavirus
• Mortality:
i. Infectious Pancreatic Necrosis (IPN) virus
ii. Pancreas Disease (PD) virus
iii. Sea lice (L. salmonis)
• Post-Smolts (after seawater transfer)
• Family mortalities differ (range: 0 – 100%)
• Fry (freshwater stage)
IPNV-infected salmon fry
• Selection for resistance is possible
• But only between families using trait data alone
• Aim: Find & utilise genetic markers to
predict resistance to IPN virus
Sea water cages
IPN Resistance
IPN Resistance
• Collected mortalities and survivors from ‘natural’ IPN virus
outbreaks in seawater
• Markers to assign to family and establish pedigree
• Single locus explains almost all genetic variation in
resistance in freshwater and seawater
 heritability ~ 0.4
• Genome scan for resistance loci in salmon genome:
• Stage 1: QTL detection
• Sparse markers, sire segregation
• Stage 2: QTL confirmation & positioning
• Denser markers, dam segragation
Major QTL detected:
Same region detected by
Norwegian group
F
M
Houston et al. (2008) Genetics & (2010) Heredity
Moen et al. (2009) BMC Genomics
Salmon have very limited
recombination in males
3
03/02/2015
IPN Resistance
IPN Resistance
• Large effect of QTL on mortality level of fry
• Next stages:
i. SNP markers to accurately predict QTL genotype in
commercial salmon stocks
ii. Understanding of underlying biological mechanisms
Sire haplotype
Dam haplotype
R
S
R
0%
2%
S
1%
63%
IPN mortality levels in fish carrying alternative QTL alleles (n = 341)
Houston et al. (2010) Heredity
IPN Resistance
IPN Resistance
RAD Sequencing
RAD Sequencing
• DNA sequencing technology is transforming animal breeding
• SNP Discovery and Genotyping by RAD Sequencing
• Reference genome sequences for A. salmon and rainbow trout
• Tools for high density SNP genotypes:
 Genotyping by sequencing
 SNP arrays
• Illumina sequencing of pooled, barcoded samples
• Applied to RR and SS homozygotes from IPNV-challenged families
• Compared SNP genotypes for concordance with QTL
Restriction enzyme cut sites
Genomic DNA
RAD Seq
Whole Genome
Seq
Adapted from
Etter et al (2009)
single Illumina sequencing read
Davidson et al. (2010) Genome Biology, Berthelot et al (2014) Nature Comms
IPN Resistance
IPN Resistance
• 22K SNPs → 50 linked SNPs → 10 screened → 2 associated
• Association between SNP and IPN mortality in test (n~4000)
and validation (n~5000) populations of salmon fry
• Microarray & RNA-Seq comparison of RR vs SS host response to
infection in resistant vs susceptible fish
Population
1 day post-infection
Gene expression
susceptible fry
Mortality Rate (SE)
7 days
20 days
Interferons &
other cytokines
RAD01 SNP Genotype
RR
RS
SS
TEST
0.10 (0.03)
0.17 (0.01)
0.60 (0.01)
VALIDATION
0.11 (0.01)
0.25 (0.01)
0.63 (0.01)
→ Commercial application:
SNP panel used as genetic test to select resistant broodstock
Houston et al. (2012) BMC Genomics
Gene expression
resistant fry
• Resistance mechanism early in host-pathogen interaction
• Mapping early DE genes → positional / functional candidates
4
03/02/2015
PD Resistance
• Near monogenic host resistance to a virus is atypical
• Pancreas Disease (salmonid alphavirus)
• Resistance to other diseases likely to be controlled by a mix
of QTL and a polygenic component
http://aqua.merck-animal-health.com/binaries/Alphaviruses%20189x200_tcm56-34357.jpg
PD Resistance
• Morbidity & mortality at post-smolt stage
• Causative agent – Salmonid alphavirus
• Six subtypes, geographic specificity
Alphavirus
PD Resistance
PD Resistance
• Fry challenge model
• Sampled survivors and mortalities from challenge
• Genotyped for SNP markers, assigned to family
• Estimated h2
• Naïve fry (Marine Harvest) challenged with water from tank
of IP-injected parr
•
•
•
•
Total n = 3,949
150 full-sib families
Sire half-sib structure
Mortality ~ 60% overall
PD Resistance
• Model: Yijk = μ + Sirei + Damij + eijk
• Observed, LOGIT, PROBIT scales
Analysis Method
Observed binary scale
Heritability (±SE)
0.34 (0.05)
Underlying liability scale
0.55
Probit-link scale
0.54 (0.07)
Logit-link scale
0.46 (0.06)
PD Resistance
F
• QTL mapping using sparse SNP panel
• Two-stage mapping strategy as with IPN
M
• Chr 3 QTL confirmed in separate population
(a)
‘Intermediate’ genetic architecture
(i) Chr 3 PD resistance QTL
mapped in population of postsmolts (Baranski et al)
(ii) Position of fry & post-smolt
QTL on chromosome co-incide
Gonen et al., Submitted
5
03/02/2015
PD Resistance
Sea Lice Resistance
• Summary:
• Pancreas disease resistance is highly heritable (h 2 ~ 0.5)
• The genetic architecture is ‘intermediate’
• A QTL on Chr 3 affects resistance in fry & post-smolts
• SNPs associated with QTL applied in selective breeding
• What about more polygenic traits?
• QTL-targeted marker selection replaced by genomic selection
• New genomic tools required to genotype thousands of animals
 Create High Density SNP chip
• Generation and application of SNP chip plausible within typical 3
year timescale of scientific project
SNP chip development & application
SNP chip development & application
• Illumina sequencing of diverse populations
• Validation and utility of Affymetrix SNP chip
• Combination of RAD-Seq, RR-Seq and RNA-Seq
• SNP discovery and testing
• Millions of candidates to ~132 K verified, polymorphic assays
• Sequencing haploid fish to remove paralogous variation
SNP array is informative in farmed
Scottish, farmed Norwegian and wild
European Atlantic salmon populations
1. Can be used to detect population
structure based on genomic similarity
2. Y-specific probes accurately predict
phenotypic sex of juvenilles
Houston et al. (2014), BMC Genomics
Sea Lice Resistance
Sea Lice Resistance
• Sea lice challenge
• Quantitative Genetic parameters
• Pedigreed population (Landcatch) challenged with L. salmonis larvae
 96 larvae per fish
• 725 fish sampled 7 days post infection
 Individual lice counts using stereo microscope
 Weight & length measurements and fin clip sample
• Genotyping and QC
• Samples genotyped using Axiom 132 K SNP array
• QC filtering of SNPs and individuals (Mendel error, MAF, etc.)
 111 K SNPs retained
 29 sires, 61 dams, 534 offspring
i.
ii.
Animal model in ASReml used to estimate h2 of lice count
•
Sex: fixed effect, animal (pedigree): random effect
•
Body weight as a covariate
Genomic (IBS) relationship matrix calculated using Genabel
•
Equivalent model with G matrix replacing A matrix
• Results
h2 (SE)
Mean lice
count
Standard
Deviation
pedigree
genomic
25.6
12.4
0.24 (0.08)
0.20 (0.07)
Consistent h2 estimates using pedigree and genomic matrix
6
03/02/2015
Sea Lice Resistance
Sea Lice Resistance
• Genome-wide association analysis
• The most significant SNP
• Located on distal end of chromosome 3, P~10-5
• Derived from RNA-Seq
• Occurs in a gene for which the protein product is known to be
involved in the host response of the epidermis to sea lice infection
• Genabel mmscore & ASReml
-log P
Value
SNP Genotype
AA (n=15)
AB (n=94)
BB (n=537)
Lice Count (SE)
36.4 (3.4)
29.8 (1.3)
24.3 (0.6)
• Resistance allele is common in this population
U
Atlantic salmon Chromosome Number
Sea Lice Resistance
Overall Summary
• Summary:
• We created a publicly-available high-density SNP array
• Can detect population structure & predict phenotypic sex
• h2 estimated at 0.24 (pedigree) and 0.20 (genomic)
• The genetic architecture is polygenic
• Genomic prediction & selection for resistance underway
• Host resistance to infectious diseases has a genetic component
Genetic Architecture of Resistance
Overall Summary
Sea lice
Polygenic: No evidence for ‘major’ QTL
• PD and IPN viruses ~ 50 % variation
• Sea lice ~ 25 % variation
• But the genetic architecture of resistance varies….
• Host resistance to infectious diseases has a genetic component
• PD and IPN viruses ~ 50 % observed variation genetic
• Sea lice ~ 25 % observed variation genetic
Pancreas
Disease
Infectious
Pancreatic
Necrosis
Intermediate: ~30 % genetic variation explained by 3 QTL
• But the genetic architecture of resistance varies….
• Genomic tools have a major role for salmon disease research
and application
• Past success for IPN virus & significant progress for other diseases
• Working with aquaculture industry is key
• Application of genomics is tailored to genetic architecture of resistance
Nearly Monogenic: ~80 % genetic variation
explained by singe QTL
7
03/02/2015
Genotyping Strategies
Acknowledgements
RAD Sequencing
Main collaborators:
Steve Bishop
Chris Haley
Alan Archibald
Serap Gonen
Natalie Lowe
Karim Gharbi
Richard Talbot
Tim Cezard
John Davey
John Taggart
Michaël Bekaert
James Bron
Brendan McAndrew
Ashie Norris
Derrick Guy
Alastair Hamilton
Alan Tinch
Jose Mota-Velasco
David Verner-Jeffreys
Richard Paley
Fiona Brew
• Rough comparison of SNP chip & genotyping by sequencing
De Novo SNP Chip
RAD-Seq
Time to set up
1 – 2 years
1 – 6 months
Up front cost
High
Low
Cost per sample
$50 - 100
$10 - 30
SNP density
30 – 1000 K
1 – 20 K
Bioinformatics
requirement
Medium
High
Populationspecificity
Variable
High
Accuracy of data
Very High
High
Suitable sample
size
Large
Small - Medium
8