* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download B - Dendrome
Genetics and archaeogenetics of South Asia wikipedia , lookup
Dual inheritance theory wikipedia , lookup
Genome evolution wikipedia , lookup
Group selection wikipedia , lookup
Site-specific recombinase technology wikipedia , lookup
Gene expression programming wikipedia , lookup
Biology and consumer behaviour wikipedia , lookup
Koinophilia wikipedia , lookup
Pharmacogenomics wikipedia , lookup
Dominance (genetics) wikipedia , lookup
Genetic testing wikipedia , lookup
Genetic engineering wikipedia , lookup
Hardy–Weinberg principle wikipedia , lookup
History of genetic engineering wikipedia , lookup
Polymorphism (biology) wikipedia , lookup
Medical genetics wikipedia , lookup
Public health genomics wikipedia , lookup
Designer baby wikipedia , lookup
Behavioural genetics wikipedia , lookup
Genome (book) wikipedia , lookup
Human genetic variation wikipedia , lookup
Genetic drift wikipedia , lookup
Heritability of IQ wikipedia , lookup
Quantitative trait locus wikipedia , lookup
Conifer Translational Genomics Network Coordinated Agricultural Project www.pinegenome.org/ctgn AN INTEGRATED RESEARCH, EDUCATION AND EXTENSION PROJECT AIMED AT MAINTAINING AND ENHANCING HEALTHY FORESTS AND ECOSYSTEMS BY BRINGING GENOMIC ASSISTED BREEDING TO APPLICATION IN THE UNITED STATES. www.pinegenome.org/ctgn 2 Workshop Introduction Genomics in tree breeding and forest ecosystems www.pinegenome.org/ctgn 3 Genomics in tree breeding and forest ecosystems Overall goals of the Conifer Translational Genomics Network – What is a Coordinated Agricultural Project (CAP)? – What’s the objective of the Conifer Translational Genomics Network (CTGN) CAP, and how does this workshop fit in? – How will the CTGN help forestry? Overall goals of this workshop What we‘ll be covering throughout the week – Modules one through ten Meet and greet www.pinegenome.org/ctgn 4 What is a Coordinated Agricultural Project? Coordinated Agricultural Project (CAP) awards are large-scale, multimillion dollar national research and extension initiatives funded by the Cooperative State Research, Education, and Extension Service (CSREES), an agency within the US Department of Agriculture (USDA) CAPs promote collaboration and information exchange; coordinate activities among individuals, institutions, states, and regions; and reduce the duplication of effort. They promote extension and education functions. CAP awards encourage maximum flexibility in applied plant genomics research www.pinegenome.org/ctgn 5 What is the objective of the CTGN CAP? The Conifer Translational Genomics Network CAP seeks to deliver genomic assisted breeding to tree improvement cooperatives by developing genomic resources and linking laboratory and field research with education and extension Comprehensive outreach education and extension programs are an integral part of the CTGN CAP, providing diverse training opportunities for post-doctoral researchers, graduate and undergraduate students, tree breeders, managers, stakeholders, and the public www.pinegenome.org/ctgn 6 How will the CTGN help forestry? Developments in biotechnology have allowed plant breeders to complement traditional phenotypic-based selection systems with molecular markers. Selection may now include the presence or absence of a specific alleles at known genes, either alone or in combination with traditional phenotypes The CTGN‘s goal is to develop marker-based breeding applications for tree breeding cooperatives, which provide over 1.3 billion seedlings annually in the US The CTGN team will leverage decades of tree breeding experience, institutional resources, and genomics research to enhance and complement ongoing tree breeding activities by: – – – – providing new tools for tree breeding increasing field selection efficiency and genetic gains improving the growth potential of planted seedlings enhancing forest health and vigor www.pinegenome.org/ctgn 7 CTGN informs non-industrial forestry The genomic resources developed by CTGN have utility for tree breeders as well as gene resource managers (Module 9) Application of genomics tools and resources range from characterizing a species‘ genetic diversity to identifying specific genes and alleles that may favor a population‘s survival under changing climatic conditions www.pinegenome.org/ctgn 8 Workshop goals Upon completion the workshop, students will have learned to: – – – – – – – – – – – Estimate variation using population and quantitative genetic metrics Predict how management activities will affect population structure Anticipate likely impacts of tree-improvement Assess the roles of institutional goals and biological constraints in tree improvement operations Quantify and evaluate population dynamics Use software to calculate population parameters based on genetic markers Distinguish strategies for mapping markers and quantitative trait loci Understand marker development and marker-assisted applications Predict and evaluate alternative breeding strategies Locate, read, and assess relevant scientific literature Gain facility with web-based genomics tools and resources www.pinegenome.org/ctgn 9 Workshop syllabus Course Outline Day 1. Introduction and Basic Principles – Module 1. Basic Principles in Population and Quantitative Genetics – Module 2. Introduction to Conventional Tree Breeding Day 2. Genetic Polymorphisms and Analyses – Module 3. Genetic Markers – Module 4. Molecular Population Genetics Day 3. Complex Trait Dissection – Module 5. Genetic Maps and QTL Mapping – Module 6. Association Genetics Day 4. Marker Assisted Breeding – Module 7. Marker Informed Breeding: Concepts and Discovery – Module 8. Marker Informed Breeding: Application and Economics (Day 5 in 2009) Day 5. Gene Resource Management and Forest Ecosystem Applications – Module 9. Linear Models and Molecular Markers (Day 4 in 2009) – Module 10. Genomic Applications in Ecosystems and Genetic Resource Management www.pinegenome.org/ctgn 10 Short course structure Topics delivered in a modular format – Two modules a day, roughly 4 hours per module – Opportunities for evening presentations, discussions, or computer use Modules will consist of… – Lectures with PowerPoint slides (2-3 hours) – Computer lab demonstrations or exercises (1-2 hours) – Plenty of breaks www.pinegenome.org/ctgn 11 Our Team PROJECT DIRECTOR David Neale - University of California, Davis [email protected] CO-INVESTIGATORS AND COOPERATORS Tom Byram - Texas Forest Service Jeff Dean - University of Georgia David Harry - Oregon State University Glenn Howe - Oregon State University Dudley Huber - University of Florida Fikret Isik - North Carolina State University Steve McKeand - North Carolina State University Dana Nelson - USDA Forest Service, SIFG Brad St. Clair - USDA Forest Service, PNWRS Jill Wegrzyn - University of California, Davis Nick Wheeler - Oregon State University Ross Whetten - North Carolina State University www.pinegenome.org/ctgn 12 Introductions www.pinegenome.org/ctgn 13 Module 1 Basic principles in population and quantitative genetics www.pinegenome.org/ctgn 14 Genes and Genomes www.pinegenome.org/ctgn 15 Quick review: Genes and genomes In eukaryotes, DNA is found in the... Nucleus Plant Cell Mitochondria Chloroplasts (plants) Organelle inheritance is often uniparental, making it powerful for certain types of applications For this workshop, we‘ll focus almost exclusively on nuclear genes www.pinegenome.org/ctgn 16 Chromosomes Linear strands of DNA and associated proteins in the nucleus of eukaryotic cells Chromosomes carry the genes and function in the transmission of hereditary information Diploid cells have two copies of each chromosome One copy comes from each parent Paternal and maternal chromosomes may have different alleles www.pinegenome.org/ctgn 17 Alleles Alternative forms of a gene Alleles arise through mutation A diploid cell has two copies of each gene (i.e. two alleles) at each locus Alleles on homologous chromosomes may be the same or different (homozygous vs. heterozygous) www.pinegenome.org/ctgn 18 Genes Units of information on heritable traits In eukaryotes, genes are distributed along chromosomes Each gene has a particular physical location: a locus Genes encompass regulatory switches and include both coding and non-coding regions Genes are separated by intergenic regions whose function is not understood www.pinegenome.org/ctgn 19 The genome An individual‘s complete genetic complement For eukaryotes, a haploid set of chromosomes For bacteria, often a single chromosome For viruses, one or a few DNA or RNA molecules Genome size is typically reported as the number of base pairs in one genome complement (i.e. haploid for eukaryotes) Until recently, we studied genes and alleles one or a few at a time (genetics) Aided by high throughput technologies we can now study 100‘s to 1000‘s of genes simultaneously (genomics). www.pinegenome.org/ctgn 20 Genome size Lambda phage 4.8 x 103 bp E. coli 4.6 x 106 bp Arabidopsis 1.6 x 108 bp Cottonwood 4.8 x 108 bp Chestnut 9.6 x 108 bp Humans 3.0 x 109 bp Pines ~3 x 1010 bp Fritillaria 1.3 x 1011 bp Amoeba dubia 6.7 x 1011 bp www.pinegenome.org/ctgn Douglas-fir, Pseudotsuga menziesii, has a chromosome number of 26. It’s diploid, so that means that n=13. Most Conifers have n=12. 21 Genes and genomes Using what we've learned In this workshop, we will convey many ways that our knowledge of genetics and genomics can be used by breeders and land managers Our emphasis will be on the study of methods that can be used to characterize or dissect complex (quantitatively inherited) traits and associate phenotype with genotype, leading to marker informed applications To do this, we will need to review several sub-disciplines of the science of genetics www.pinegenome.org/ctgn 22 Genetics To understand marker-informed breeding, we will first set the stage by briefly reviewing… Mendelian genetics describes inheritance from parents to offspring – discrete qualitative traits (including genetic markers) – predicts frequencies of offspring given specific matings Population genetics describes allele and genotype frequencies over space and time, including – – – – changes in allele frequencies between generations environmental factors contributing to fitness models are limited to a small number of genes analyzes variation within and among populations Quantitative genetics describes variation in traits influenced by multiple genes (continuous rather than discrete attributes) – relies on statistical tools describing correlations among relatives – each of many genes may have little influence on a specific trait www.pinegenome.org/ctgn 23 Mendelian Genetics (and the basis of genetic markers) www.pinegenome.org/ctgn 24 Mendelian inheritance We begin with family resemblance: ―Like begets like.‖ How do we explain it? www.madeyoulaugh.com www.pinegenome.org/ctgn 25 Traits tend to run in families Sib 1 Family 1 Sib 2 www.pinegenome.org/ctgn Sib 3 Family 2 Sib 4 26 Genotype and phenotype Genotype refers to the particular gene or genes an individual carries Phenotype refers to an individual‘s observable traits Only rarely can we determine genotype by observing phenotype Genomics offers tools to better understand the relationship between genotype and phenotype Individual genetic markers behave as Mendelian traits, so understanding Mendelian traits is key to understanding markers www.pinegenome.org/ctgn 27 Single-gene traits in trees are rare… Here‘s one in alder (f. pinnatisecta) P x F1 F1 x F1 (crossed or selfed) F2 www.pinegenome.org/ctgn 3:1 Mendel’s predictions hold up 28 Mendel‘s insights were amazing, and yet... Knowledge of biological processes was rudimentary, including… – cell division (mitosis or meiosis) – chromosomes were not yet known With the discovery of chromosomes, we realized – That genes are packaged on chromosomes – That genes on the same chromosome are associated (genetic linkage). Very important! We will explore this a great deal in future modules www.pinegenome.org/ctgn 29 Markers reflect genetic polymorphisms that are inherited in a Mendelian fashion DNA markers 'mark' locations where DNA varies (sequence or size) – Such polymorphisms can vary within and among individuals (e.g. heterozygotes vs. homozygotes) and populations Markers may be located in genes or elsewhere in the genome – Historically, we've had too few markers to inform breeding Genomics tools provide an almost unlimited supply of markers Today‘s marker applications were only imagined a few years ago www.pinegenome.org/ctgn 30 DNA markers reflect sequence variation Marker “1” Marker “2” A T C A A T C G A C G A T G A T T A C T A C G G T C G C T G<>C T C C G<>T www.pinegenome.org/ctgn 31 Markers track inheritance www.pinegenome.org/ctgn 32 Trait dissection using markers Hypothetical genes (QTLs) affecting economic traits, mapped using genetic markers a-m www.pinegenome.org/ctgn 33 Single Nucleotide Polymorphisms (SNPs) embedded within a DNA sequence DNA sequences are aligned Polymorphic sites are identified Haplotypes (closely linked markers of a specific configuration) are deduced by direct observation or statistical inference * * * atggctacctgaactggtcaactcatcgaaagctaa atggctacctgaactggtcaactcatcgaaagctaa atgcctacctgaactggtcaactcatcgaaagctaa atgcctacctgaactggtcaactcatcgaaggctaa atgcctacctgaactggtcaacacatcgaaggctaa www.pinegenome.org/ctgn 1 1 2 3 4 34 Genetic linkage and recombination Genes on different chromosomes are inherited independently Genes located on the same chromosome tend to be inherited together because they are physically linked—except that widely separated genes behave as if they are unlinked. Recombination during gametogenesis breaks up parental configurations into new (recombinant) classes The relative frequency of parental and recombinant gametes reflects the degree of genetic linkage Genetic mapping is the process of determining the order and relative distance between genes or markers (to be discussed in Module 3) www.pinegenome.org/ctgn 35 Genetic linkage and recombination A B a b A B a b A B a b a b A B A a B b Parental gametes (Non-recombinant) freq = 1-r www.pinegenome.org/ctgn A b a B Grandparents (2N) Parents (2N) Gametes (1N) Recombinant gametes freq = r 36 Population Genetics www.pinegenome.org/ctgn 37 Population genetics Population genetics is the study of genetic differences within and among populations of individuals, and how these differences change across generations It extends Mendelian genetics to include population dynamics and chance events such as… – – – – survival frequency of specific matings random sampling from populations, and mutation www.pinegenome.org/ctgn 38 38 Population genetics Over time, changes among populations can lead to genetic isolation and speciation Population genetics describes the mechanics of how evolution takes place As we discuss genetic differences, keep in mind that – – polymorphisms reflect differences among individuals within a species divergence reflects differences between species We‘ll discuss more specifics on these processes later on... – (D. L. Hartl 2000. A Primer of Population Genetics) www.pinegenome.org/ctgn 39 What do population geneticists measure? Studies limited to simply-inherited traits – – – – – historically, this involved morphological or biochemical markers shifted to allozymes in 1960’s to 1980’s DNA markers became more common beginning in later 1980’s many types of DNA markers have been developed we’ll re-visit markers in Module 3 Key points – population geneticists measure discrete (Mendelian) traits – quantitative geneticists measure continuous traits controlled by multiple genes (we’ll talk about quantitative genetics later in this Module) www.pinegenome.org/ctgn 40 40 Why study genes in populations? In natural populations: – Adaptation, or the ability to survive and exploit an environmental niche, involves the response of populations, not individuals. In breeding populations: – Genetic gain—improving the average performance of populations for desired breeding objectives—depends on selecting and breeding parents with the best genetic potential www.pinegenome.org/ctgn 41 Population genetics: Key questions Population genetics provides empirical models to predict genetic behavior for these and other situations What genotypes are present in a population and at what frequency? Are all genotypes equally likely to survive and reproduce? Are mating frequencies independent of genotype? Is the population stratified in some way, e.g. by proximity, size, or the timing of natural events? To what extent does mating occur with individuals outside the immediate area? outside To what extent are environmental conditions stable across generations? Etc… www.pinegenome.org/ctgn 42 42 Population genetics Provides empirical models to predict genetic behavior What genotypes are present in a population and at what frequencies? Are all genotypes equally likely to survive and reproduce? Are mating frequencies independent of genotype? Is the population stratified in some way, e.g. by proximity, size, or the timing of natural events? To what extent does mating occur with individuals outside the immediate area? To what extent are environmental conditions stable across generations? www.pinegenome.org/ctgn 43 43 Population genetics How genetically diverse is a species or population? – contrast diversity in populations that differ in life-history traits, pop size, breeding structure, etc Are different populations closely related to one another? – monitor diversity for conservation purposes What is the potential for inbreeding depression? – what is the minimum viable population size from a genetic standpoint? How is genetic variation maintained? Identify genes/alleles responsible for phenotypic variation Phylogenetic and biogeographic questions www.pinegenome.org/ctgn 44 44 Populations are groups of individuals whose relatedness is usually unknown Typical descriptive statistics Locus ‘X’ in pop #1 Allele A1 A2 A3 Frequency 0.2 0.5 0.3 Total = 1.0 Genotype A1 A1 A1 A2 A1 A3 A2 A2 A2 A3 A3 A3 Frequency 0.1 0.1 0.1 0.3 0.3 0.1 Sum = 1.0 A (# alleles) = 3 Ho (observed heterozygosity) = 0.5 With data from more loci, you an also calculate, P (% polymorphic loci) = % of loci with >1 allele www.pinegenome.org/ctgn 45 The Hardy-Weinberg Principle The frequencies of alleles and genotypes in a population will remain constant over time (given certain assumptions): describing a static, or non-evolving population The frequencies of alleles and genotypes can be described mathematically, where p and q are the frequencies of the alleles A1 and A2 Freq. A1A1 homozygote 2 Freq. A2A2 homozygote 2 p + 2pq + q = 1.0 Freq. A1A2 heterozygote www.pinegenome.org/ctgn 46 HW proportions… Predict frequencies of all genotypes based on allele frequencies Provide a quantitative measure of variation among populations differing in allele frequencies Provide a measure of within-population, heterozygosity Expected heterozygosity (He) is the combined frequency of all heterozygotes calculated from allele frequencies www.pinegenome.org/ctgn 47 Random mating restores HW proportions each generation White et al. 2007 www.pinegenome.org/ctgn 48 HW and random sampling Strictly speaking, Hardy-Weinberg proportions require certain assumptions, such as – an infinitely large population (translation = sampling with replacement) – mating is at random (translation = all possible pairings of mates is equally likely) – no selection (which biases genotype frequencies) – no migration (since all alleles must be sampled from the same pool) – no mutation (which introduces new variants) These conditions represent an ―ideal‖ population that is rarely (if ever) never fully realized www.pinegenome.org/ctgn 49 HW and random sampling Minor violations of assumptions usually have little impact In practice… – HW proportions apply for many natural populations – breeding populations are different: – population sizes can be small – individuals chosen for breeding may represent a subset of relatives – matings are often non-random www.pinegenome.org/ctgn 50 HW : Non-random mating When individual genotypes do not mate randomly, then HW proportions are not observed among the offspring We‘ll look at two kinds of non-random mating – population substructure/admixture – inbreeding (mating among related individuals) www.pinegenome.org/ctgn 51 HW : Population admixture Consider mixing individuals from non-interbreeding subpopulations (e.g. Offshore salmon from different runs) Even if each subpopulation is in HW, the admixed group is not (p1 ≠ p2) The admixed group will appear to have too many homozygotes This situation is called Wahlund‘s effect Hartl, 2000, Fig. 2.6 www.pinegenome.org/ctgn 52 Population structure: Wahlund‘s effect Larger populations may be subdivided into smaller groups, which may be difficult to delineate – sub-population can have different allele frequencies – each sub-population may show HW proportions A biologist may unknowingly sample individuals from different subpopulations and group them together. What would you observe? – HW proportions in the entire sample, or – more heterozygous individuals than predicted from HW expectations, or – more homozygous individuals than predicted from HW expectations? www.pinegenome.org/ctgn 53 Population structure: Wahlund‘s effect Wahlund’s effect: As long as allele frequencies vary among subpopulations, even if each subpopulation exhibits HW proportions, then more homozygotes will be observed than would be expected based on the allele frequency of the metapopulation The relative increase in homozygosity is proportional to the variance in allele frequencies among subpopulations, as measured by F (where 0 ≤ F ≤ 1). There are many versions of F, formulated in different ways. Each is a measure of increased genetic relatedness www.pinegenome.org/ctgn 54 Inbreeding Inbreeding (mating among relatives) increases homozygosity relative to HW – rate is proportional to degree of relationship – distant cousin < first cousin < half-sib < full-sib < self Recurrent inbreeding leads to a build-up of homozygosity, and a corresponding reduction in heterozygosity Inbreeding affects genotype frequencies, but not allele frequencies How does inbreeding affect deleterious recessive alleles? www.pinegenome.org/ctgn 55 Inbreeding and homozygosity White et al. 2007, Fig. 5.6 F reflects a proportional reduction in heterozygosity, and a build-up of genetic relatedness. HW implies F= 0. With recurrent selfing, F goes to 1 www.pinegenome.org/ctgn 56 Inbreeding depression Inbreeding often leads to reduced vitality (growth, fitness) Deleterious recessive alleles are made homozygous Outcrossing species are more likely to suffer higher inbreeding depression White et al. 2007, Fig. 5.7 www.pinegenome.org/ctgn 57 Evolutionary forces change allele frequencies Mutation a random heritable change in the genetic material (DNA) - ultimate source of all new alleles Migration (gene flow) the introduction of new alleles into a population via seeds, pollen, or vegetative propagules Random genetic drift the random process whereby some alleles are not included in the next generation by chance alone Natural selection the differential, non-random reproductive success of individuals that differ in hereditary characteristics www.pinegenome.org/ctgn 58 58 Mutation Heritable changes in DNA sequence alter allele frequencies as new alleles are formed Mutations at any one locus are rare, but with sufficient time, cumulative effects can be large Mutations are the ultimate source of genetic variation on which other evolutionary forces act (e.g., natural selection) Effects on populations – Mutations promote differentiation (but effects are gradual in the absence of other evolutionary forces) www.pinegenome.org/ctgn 59 59 Gene flow: Migration of alleles Gene Flow – the movement of alleles among populations Movement may occur by individuals (via seed) or gametes (via pollen) Effects on populations – gene flow hinders differentiation. It is a cohesive force tends to bind populations together www.pinegenome.org/ctgn Seed (low gene flow) Pollen (high gene flow) 60 Genetic drift Drift reflects sampling in small populations Subgroups follow independent paths Allele frequencies vary among subgroups Frequencies in the metapopulation remain relatively stable How does F behave? Hartl & Jones, 2004. www.pinegenome.org/ctgn 61 Random genetic drift Genetic bottleneck: An extreme form of genetic drift that occurs when a population is severely reduced in size such that the surviving population is no longer genetically representative of the original population Effects on populations – Drift promotes differentiation Large proportion of white beads Some yellow beads www.pinegenome.org/ctgn Small proportion of white beads No yellow beads 62 Natural selection Natural selection First proposed by Charles Darwin in mid1800‘s. The differential reproductive success of individuals that differ in hereditary characteristics – not all offspring survive and reproduce – some individuals produce more offspring than others (mortality, disease, bad luck, etc) – offspring differ in hereditary characteristics affecting their survival (genotype and reproduction are correlated) – individuals that reproduce pass along their hereditary characteristics to the next generation – favorable characteristics become more frequent in successive generations Effects on populations: – Promotes differentiation between populations that inhabit dissimilar environments – Hinders differentiation between populations that inhabit similar environments www.pinegenome.org/ctgn 63 Selection: Numerical example White et al. 2007, Table 5.3 www.pinegenome.org/ctgn 64 Selection: Equations White et al. 2007, Table 5.4 www.pinegenome.org/ctgn 65 Relative fitness: Key considerations Which genotype has the largest relative fitness? – determines the direction in which allele frequencies will change Are fitness differences large or small? – determines rate of change over generations—fast or slow What is the fitness of the heterozygote compared to either homozygote? – reflects dominance – complete (heterozygote identical to either homozygote) – no dominance (additive, heterozygote is intermediate) – partial (heterozygote more closely resembles one homozygote) – dominance influences how selection ―sees‖ heterozygotes – affects rate of change across generations www.pinegenome.org/ctgn 66 Natural selection Fitness: the relative contribution an individual makes to the gene pool of the next generation Directional www.pinegenome.org/ctgn Diversifying Stabilizing 67 Gene action: Additive vs. dominance A2A2 A1A2 A1A1 1-s 1-(1/2)s 1 additive A2A2 A1A2 A1A1 1-s 1-hs 1 partial dominance complete dominance A2A2 A1A2 A1A1 1-s 1 A2A2 A1A1 A1A2 1-s2 1-s1 1 overdominance phenotype Jennifer Kling, OSU www.pinegenome.org/ctgn 68 Dominance and rate of change Hartl, 2000 www.pinegenome.org/ctgn 69 What if selection is weak or absent? We‘ve already seen that mutation can supply new variation that selection may act upon Most mutations are deleterious and are lost, but rarely, advantageous mutations can occur What about mutations that cause no effect either way? Neutrality theory pertains to alleles that confer no difference in relative fitness—as if selection is oblivious to them We‘ll revisit the behavior of neutral alleles later on www.pinegenome.org/ctgn 70 Measuring population structure Generically speaking, population structure measures the degree to which allele frequencies vary among subpopulations This can be thought of in several ways – variance among subpopulations – heterozygosity among pairs of alleles drawn at random Recall, expected heterozygosity measures – the frequency of heterozygous genotypes in a HW population – which equals the frequency of random pairs of haploid gametes with different alleles Whenever allele frequencies vary among subpopulations (regardless of the cause), the variance in allele frequencies can be measured by F F = (He – Ho)/He We‘ll revisit this in Module 4 www.pinegenome.org/ctgn 71 Population genetics: A final concept Linkage disequilibrium (LD, also called gametic phase disequilibrium) Conceptually—LD is a correlation in allelic state among loci Numerically – expected haplotype (gamete) frequency is the product of the two allele frequencies, i.e. f(AB) = f(A) x f (B) – if f(AB) = f(A) x f (B), then LD = 0 – if f(AB) ≠ f(A) x f (B), then LD ≠ 0 LD may arise from factors such as – recent mutations – historical selection (hitchhiking effect) – population admixture Recombination causes LD to decay over generations LD plays a major role in association genetics. We will revisit! www.pinegenome.org/ctgn 72 A numeric example of LD • • • • determine allele frequencies ask whether f(A) x f(B) = f(AB) repeat for f(Ab), f(aB), and f(ab) linkage disequilibrium (LD) reflects this difference Gamete Type (linked) A B A b a B a b f(A) = 0.7 f(B) = 0.6 f(a) = 0.3 f(b) = 0.4 www.pinegenome.org/ctgn Gamete Frequency No LD 0.42 0.28 0.18 0.12 Gamete Type (unlinked) Higher LD Lower LD 0.60 0.55 0.10 0.15 -0.05 0.30 0.25 Allele Frequencies A B A b a B a b f(A) = 0.7 f(B) = 0.6 f(a) = 0.3 f(b) = 0.4 73 Summary: Population Genetics Population genetics extends Mendelian genetics to describe how allele and genotype frequencies can be predicted given certain dynamic population processes For populations in Hardy-Weinberg (HW) proportions, genotype frequencies are easily calculated given allele frequencies HW proportions are used as a comparative baseline Population genetics questions include – How much genetic diversity (heterozygosity) are in populations? – How is genetic diversity distributed? – What mechanisms have shaped the diversity we observe? Our challenge: How can we measure, interpret, and utilize genetic diversity? www.pinegenome.org/ctgn 74 Quantitative Genetics www.pinegenome.org/ctgn 75 Quantitative genetics Metric traits – show continuous variation – cannot be grouped into discrete categories – are affected by environmental influence (to a larger extent) Traits such as: – – – – growth, Survival, Reproductive ability cold hardiness, Drought hardiness wood quality, Disease resistance economic Traits! Adaptive Traits! Applied & Evolutionary Genetic Principles: – builds upon both Mendelian and population genetics – not limited to traits influenced by only one or a few genes – analysis encompasses traits affected by many genes www.pinegenome.org/ctgn 76 Quantitative genetics Extends analysis of Mendelian traits (with discrete phenotypes) to metric traits (continuously distributed, often influenced by many genes) Describes genetic variation based on phenotypic resemblance among relatives Is usually the primary genetic tool for plant and animal breeding Provides the basis for evaluating the relative genetic merit of potential parents Provides tools for predicting response to selection (genetic gain) www.pinegenome.org/ctgn 77 Distinctions Mendelian vs metrical – lies in the magnitude of effect – recognizable discontinuity – Mendelian (major) – non-recognizable discontinuity – metrical (minor) How can Mendelian and metrical inheritance be reconciled? How can we explain the continuous variation of metrical traits in terms of the discontinuous categories of Mendelian inheritance? – simultaneous segregation of many genes – non-genetic or environmental variation (truly continuous effects) www.pinegenome.org/ctgn 78 Additive trait with two genes Consider crossing AABB x aabb, where phenotypically: AABB, with 4 doses aabb, with 0 doses Then crossing F1: AaBb x AaBb Genotype/ (Dose) AB Ab aB ab (2) (1) (1) (0) AB (2) AABB (4) AABb (3) AaBB (3) AaBb (2) 4 doses, 1 Ab AABb AAbb AaBb Aabb 3 doses, 4 (1) (3) (2) (2) (1) aB (1) AaBB (3) AaBb (2) aaBB (2) aaBb (1) ab AaBb Aabb aaBb aabb (0) (2) (1) (1) (0) www.pinegenome.org/ctgn Doses, Color & Number 2 doses, 6 1 doses, 4 0 doses, 1 79 Now consider a trait influenced by 3 genes Similar to previous example with 2 genes ‗Upper-case' alleles (black dots) behave as unit doses. Genotypes with comparable doses are grouped together in colored boxes Colors depict groups with identical phenotypes Gene effects are additive Hartl & Jones, 2004. www.pinegenome.org/ctgn 80 Phenotypes Phenotypic categories from the previous slide are represented here in the histogram, with bar heights showing the relative frequency of each category Quantitative genetics describes populations using trait means variances, as well as co variances among traits and relatives www.pinegenome.org/ctgn 81 How to describe a population? Mean ≈ average Variance is dispersion around the mean – individual observations (usually) differ from the mean – deviation is distance from mean – variance is average squared deviation www.pinegenome.org/ctgn 82 Height in humans is a quantitative trait Students from the University of Connecticut line up by height: 5‘0‖ to 6‘5‖ in 1‖ increments. Women are in white, men are in blue Crow 1997. Genetics 147:1-6 www.pinegenome.org/ctgn 83 Population properties for metrical traits Means, variances, covariances Measuring variation within and among families allows estimating genetic and environmental variance components Phenotypic resemblance among relatives allows estimating heritability, breeding values, genetic correlations and so forth www.pinegenome.org/ctgn 84 We also measure properties of genes Dominance – allelic interactions at a locus Epistasis - non-allelic interactions Pleiotropy – allelic affects on multiple traits Linkage – genes on the same chromosome tend to be inherited together Fitness – how genes affect the likelihood that an individual survives and reproduces (may be natural or artificial) www.pinegenome.org/ctgn 85 Phenotypic expression of a metrical trait www.pinegenome.org/ctgn 86 Partitioning phenotypic variance As we‘ve seen, an individual‘s phenotype reflects both genetic and environmental influences, modeled as P=G+E With variances shown as Var (P) = Var (G) + Var (E) or σ2p = σ2G +σ2E Where – P = Phenotypic variance – G = Genetic variance – E = Environmental variance www.pinegenome.org/ctgn 87 Partitioning phenotypic variance Genetic variance includes a combination of additive and nonadditive (mostly dominance, but also epistasis and other) G=A+ I With variances (and expanding from the previous slide) Var (P) = Var (A) + Var (I) + Var (E) Or σ2p = σ2A +σ2I + σ2E Where – A = Additive genetic variance (breeding value) – I = Interaction, or non-additive – E = Environmental Variance www.pinegenome.org/ctgn 88 Breeding value (additive genetic value) The sum of all average allelic effects at each locus influencing the trait of interest – alleles, not genotypes are passed on to the next generation Breeding value is a concept associated with parents in a sexually breeding population Historically, average allelic effects could not be measured…now they can – how? – what is effect of population gene frequencies on average effect? www.pinegenome.org/ctgn 89 Non-additive genetic variance Non-additive effects depend on the interactions of specific alleles Specific combinations of allelic effects cannot be predicted in a general way, for example Dominance – dominant (vs. recessive) gene action reflects allelic interactions for one gene – multiple genes can be involved simultaneously – dominance variance summarizes all of these interactions Epistasis – allelic interactions involving specific combinations of alleles of different genes – may involve two or more genes – epistatic variance summarizes all of these interactions www.pinegenome.org/ctgn 90 Genetics and the environment Among trees, phenotypic variation for most traits represents more environmental variation, rather than genetic It‘s hard to judge the genetic value of a tree just by looking at it Heritability (h2) – the percentage of variation among trees that is genetic – h2 ranges from 0 to 100% (0 to 1) – Heritability for growth is often only 10-30% (0.10 – 0.30) – Low heritabilities make genetic improvement difficult www.pinegenome.org/ctgn 91 Heritability (h2) P= G+E h2 = σ2G /σ2P www.pinegenome.org/ctgn E P G 92 92 Heritability A measure of the degree to which the variance in the distribution of a phenotype is due to genetic causes In the broad sense, heritability is measured by the total genetic variance divided by the total phenotypic variance In the narrow sense, it is measured by the genetic variance due to additive genes divided by the total phenotypic variance Heritability is mathematically defined in terms of population variance components. It can only be estimated from experiments that have a genetic structure: sexually produced offspring in this case www.pinegenome.org/ctgn 93 Narrow-sense heritability, h2 Thus, narrow sense heritability can be written as h2 = σ2A/ σ2P = σ2A/ (σ2A + σ2I + σ2E) Where – σ2P is the phenotypic variance, which can be partitioned as – σ2A is the additive genetic variance (variance among breeding values in a reference population) – σ2I is the interaction or non-additive genetic variance (which includes both dominance variance and epistatic variance) – σ2E is the variance associated with environment www.pinegenome.org/ctgn 94 Broad sense heritability (H2, or h2b) Broad sense heritability is used when we deal with clones! Clones can capture all of genetic variance due to both the additive breeding value and the non-additive interaction effects. Thus, H2 = σ2G / (σ2A + σ2I + σ2E) = (σ2A + σ2I) / (σ2A + σ2I + σ2E) Consequently, broad sense heritability is typically larger than narrow sense heritability and progress in achieving genetic gain can be faster when clonal selection is possible. What might be a drawback to clonal based programs? www.pinegenome.org/ctgn 95 Genetic gain (G) Level of improvement in one or more measured traits as compared to natural or unimproved populations Usually expressed as a percentage Improvement is determined by two factors: – Heritability of the trait (h2) – Selection Differential (S) Genetic Gain = Heritability x Selection Differential G = h2 x S Gain can be expressed in other ways as well www.pinegenome.org/ctgn 96 Another expression of genetic gain G = h2 i σp A related measure of genetic gain is based on three parameters…for this measure, S = i x σp, where – Heritability (h2 or H2) – Measure of the degree to which the variance in the distribution of a phenotype is due to genetic causes – Selection intensity (i) – Difference between the mean selection criterion of those individuals selected to be parents and the average selection criterion of all potential parents, expressed in standard deviation units – The proportion of trees selected from the population of trees measured for the trait – Phenotypic standard deviation of a trait (σ p) www.pinegenome.org/ctgn 97 Predicting genetic gain Gain = h2 (selection differential) selection differential = i σP Gain = h2 i σP Get more gain by controlling the environmental variation and increasing h2 www.pinegenome.org/ctgn Get more gain by selecting a smaller proportion of the population 98 98 Calculating the selection differential S = 1.6 S = 2.8 S = 1.4 (from Falconer and Mackay 1996) Selection differential is a function of the selection intensity (i), which is related to the proportion of the population selected, and the variability of the population (σP): Selection Differential = i x σp www.pinegenome.org/ctgn 99 A little more on selection intensity The factor most under breeders control i increases as the fraction of trees selected decreases Law of diminishing returns takes hold Intensity drops rapidly with increasing number of traits selected simultaneously (See White et al. 2007 p. 342) White et al 2007 www.pinegenome.org/ctgn 100 Additional complications: more traits and how they are evaluated Selecting on one or a few traits is complicated enough As more traits are evaluated, then we must also consider – – – – indirect selection genetic correlations correlated selection response methods for multi-trait selection Including a few to (perhaps) thousands of genetic markers to facilitate selection is a non-trivial matter www.pinegenome.org/ctgn 101 Genetic correlations Correlations in phenotype – may be due to genetic or environmental causes – may be positive or negative Genetic causes may be due to – pleiotropy – linkage – gametic phase disequilibrium The additive genetic correlation (correlation of breeding values) is of greatest interest to plant breeders – genetic correlation usually refers to the additive genetic correlation (r G is usually rA ) We typically measure phenotypic correlations Falconer and Mackay, Chapt. 19; Bernardo, Chapt. 12 www.pinegenome.org/ctgn 102 Indirect selection Can we make greater progress from indirect selection than from direct selection? Molecular markers are strongly inherited (h2~1), and yet their correlation with a desired trait can be indirect Need to consider other factors in using markers (e.g. time & cost) Is there a benefit to practicing both direct and indirect selection at the same time? www.pinegenome.org/ctgn 103 Strategies for multiple trait selection We usually wish to improve more than one trait in a breeding program Traits may be correlated or independent from each other Options… – independent culling – tandem selection – index selection www.pinegenome.org/ctgn 104 Independent culling Minimum levels of performance are set for each trait 10 9 8 7 Trait Y 6 5 4 3 2 1 0 0 1 2 3 4 5 6 7 8 9 10 Trait X Jennifer Kling, OSU www.pinegenome.org/ctgn 105 Tandem selection Conduct one or more cycles of selection for one trait, and then select for another trait 10 9 8 Select for trait X in the next cycle 7 Trait Y 6 5 4 3 2 1 0 0 1 2 3 4 5 6 7 8 9 10 Trait X Jennifer Kling, OSU www.pinegenome.org/ctgn 106 Selection indices Values for multiple traits are incorporated into a single index value for selection 10 9 8 7 Trait Y 6 5 4 3 2 1 0 0 1 2 3 4 5 6 7 8 9 10 Trait X Jennifer Kling, OSU www.pinegenome.org/ctgn 107 Effects of multiple trait selection Selection for n traits reduces selection intensity for any one trait Reduction in selection intensity per trait is greatest for tandem selection, and least for index selection Expected response to selection index selection ≥ independent culling ≥ tandem selection To this discussion we will ultimately add the approaches of BLUP and genome wide selection www.pinegenome.org/ctgn 108 How to estimate the genotype of a tree? Traditionally, by measuring… The average performance of many ―copies‖ of the same tree (i.e., the same genotype) – Clones can be produced via rooted cuttings or tissue culture The average performance of its offspring The average performance of its siblings – (i.e., 'brothers and sisters') www.pinegenome.org/ctgn 109 Genetic evaluation trials Common-garden experiments can be used to separate genetic from environmental effects Plantation #1 Block #1 Block #2 Plantation #2 Block #1 Block #2 Family 8 Family 6 Family 3 Family 8 Family 7 Family 2 Family 7 Family 5 Family 3 Family 9 Family 9 Family 1 Family 4 Family 8 Family 8 Family 9 Family 9 Family 5 Family 4 Family 4 Family 6 Family 1 Family 1 Family 6 Family 2 Family 7 Family 2 Family 2 Family 1 Family 4 Family 5 Family 3 Family 5 Family 3 Family 6 Family 7 www.pinegenome.org/ctgn 110 Estimating a tree‘s genotype Until recently, this has been accomplished using evaluation trials As genomics tools and platforms have developed, we are more seriously evaluating the potential of genetic markers to augment phenotypic assessments – QTL mapping in pedigreed populations – association genetics How might marker data be incorporated in breeding? – – – – selection index correlated traits BLUP genomic selection We will revisit this question over the next few days www.pinegenome.org/ctgn 111 Summary: Quantitative genetics Quantitative genetics deals with metrical traits – two or more loci, their interactions with each other and their environment Properties of populations and genes Crop improvement programs use basic parameters of means, variances, covariances to calculate relevant heritabilities, gain, etc Characterizing genotypes require breeding and evaluation of offspring or other relatives www.pinegenome.org/ctgn 112 Overall summary: Population and Quantitative genetics These disciplines provide the theoretical framework for applied breeding Both disciplines rely on mathematical models and probabilities Models are based on approximations of how genes function End result…both provide predictions of genetic behavior, involving – allele and genotype frequencies (population genetics) – population-level phenotypes and genetic gain (quantitative genetics) www.pinegenome.org/ctgn 113