Survey
* Your assessment is very important for improving the work of artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the work of artificial intelligence, which forms the content of this project
Chapter2:Genesinpedigrees «Celuiquinecomprendpas,etquiledit,estcelui qui fait le plus évidemment preuve d’intelligence, carilacomprisqu’iln’apascomprisetc’estcequi est le plus difficile à comprendre ….» Albert Jacquard.Lascienceàl’usagedesnon-scientifiques, 2001. Haploidgametesareproducedbymeiosis Parents transmit their genes to their offspring via the gametes: the large immotile oocytes of the mother and the miniature motile spermatozoa of thefather.Bothtypesofgametesareproducedby aspecializedtypeofcelldivision,knownasmeiosis, thatproduceshaploidgametesfromdiploidcellsof the germ line. Contrary to the somatic cells and cells of the germ line, gametes only contain one copy of the “genomic encyclopedia”. The number of chromosomes found in haploid gametes is said to be “n”, while diploid cells contain “2n” chromosomes (corresponding to “n” pairs of homologouschromosomesorhomologues). Before entering meiosis, spermatogonia in the testisandoogoniaintheovaryundergoreplication oftheirDNAduringanSphasethatisverysimilar tomitosis-precedingS-phasesexceptforitsslightly increased duration. After S-phase, each chromosome in the cell is present as two sister chromatidsconnectedbycohesins.Gametogenesis then proceeds with a very unusual type of cell division: meiosis I. Rather than segregating one sisterchromatidofeachhomologueineachofthe two daughter cells, each cell will inherit the two sister chromatids of one homologue for each chromosome (ignoring recombination for the moment). As mitosis, meiosis I is subdivided in a prophase, metaphase, anaphase and telophase. Prophase I is itself subdivided in four stages: leptotene,zygotene,pachytene,anddiplotene. Leptotene is characterized by the condensation of thereplicatedchromosomesand-moststrikingly- by the pairing of homologues. The condensed sister chromatids of the paternal copy of chromosome 1 will align themselves with the homologoussisterchromatidsofthematernalcopy of chromosome 1 along their entire length. The same will happen for each chromosome pair, forming as many bivalents. How are homologous chromosomes finding each other in the nucleus? Pairing appears to be mediated by the process of “homologous recombination”. A specialized protein (called Spo11 in yeast) introduces staggered cuts in the DNA known as doublestranded breaks. On either side of the break, exonucleasesthengeneratelong,single-stranded3’ overhangs. Proteins of the RecA family (called Rad51 in yeast) cover the single-strands and catalyzethesearchforhomologousDNAsequences by strand-invasion, i.e. the formation of threestranded structures comprising a region of heteroduplex DNA and corresponding displaced strand. Branch migration extends the heteroduplex region, while formation of a doubleHolliday junction (involving DNA synthesis and ligation) seals the local connection between homologues. Pairing involves tens to hundreds of exchangesofthiskindperchromosomepair. During zygotene, the alignment of homologous chromosomes is stabilized by formation of synaptonemal complexes: ladder-like structures comprising a pair of axial cores bridged by transversefilaments.Pachytenemarkscompletion ofsynapsis.Double-Hollidayjunctionsareresolved by formation of DNA strand cuts and re-ligation, which can happen in two ways. By far the most common pathway leaves a local segment of heteroduplex as sole trace of the recombination process. Approximately one double-Holliday junction per chromosome arm is resolved using a distinct pathway that creates a crossing-over between homologous sister chromatids. Crossingovers result in pairs of double-stranded-helixes with distinct parental origin on either side of the crossing over (maternal-CO-paternal and paternalCO-maternal). Paternalandmaternalsequencesmaydifferwithin the heteroduplex regions as a result of Single Nucleotide Polymorphisms (or SNPs) that occur in all populations (see hereafter). This may cause a mismatchthatlocallydeformtheDNAdoublehelix andmaytriggerDNArepair.Inthisprocess,known as “gene conversion”, the sequence originating fromoneparentis“converted”tothatoftheother. Diplotene is characterized by the disassembly of the synaptonemal complexes, and further chromosomal condensation. Paired homologs remainconnectedatchiasmata(singularchiasma), the sites where crossing-overs occurred. The end ofprophaseIisreferredtoasdiakinesis. The remainder of meiosis I (i.e. metaphase I, anaphaseI,telophaseI)ishighlysimilartomitosis, exceptthat(i)thekinetochoresofsisterchromatids bind to microtubules originating from the same pole (rather than opposite poles as in mitosis), (ii) chiasmata connect homologues (rather than cohesins connecting sister chromatids during mitosis)priortosegregation,enablingalignmentof thechromosomesonthemetaphaseplate,and(iii) cohesins at the kinetochores are protected from Chapter2:GenesinpedigreesPage1/14 the action of separase by proteins called shugoshins. Meiosis II is in nearly all respect equivalent to a mitoticdivision. The diploid germ cells that undergo meiosis are calledoogonia(singularoogonium)infemales,and spermatogonia(singularspermatogonium)inmales. Both types of cells derive from a population of primordial germ cells (PGC) that migrate to the developinggonads. Inafemalefetus,PGCwilldividemitoticallyinthe 6 ovarytoyield∼5x10 oogonia.Itisestimatedthat ∼22celldivisionsseparateeachoogoniumfromthe zygote from which they derive. The fetal oogonia thenentermeiosisI,whichis–however-arrested at the diplotene stage of prophase I. The corresponding germ cells are now called primary oocytes. Only at puberty, stimulated by gonadotropins secreted by the pituitary gland at eachmenstrualcycle,willoneorasmallnumberof primary oocytes complete meiosis I. The corresponding cytokinesis is strikingly asymmetric, yielding a large secondary oocyte inheriting virtuallyallthecytoplasmiccontentattheexpense oftheminutefirstpolarbody.Secondaryoocytes are released from the ovary by the process of ovulation, yet remain arrested in metaphase II. Only if fertilized by a sperm cell, will secondary oocytes complete meiosis II, releasing a fertilized matureegg(orzygote)andsecondpolarbody. In a male fetus, PGC migrate in the developing 9 testis,wheretheymultiplymitoticallytoyield∼10 spermatogonia lining the basal membrane of the seminiferoustubulesatthetimeofpuberty.Each one of these is separated by ∼30 mitotic divisions fromthezygotefromwhichtheyderive.Starting atpuberty,spermatogoniawillengageevery∼two weeksinanasymmetricmitoticdivision,yielding(i) astemcell-likespermatogoniumtosupportfuture spermatogenesis, and (ii) a maturing spermatogonium that commits to completing spermatogenesis, implying three more mitotic divisions (yielding eight primary spermatocytes), followed by meiosis I (yielding sixteen secondary spermatocytes), and meiosis II (yielding 32 spermatids that will terminally differentiate in as manymaturespermcells).The32cellsthatderive from a maturing spermatogonium remain connected by cytoplasmic bridges, forming a syncytium until the very final stages of sperm maturation. Although genetically haploid, secondary spermatocytes and spermatids are in effect functionally diploid. As a result, although halvethesecondaryspermatocytesandspermatids are X-bearing and the other halve Y-bearing, all of them will contain all the gene products derived fromtheXandYchromosome,ofwhichsomeare essential to complete spermatogenesis. Contrary totheegg,whichisseparatedfromthezygotebya constantnumberof∼23celldivisions,thenumber ofcelldivisionsleadingfromthezygotetoasperm cell equals ∼34 + (26 x number of years after puberty). Sexualreproduction,involvingthefusionofhaploid gametescontributedbyanimalsofoppositesex,is an elaborate and costly process. Yet is utilized by the vast majority of plants and animals. It thereforemustconferaselectiveadvantage,which nature–however-remainspoorlyunderstood.As a result of the independent segregation of the maternal and paternal homologues for distinct chromosomes (Mendel’s second law), as well as theirreshufflingbytheprocessofcrossing-over,no two gametes produced by an individual are identical. For species producing many offspring, this might ensure that at least some offspring wouldbewelladaptedtoachangingenvironment. Sexual reproduction promotes the combination of multiple favorable mutation having occurred in independent lineages, while asexual reproduction requires such favorable mutations to occur sequentially in the same lineage. Sexual reproductionhasalsobeenproposedasameansto eliminate deleterious mutations, which would otherwiseaccumulateinthepopulation. In most sexually reproducing organisms, haploid cellsdonotmultiply.Onlythediploidsomaticand germline cells multiply mitotically. In some primitive plants, diploid and haploid cells proliferate.Insomespeciesofyeast,onlyhaploid cells proliferate, while the diploid zygote immediately engages in meiosis to produce new haploidcells. Mendel’slaws When asked what genetics is all about, students typically respond with “the science of heredity” or “the study of genes and their action”. Indeed as apparentfromthetableofcontentsofthiscourse, genetics studies the behavior and mode of action of genes in cells (molecular genetics), pedigrees (factorial and quantitative genetics), and populations(populationandevolutionarygenetics). But, equally important, genetics also refers to a wayofdoingscience,anexperimentalapproachto unravel complex biological phenomena. If Gregor Mendel is as much revered by geneticists as he is today, it is because he was probably the first to applythismethod,andthisinanexemplaryfashion. Providing extra appeal to geneticists, he applied it tothestudyofheredity. Mendel was born Johann in the now Czech Republic in 1822. He was raised on a farm, and wenttoUniversitybeforeenteringpriesthoodand the Augustinian Abbey of St Thomas in Brno. He performed research in plant and animal breeding but also astronomy and meteorology. His now famousresultsonplanthybridizationswentlargely Chapter2:GenesinpedigreesPage2/14 unnoticed for ∼35 years (including by Darwin) beforebeingrediscoveredaround1900.Heisnow generally considered to be the father of modern genetics. Mendel’s most famous experiments exemplify the key features of a well designed and executed experiment, and these are still highly relevant in thewayscienceisconductedtoday.Hefirstsetthe stage by carefully designing an experimental system. In his case he used peas, which he could multiply in a controlled fashion. As peas are hermaphrodites (the same plant produces both male and female gametes), he could both “self” plants. But he could also perform outcrossing at will, by excising the undeveloped male pollenproducing structures of a plant, and expose the femalestructurestomaturepollenfromadifferent plant using a pencil. Repeated selfing of plants with unique attributes (called phenotypes) let to pure lines, i.e. plants that would yield uniform offspring upon self-fertilization. By doing so Mendel developed pure “parental” lines (F0) that differed for one or more of seven “binary” phenotypes, including seed shape (round or wrinkled)andcolor(yelloworgreen). Inhisfirstexperiments,Mendelcrossedpure-lines differingforonesuchbinaryphenotype,producing aso-calledF1generation.HethenselfedtheF1’s, producing the F2 generation. The outcome of thesecrossessharedcommonfeatures:(i)theF1’s exhibited the phenotype of one of the crossed parentallines,(ii)¾oftheF2’sexhibitedthesame phenotype as the F1 generation, while the remaining¼re-expressedthephenotypeoftheF0 lineunseenintheF1generation. Mendelthenproposedamodel(Mendel’sfirstlaw of segregation) that would account for his observations. He posited that each binary trait is determined by a hereditary particle (now referred toas“gene”)(f.i.thegenedeterminingseedshape), that comes in two forms or alleles (f.i. the round alleleandthewrinkledallele).Individualshavetwo copiesofeachgene:oneinheritedfromthefather the other from the mother. Individuals inheriting the same allele from both parents are said to be homozygous or have homozygous genotype (f.i. wrinkled-wrinkledorround-round)andexpressthe correspondingphenotype(respectively,wrinkledor round). Individuals that inherit a different copy from each parent are heterozygous (f.i. genotype round-wrinkled). As one allele is dominant (and the other recessive), heterozygous individuals express the same phenotype as the homozygotes forthedominantallele.Gametescontainonlyone copy of each gene. Gametes from heterozygous parents have equal probability to harbor either of theallelesoftheparent.Referringtothedominant round allele as W and to the recessive wrinkled allele as w, Mendel’s model applied to crosses between pure lines of peas with round and wrinkledseedscanbesummarizedasfollows: F0:WWxww F1:Ww F2 Male gametes W(1/2) w(1/2) Femalegametes W(1/2) w(1/2) WW(1/4) Ww(1/4) Ww(1/4) Ww(1/4) Thefinalbutcrucialstagewastotesttheproposed model. The model indeed makes a number of predictions, including about the expected phenotypicproportionsonselfingF2,i.e.intheF3 generation. According to the model, wrinkled F2 are all homozygous ww and should therefore produce all wrinkled offspring when selfed. More interestingly,themodelpredictsthat1/3ofround F2’s should be homozygous WW and hence produceallroundoffspringonlywhenselfed,while 2/3ofroundF2’sshouldbeheterozygousWwand hence behave as the F2’s when selfed, i.e. yield ¾ roundand¼wrinkledF3’s.Andthisisindeedwhat Mendelobservedwhenhetestedhishypothesis. Another important observation made by Mendel, wasthe“equivalenceofreciprocalcrosses”,i.e.the outcome was independent of the sex-byphenotype combination in the F0 generation. In theexampleoftheseedshape,thismeansthatthe outcomeofthecrosseswasthesamewhetherthe wrinkledF0parentwasmaleorfemale. Inasecondseriesofexperiments,Mendelcrossed pure-lines that differed for two binary phenotypes (f.i.round&yellowcrossedwithwrinkled&green). In agreement with the previous experiments, the F1 generation only exhibited the dominant phenotypes as defined before (f.i. round and yellow). When selfing the corresponding F1’s he always obtained the four possible phenotypic combinations in the following proportions in the F2: 9/16 (dominant-dominant; f.i. round-yellow), 3/16 (dominant-recessive; f.i. round-green), 3/16 (recessive-dominant;f.i.wrinkled-yellow),and1/16 (recessive-recessive;f.i.wrinkled-green). The model he proposed to account for these observations, was that alleles of different genes segregateindependentlyinthegametes(Mendel’s secondlawofindependentsegregation).Thus,ifa individual is double heterozygote, he will produce four types of gametes in equal proportions. Applied to crosses between wrinkled-yellow and round-green peas, and referring to the dominant yellowallelesasGandtotherecessivegreenallele asg,hismodelcanbesummarizedasfollows: F0:WWGGxwwgg Chapter2:GenesinpedigreesPage3/14 F1:WwGg MaleGametes F2 Femalegametes WG (1/4) Wg (1/4) wG (1/4) wg (1/4) WG (1/4) WWGG (1/16) WWGg (1/16) WwGG (1/16) WwGg (1/16) Wg (1/4) WWGg (1/16) WWgg (1/16) WwGg (1/16) Wwgg (1/16) wG (1/4) WwGG (1/16) WwGg (1/16) wwGG (1/16) wwGg (1/16) wg (1/4) WwGg (1/16) Wwgg (1/16) wwGg (1/16) wwgg (1/16) Asbefore,thisextendedmodelmakesanumberof predictions which Mendel’s verified. One of the checks he carried out was to cross the F1’s with a tester. A tester is a pure line that is homozygous fortherecessiveallelesatallexaminedgenes.Asa consequence, the phenotype exhibited by the offspring directly reflects the genotype of the gametestransmittedbytheF1.Aspredictedbyhis model, this test-cross yielded the four possible phenotypiccombinationsinequalproportions. By performing an apparently simple “bean bag” experiment, and just on the basis of observed phenotypicproportions,Mendelproposedamodel that accurately described essential features of heredity and meiosis. Make sure to identify the featuresofmeiosisthatunderlieMendel’sfirstlaw of segregation and second law of independent assortment. Subsequently, other scientists tested Mendel’s models,includinginnewmodelorganismssuchas the fruitfly D. melanogaster. In a few (lucky) instances, Mendel’s predictions proved to be inaccurate, and predictive models had to be amended accordingly. As an example, violation of the equivalence of reciprocal crosses let to the discovery of the sex chromosomes, and of Mendel’s second law to the discovery of genetic linkage. (BoxNomenclature) Geneticlinkage Mendel’ssecondlawstatesthatanindividualthat is heterozygous for two genes (f.i. AaBb) produces fourtypesofgametesinequalproportions:AB,Ab, aB and ab. However, geneticist discovered in the early20-thcenturythatMendel’ssecondlawdidn’t apply to all pairs of genes. For such pairs, two types of gametes were significantly overrepresented at the expense of the two other classes. The over-represented types corresponded to the “parental” gametes, i.e. gametes with genotype identical to the ones inherited by the double heterozygous individual from its parents. AssumethattheparentswererespectivelyofAABB and aabb genotype, the informative AaBb individual would preferentially transmit “parental” AB and ab gametes, at the expense of so-called “recombinant” Ab and aB gametes. If – on the otherhand–theparentswererespectivelyofAAbb and aaBB genotype, the AaBb offspring would preferentially transmit “parental” Ab and aB gametes at the expense of “recombinant” AB and ab gametes. Determining which gametes transmitted by a double heterozygous individual are parental and which are recombinant is called “determining the linkage phase”. One way to deducethelinkagephaseisfromtheanalysisofthe parental genotypes – as shown above. When the linkage phase of a double heterozygous individual is known, this can be indicated by writing the genotype as follows: AB/ab. Pairs of genes characterized by an excess of parental and lack of recombinant gametes are said to be genetically linkedorexhibitgeneticlinkage.Theproportionof recombinant gametes was rapidly shown to vary for different pairs of linked genes, and to be independent of the phase of the informative individual (i.e. it would be the same for an AB/ab andanAb/aBindividual). It is now well established that linkage occurs between genes that reside on the same chromosome. Such genes are said to be syntenic. ThefactthatMendeldidnotobservelinkageisdue tothefactthat-bychance–thesevengenesthat he studied were non-syntenic. In the case of syntenic genes, recombinant gametes derive from meiosesinwhichacrossing-overoccurredbetween the two considered genes. Such event is more likelyifthetwosyntenicgenesarefarapartthanif theyareclosetoeachother.Thedegreeoflinkage, reflected by the proportion of recombinant gametes (or recombination frequency - RF), can therefore be viewed as a measure of the distance separatingsyntenicgenes. Itwasquicklyrecognizedthatsyntenicgenescould be positioned relative to each other based on observed RF, i.e. that one could build genome mapsbylinkageanalysis.Imaginethattest-crossing an ABC/abc individual yields the following set of progeny: Phenotype Number Proportion ABC 350 .350 abc 360 .360 ABc 6 .006 abC 4 .004 Abc 92 .092 aBC 98 .098 AbC 43 .043 aBc 47 .047 Total 1,000 1 Rememberingthatinatestcrossthephenotypesof the offspring directly reflect the genotype of the gamete transmitted by the triple heterozygote parent, the observed RF between A and B equals (92+98+43+47)/1000or0.28.Achi-squared-based Chapter2:GenesinpedigreesPage4/14 goodness-of-fit test indicates that the observed gametic frequencies differ very significantly from those expected under Mendel’s second law, with an excess of parental over recombinant gametes typicaloflinkedgenes: O(bs) E(xp) 2 (O-E) /E AB ab Ab aB 350+6 360+4 92+43 98+47 250 250 250 250 44.9 52.0 52.9 44.1 2 Chi p 193.9 -6 <10 TheRFbetweenAandCequals(6+4+92+98)/1000 or 0.20, which also departs very significantly -6 (pgoodness-of-fit<10 ) from Mendelian expectations indicatingthatAisalsolinkedtoC.Alongthesame lines, the RF between B and C equals -6 (6+4+43+47)/1000or0.10(pgoodness-of-fit<10 ). ThedatathusindicatethatA,BandCaresyntenic, while analysis of pairwise RF (AB: 0.28; AC: 0.20; BC:0.10)indicatethatthecorrectgeneorder(out ofthethreepossibleorders)isA-C-B,i.e.thatgene CislocatedbetweengenesAandB. Note that we could have reached the same conclusions even without a priori (f.i. based on knowledge of the genotype of its parents) information about the linkage phase of the informative triple heterozygous parent. Indeed, the two most frequently transmitted gametes (ABC: 350/1000 and abc: 360/1000) indirectly determinethephase,whilethetwoleastcommon gametes (ABc: 6/1000 and abC: 4/1000) identify the double recombinants thereby unambiguously establishingorder. It is worthwhile noting that the distances in RFunits between A-C (0.20) and C-B (0.10) add up to more than the distance between A-B (0.28). This non-additivityisofcoursenotadesirableproperty for a distance measure. Closer examination indicatesthatthelowerthanexpectedRFbetween A and B is due to the fact that 10 gametes have been counted as parentals, while being in fact double-recombinants (having recombined in both theA-CandC-Bintervals).Thefactthatwehave informationaboutCbetweenAandBallowedusto detect recombinational events, which we would havemissedotherwise.If,ratherthanconsidering these 10 gametes as parental, we consider that they jointly contribute 20 recombination events, the corrected RF between A and B becomes 0.30 which is exactly equal to 0.20 (A-C) + 0.10 (C-B). Thisindicatesthatadistancemeasurebasedonthe numberofCOintheconsideredintervalwouldbe betterthanonebasedontheobservedproportion of parental gametes (which in fact includes gameteswithzerobutalsowithanevennumberof COintheintervalofinterest).Asamatteroffact, even if one can’t directly observe all CO that may have occurred between two loci from genotype data, one can fairly accurately estimate their frequencyfromtheobservedRF.Themathematical relationship between the observed RF and the estimated CO frequency (COF) is called a mapping function. ThesimplestmappingfunctionisHaldane’s.Itcan bederivedasfollows.Assumetwosyntenicloci,A andB.Assumethemanymeiosesoccurringinthe gonadsofadoubleheterozygousAB/abindividual. A proportion of meiosis will be characterized by 0 chiasmata(CH)betweenAandB,aproportionby1 CH, a proportion by 2 CH, a proportion by 3 CH, etc ... We will assume that these proportions are accuratelydescribedbyaPoissondistribution,i.e. p(x;m) = e−m m x x! inwhichmistheaveragenumberofCHoccurring betweenAandBacrossallmeioses.Notethatthe -m proportionofmeiosiswith0CHhenceequalse . By definition gametes issued from meiosis with 0 CH between A and B can only be parental. Gametes issued from meioses with one CH between A and B, have 50% chance to be recombinantand50%chancetobeparental.This is due to the fact that CH occur at prophase I, involving two of the four present chromatids. Paradoxically, gametes issued from meiosis with two or more CH between A and B, likewise have 50%chancetoberecombinantand50%chanceto beparental.Thiscanbeseenformeiosiswithtwo CHasfollows.Letuslabelthesisterchromatidson thepaternalhomologue1and2,andthoseonthe maternalhomologue3and4.EachCHcaninvolve four possible chromatid combinations (1+3; 1+4; 2+3; 2+4). Meioses with two CH can thus correspondto16possiblechromaticcombinations, which are a priori equally probable. For four of these,thetwoCHinvolvethesamechromatidpair (f.i. 1+3 & 1+3). All gametes derived from such meiosisareparental.Foreight,thetwoCHinvolve three chromatids (f.i. 1+3 & 1+4). Halve the gametes derived from such a meiosis will be parental, the other halve recombinant. For the remainingfour,thetwoCHjointlyinvolvethefour chromatids (f.i. 1+3 & 2+4). All gametes derived from such meiosis will be recombinant. Taken together, half the gametes derived from meioses withtwoCHbetweenAandBwillbeparental,the otherhalfrecombinant.Thesameconclusioncan be reached for meioses with three CH between A and B (64 possible chromatid combinations), four CH,etc. Therefore, the expected RF between loci A and B equals: 1 RFA−B = (1− e−m ) 2 where (1− e−m ) corresponds to the proportion of meiosiswithatleastoneCHbetweenAandB. Chapter2:GenesinpedigreesPage5/14 GeneticistsprefertousetheaveragenumberofCO (COF)pergameteasdistancemeasure,ratherthan the average number of CH per meiosis (m), but thesetwoaresimplyrelatedas2xCOF=m.Hence, 1 RF = (1− e−2*COF ) 2 and 1 COF = − ln(1− 2RF) 2 COFisexpressedinmorgansorcentimorgans(cM). One cM corresponds to one CO per hundred gametes.ThusRFbetweenlociaremeasuredfrom thegenotypedata,andsubsequentlyconvertedto cM (i.e. an additive measure of genetic distance) usingmappingfunctions. Haldane’smappingfunctionassumesthatCHoccur at random along a bivalent and have no influence on each other. This does not correspond to the reality.First,therearevirtuallynonullichiasmatic meiosis, as chiasmata play a crucial mechanistic role in linking homologous chromosomes during meiosis I. This observation is sometimes referred to as the “obliged crossing-over” rule. Secondly, multiple CO tend to occur farther apart than expectedbychancealone.Thelatterphenomenon is called positive interference, i.e. the fact that a firstCOdiminishestheprobabilityofasecondone in its immediate vicinity. The degree of interferencebetweenintervals(sayA-BvsB-C)can bemeasuredona0to1scaleas: I = 1− DCOOBS DCOEXP where DCOOBS/DCOEXP is the ratio between the observed and expected numbers of doublecrossovers, also known as coefficient of coincidence. In the previous example, the coefficientofcoincidencewas((4+6)/1000)/0.1*0.2 =0.5,correspondingtoaninterferenceof0.5. Mappingfunctionshavebeenmodifiedinorderto accountforthestronginterferencethatisobserved for closely linked intervals in many organisms, including mammals. The most commonly used mappingfunctionisKosambi’sone.Itassumesthat thelevelofinterferencediminisheswithincreasing distance. Centimorgans, as defined above, are a convenient additive measure of genetic distance. But what is the relationship between centimorgans and the physical measure of genetic distance, i.e. the base pairs? At small scale the relationship looks like a staircase: regions with very little recombination separated by “recombination hotspots”. The majorityofCOindeedoccurwithin∼25,0001-2Kblong recombination hotspots containing short recombination-triggering motifs. In human, the consensus sequence of the most common recombination-triggering motif is: CCNCCNTNNCCNC.Atlargerscale,oneobservesan increase in the recombination towards the telomeres where recombination hotspots tend to concentrate, and a comparative decrease in recombination around the centromeres, which are recombinationcold-spots. Recombinationvariesbetweenindividuals,withsex having a major effect: the recombination rate is typically higher in the homogametic than in the heterogametic sex. But even within sex, recombination is variable and this is in part geneticallydetermined.Ofnote,inwomenglobal recombinationrate(i.e.theaveragenumberofCO in a gamete) is positively correlated with reproductive success. It has also been hypothesized that domestication has lead to an increaseinrecombination. Linkageplaysacentralroleingenetics.Assoonas recombinant DNA technology permitted the development of numerous polymorphic DNA markers (f.i. microsatellites and SNPs) for human and other species of interest, dense linkage maps comprising thousands of markers were generated. These played an essential role as a first step towards assembling the complete genomic sequenceofthesespecies,and–mostimportantly - they permitted the mapping of thousands of genes underlying inherited defects and other monogenictraits,aswellhundredsofQuantitative Trait Loci influencing complex diseases and quantitativetraitsinhuman,animalsandplants(cfr. Chapter...). Chromosomalsexdetermination Inmanyspecies,thekaryotypediffersbetweenthe sexes. In placental mammals for instance, while females have n pairs of matched chromosomes, males have n-1 pairs of matched chromosomes (referredtoasautosomes),plusonepaircomposed of two distinct chromosomes. This unique chromosomal pair corresponds to the sex chromosomes or gonosomes. In mammals, females are the homogametic sex and their n-th matchedpairofchromosomescorrespondstotwo identical gonosomes referred to as “X”. The males are the heterogametic sex and their n-th unmatchedgonosomalpaircorrespondstoone“X” (identical to the sex chromosomes of the females) and “Y” chromosome. The Y chromosome is typicallyconsiderablysmallerthanitsXcounterpart. As females are XX, all oocytes carry a X chromosome.Ontheotherhand,halvethesperm cells carry a X and will give a female offspring on fertilization,whiletheotherhalvecarryaYandwill givemaleoffspringonfertilization. Chapter2:GenesinpedigreesPage6/14 120 million years ago, i.e. after the separation of the lineage that would give the metatheria (i.e. marsupials) and eutheria (i.e. placental mammals) from that of the prototheria (i.e. monotremes), a mutation in the SOX3 gene (encoding a transcription factor) located on a then regular autosome,generatedanalleleconferringmaleness, known today as the SRY gene. The corresponding chromosomes carrying the original SOX3 allele would become the X chromosome, while those carrying the newly born male-determining SRY genewouldbecometheYchromosome. How the size of the Y chromosome has been reduced during 120 million years of evolution, is a very interesting biological question. It is believed that mutations in genes in the vicinity of SRY created alleles that were beneficial for male fertility but deleterious for female fertility. Such newalleleswouldfarebestinthepopulation(and hence favored by natural selection) if they were confinedtomales.Onewaytoachievethiswould be to block recombination between such malebeneficial haplotypes on the proto-Y chromosome andtheirXcounterpart.Aswewillseelaterinthis chapter, chromosomal inversions are one such recombination-blocking mechanism. It is now assumed that successive inversions have isolated increasing portions of the Y chromosome from its formerXpartner,reducingtheportionalongwhich the ancestral partners are still able to recombine (whichisessentialforpropersegregationofthesex chromosomes during male meiosis) to the nowadays very short “pseudo-autosomal region” (PAR).TheprogressivedegradationoftheY-specific region of the Y chromosome is thought to be a directconsequenceofthefactthat,contrarytothe Xchromosome,theYneverproductivelyengagesin recombination with a homologous partner (as YY individuals don’t occur) which seem essential for repair, and probably also of the fact that deleterious mutations on the Y are systematically shielded from purifying selection by the X chromosome. As the Y chromosome progressively lost an increasing proportion of its original gene complement, the ratio between autosomal gene copies(diploid)andgonosomalgenecopies(diploid in females, haploid in males) started to differ betweenmalesandfemales,causingaproblemfor molecular systems requiring finely tuned componentdosage.Counteringthisdosageissueis thought to underlie the selective pressure that let to the evolution of “dosage compensation” mechanisms to achieve proper and equal autosomaltogonosomalbalanceinbothsexes.In mammals, dosage compensation involves two components. The first is “X-inactivation” in females (or “Lyonisation” in honor of Mary Lyon who discoveredthismechanism).Althoughfemalecells are diploid XX, for most genes on the X-specific portion of the X chromosome, only one copy is active the other one being epigenetically silenced. In eutherian females, cells of the epiblast chose oneoftheXchromosomestoremainactive,while theotheroneissilenced.Thischoiceistoalarge extent random, so that some cells will silence the paternal X while neighboring cells may have selected the maternal X for silencing. However, once the decision is made in an embryonic precursor cells, her choice is epigenetically transmittedtotheentirecloneofdescendingcells. Alleutherianfemalesarethusinessence“mosaics” (i.e. organisms comprising cells that derive from the same zygote yet differing in their genetic complement),andfunctionallyhaploidformanyof the genes on the X chromosome. This functional mosaicism of females is directly apparent in tortoiseshell cats. These always female cats are heterozygous “B(lack)/O(range)” for a pigmentation gene on the X. The black sectors correspond to cell populations having inactivated theXcarryingtheOallele,whiletheorangesectors correspond to cell populations having inactivated the X carrying the B allele. The additional white spots, characterizing so-called calico cats, are due tothe“Piebald”alleleatadistinctautosomallocus. OneofthekeyplayersoftheX-inactivationprocess is the XIST gene, which is the only gene that is expressed from the inactive X and not from the activeX.ThecorrespondingXISTtranscriptisone ofthebeststudiedlongnon-codingRNAgenes.It coverstheXchromosomefromwhichitisderived, thereby silencing it by a still rather obscure mechanism. In addition to its XIST coat, the inactive X chromosome is characterized by epigenetically inherited DNA methylation and chromatin silencing marks. The inactive X chromosomeadoptsacondensedheterochromatic conformation that is microscopically visible at the nuclearperipheryasaso-calledBarrbody. X inactivation equalizes expression levels from the X chromosome in males and females. Proper gonosome-to-autosomebalanceisachievedbythe second mechanism involved in dosage compensation,i.e.thedoublingofexpressionlevels of all genes on the X when compared to genes on autosomes. The mechanisms underlying this generalized X-specific boost of expression remains unknown. How does the SRY gene cause masculinization of thedevelopingfetus?Inamalefetus,theSRYgene is activated in somatic cells of the developing gonad, which consequently differentiate in Sertoli cells. Sertoli cells secrete signals which (i) cause the incoming germ cells to differentiate along the sperm rather than egg pathway, (ii) cause neighboring cells to differentiate into Leydig cells whichsecretethesex-hormonetestosteronewhich will cause masculinization of several organs, and Chapter2:GenesinpedigreesPage7/14 (iii) block the development of the Müllerian anlagen(anti-Müllerianhormone).Intheabsence of SRY (female fetus), the somatic cells in the developing gonad differentiate in follicle cells, (i) letting incoming germ cells develop along the egg pathway, and (ii) stimulating neighboring cells to differentiateintoestrogen-secretingthecacells. Inmammals,themalesexisheterogameticandthe female sex homogametic. In some organisms, however, the situation is opposite: males are homogametic and females heterogametic. The gonosomesarethenreferredtoasZandW,males beingZZandfemalesZW.Thissituationappliesin particulartobirds.AsintheXYsystem,theWhas a tendency to degenerate over time for what are assumedtobesimilarreasons.Intheseorganisms, thesex-determininggeneisnotSRY. (Cell-autonomoussexdifferentiationinbirds) (discoveryofthesexchromosomes) Geneticpolymorphismsandhaplotypeblocks.The descriptionofmeiosisandgametogenesisinformed us about the mechanisms that govern the transmission of chromosomes and genes from parents to offspring. It didn’t account for the differencesthatareobservedbetweenindividuals, andwhysomeofthesetendto“runinfamilies”. Heritable, distinctive features find their origin in the “genetic polymorphism” that characterizes outbred populations (in contrast to inbred strains in which all individuals are genetically identical or monomorphic). In human, the genome of two randomlyselectedgametes(forinstancethesperm cellandoocytethatgavebirthtoyou),differevery 6 ∼1,000 nucleotides, for a total of ∼3x10 differences. The number of difference per nucleotide between two randomly selected genomes(oraverageheterozygositypernucleotide siteforarandomlyselectedindividual)iscalledthe nucleotidediversity(π).Inmostdomesticanimals, the nucleotide diversity is slightly lower than in human,oftheorderof∼1/2,000to∼1/3,000.This difference is related to the population size (see 6 Chapter III). The ∼3x10 differences between two randomly selected genomes correspond mainly to SingleNucleotidePolymorphismsor“SNPs”.These are genome positions for which more than one alleleisobservedinthepopulation.MostSNPsare saidtobebiallelic:onlytwoallelesareobservedat appreciable frequency in the population. Biallelic SNPs are often characterized by their Minor Allele Frequencyor“MAF”,i.e.thepopulationfrequency of the rarest allele, which ranges from 0 to 0.5. MostSNPscorrespondtonucleotidesubstitutions, andthesecaneitherbetransitionsortransversions. ItisimportantwhendescribingSNPstoagreeona referencestrand,whethertheCrickortheWatson strand. Having done this, transitions are either a cytosine<->thymine or adenine<->guanine substitution.Transversionscorrespondtothefour possible purine<->pyrimidine substitutions. Although there are twice as many possible transversions as transitions, transitions are more thantwicemorecommonthantransversions. If one were to randomly sample a second pair of human genomes, their nucleotide diversity would likewisebe∼1/1,000.However,thecorresponding 6 ∼3x10 differenceswouldonlyoverlapinpartwith thoseofthefirstpair.ThetotalnumberofSNPsin thehumanpopulationisestimatedat>12million. The corresponding MAF are characterized by an exponential distribution: there are many SNP with lowMAFandfewerwithhighMAF.Thelatterare called “common SNPs”. As a matter of fact, approximately halve of the differences between two randomly selected human genomes correspondtoSNPswithMAF<1%.Somepeople refer to such rare SNPs as “mutations”, but we prefer to reserve this term for the process that generatespolymorphisms. Chromosomes sampled in the population are characterizedbyspecificcombinationsofallelesfor strings of adjacent SNPs, called “haplotypes”. Hence,ifoneconsiderstwoadjacentSNPs,oneC<>T and the other A<->C, the four possible combinations or haplotypes are C-A, T-A, C-G and T-G. The human genome and that of domestic animals is subdivided in “haplotype blocks”, i.e. segments measuring of the order of 20-Kb (hence comparable to the size of a typical gene although there is no coincidence between the limits of haplotypeblocksandgenes).Haplotypeblacksare typically characterized by 5-10 haplotypes accounting for a large majority of chromosomes encountered in the population. Haplotype blocks typically encompass tens to hundreds of SNPs, which could – in theory – assort in thousands of haplotype combinations. Five to ten haplotypes per block is thus very limited. This is because alleles at neighboring SNPs don’t assort independently. They tend to preferably associate in specific allelic combinations, a phenomenon called “linkage disequilibrium” (see Chapter III). Thelimitsbetweenhaplotypeblockscoincidewith the position of recombination hotspots where crossing-overs preferentially occur. Across recombination hotspots, haplotypes from adjacent blocks tend to assort independently, i.e. to be in “linkageequilibrium”. Thelevelofgeneticpolymorphism(measuredbyπ) that is observed in a population at a given time, reflects a balance between (i) the generation of new variation by the process of mutation, (ii) the loss of variants as a result of “genetic drift” and selection. As mentioned before, every newly formed zygote carries ∼100 de novo mutations. Thecorrespondingnewvariantiscalledthederived allele,byoppositiontotheoriginalancestralallele. The vast majority of the millions of de novo Chapter2:GenesinpedigreesPage8/14 mutationthatarehenceinjectedinthepopulation ateverygenerationwillremainveryrare,restricted to specific families. But the population frequency of a minority of them will – with time – progressively increase, to sometimes even completely supplant the ancestral allele, i.e. reach fixation.Randomdrift,i.e.therandomsamplingof alleles from the parents to produce the next generation,issufficienttoexplainthelossofmost de novo mutation, the ascent of a lucky few, and the resulting transient polymorphism that is observed in outbred populations. However, the faithofsomedenovomutationsisalsodetermined by their phenotypic effects, and the resulting selectiveadvantageor–moreoften–disadvantage that the confer to the individuals that inherited them(seeChapterIII). Substitutions (transitions and transversions) are nottheonlykindofgeneticpolymorphisms.There arealso(i)deletionsandinsertionsofoneormore nucleotides (“indels”), (ii) variable numbers of tandem repetitions (“VNTR” or “SSR”) of small motifs defining microsatellites or larger motifs defining mini- and regular satellites, (iii) copy number variations (“CNVs”), and (iv) chromosomal polymorphisms. We will describe those in more detaillaterinthischapter. Functional effects of genetic polymorphisms: codingandregulatoryvariants.Inmammals,most polymorphisms that affect one (SNPs) or a small number of nucleotides are probably selectively “neutral”ornearlyso.Indeed,mostofthemfallin poorly conserved genome segments located in intergenic regions or introns, as these represent the largest fraction of the genome. The faith of such neutral polymorphisms is primarily determinedbygeneticdrift. However, some genetic polymorphisms do have functional consequences. These include, in particular, “coding SNPs” (“cSNPs”) which are changing the amino-acid sequence of a protein, and “regulatory SNPs” (“rSNPs”) which are affectinggeneregulation. cSNPs include (i) missense variants causing aminoacid substitutions which can be conservative (the two alternative amino-acids belong to the same category: neutral, polar, positively charged, negatively charged) or non-conservative (the two amino-acids are of a different category), (ii) nonsensevariantswhichchangeatripletcodingfor an amino-acid in one of the three stop codons (“stop-gained”), (iii) frame-shift variants which by incorporatingordeletingbasepairscauseashiftin the reading frame resulting in the addition of a nonsensicalpeptideatthecarboxy-terminalendof a truncated protein. Nonsense and frame-shift variantsoftenrevealstopcodonsinnon-lastexons. This usually causes degradation of the corresponding transcripts perceived as abnormal and triggering “nonsense mediated RNA decay” (NMRD).SNPsinORFcanbesynonymous,i.e.not changing the amino-acid sequence. Most synonymousSNPsaffectthethirdbaseofacodon. Deletions or insertions of three or a multiple of three nucleotides in the open reading frame may alter the amino-acid sequence without causing a frame-shift. An allelic series of cSNPs in the myostatin (MSTN) geneunderliesthe“double-muscling”phenotypein cattle. Double-muscled animals exhibit a severe and generalized muscular hypertrophy. Doublemusclingisencounteredinseveralbreedsincluding Asturiana, Belgian-Blue, Maine-Anjou, Parthenaise, Charolais, Piedmontaise, etc. In all these breeds, double-musclingisduetomutationsthatcauseloss of MSTN function. MSTN is an autoregulatory peptide hormone of the TGFβ family that is secretedbyskeletalmuscleandinhibitsitsgrowth. AtleastsixMSTNloss-of-functionmutationshave been identified in cattle so far including a frameshifting 11-bp deletion in the third exon (p.D273RfsX13=c.818-828delATGAACACTCC), the substitution of the fifth of nine highly conserved carboxyterminal cysteines (p.C313Y) mediating an intramolecular disulfide bridge stabilizing the structure of the bioactive domain, three stop codons resulting from single nucleotide substitutions (p.Q204X, p.E226X, p.E291X), and a stop codon resulting from a complex insertiondeletion event (p.F140X=c.419426delTTAAATTTinsAAGCATACAA). In addition, a missense mutation (p.F94L) is segregating at high frequencyinLimousinsandislikelytocontributeto thepronouncedmusculardevelopment(evenifnot double-muscledperse)ofthisbreed. cSNPs may also alter the amino-acid sequence by perturbingsplicing.Such“splice-sitevariants”may destroy splice donor or acceptor sites, or/and activate cryptic splice sites. Splice-site variants cause exon skipping or exonification of intronic sequences, and this usually results in gross alterations of the amino-acid sequence. An interesting splice-site variant was recently describedintheMSTNgeneofBlonded’Aquitaine cattle.Blonded’AquitaineareaFrenchbeefbreed characterized by a pronounced muscular hypertrophy reminiscent of double-muscling. Initially, however, no obvious loss-of-function mutationswerefoundwhensequencingtheORFof the MSTN gene in this breed. Recently, French researchersfoundthattheMSTNmRNAofBlonde d’Aquitaine animals has a ...-bp insertion at a position coinciding exactly with the boundary between exons 1 and 2. They demonstrated that theinsertionisduetothe“exonification”ofapiece of intron 1, which is due to a point mutation that activates a cryptic splice acceptor site. The illegitimateexon... Chapter2:GenesinpedigreesPage9/14 Belgian-Bluesprovideuswithanotherexampleofa splice-site variant. Between 200x and 200x, breeders noticed an increase in the number of calves aged ∼6 months exhibiting signs of stunted growth or growth retardation. Using positional cloning (see Chapter IV), an A->G SNP (c.124-2A>G) affecting the splice-acceptor site (position -2) ofthefirstintronoftheRNF11genewasidentified as being the cause of ∼40% of cases. The splicesitevariantcauseseitherskippingofexon2(which introduces a frame-shift in the amino-acid sequence resulting in NMRD or at best a severely truncated protein), or the activation of a cryptic splice site seven base pairs downstream of the original one. This causes a 7-bp deletion in the mRNA, resulting in a frame-shift that likewise results in NMRD and a severely truncated protein. RNF11 is a component of the A20 complex that down-regulates NF-κB-mediated inflammatory response. Concomitantly, most animals that are homozygous for the c.124-2A->G mutation, were subsequently shown to die at a very young age from excessive and uncontrolled inflammatory responses. Regulatory SNPs or rSNPs affect cis-acting gene regulatoryelementscontrollingeithertranscription or a post-transcriptional process. A nice example of the first type of rSNP in farm animals is a G->A transitioninasilencerelementlocatedinthethird intron of the porcine IGF2 gene. The wild-type version of this element normally binds the ZBED6 trans-acting factor in post-natal striated skeletal muscle. The mutation abrogates this interaction leadingtooverexpressionoftheIGF2growthfactor and a concomitant ∼3% increase in muscle mass (∼3kgs of additional muscle per 100Kg carcass). This mutation is present in many European pig breeds in which its frequency has increased as a result of selection for meatiness. It is noteworthy that it was found on a haplotype that differs at ∼1/100 nucleotides from the other haplotypes foundinEuropeanpigs,anucleotidediversitythat is an order of magnitude larger than expected withinspecies.Themutationactuallyoccurredon a haplotype that is found at high frequency (with and without the mutation) in Chinese pig breeds. European and Chinese haplotypes derive from different sub-species of wild boars, which were independently domesticated in Europe and China ∼10,000 years ago. The European and Asian wildboarsub-speciesderivedfromacommonancestor much earlier, probably millions of years ago, and this explains the high degree of divergence of the corresponding haplotypes. Chinese pigs were imported in the 19th century to improve the leanness of the then extremely fatty European breeds.Chinesebreedremainofinteresttodayfor the pig breeding industry because of their exceptionalfertility. TexelsheepprovideaniceexampleofarSNPthat operates at the post-transcriptional level. Texel sheep are quite famous for their pronounced muscular development. It is now known that the meatinessofTexelsheepisdueinpart(∼25%)toa G->A transition in the 3’UTR of the ovine MSTN gene. This substitution creates an illegitimate target site for two microRNAs that are highly expressedinskeletalmuscle:miR1andmiR206.As a result, MSTN mRNA with the mutation are destabilizedandtheirtranslationinhibited,causing a reduction in circulating MSTN protein thereby increasingmusclemass. Monogenic traits. We have seen how genes are transmitted from parents to offspring via haploid gametes, as well as the fact that genes are encountered in the population in distinct forms or alleleswhichmaydifferfunctionally.Thesefactors explain to a large extent why individuals differ in their “phenotype” and how these phenotypes segregateinfamilies.Forafirstclassofphenotypes, the differences between individuals are fully explained by allelic variation at one locus. Such phenotypes are said to be monogenic. Most geneticdefectsbehavelargelyasmonogenictraits. A second class of phenotypes, referred to as oligogenic, are determined by allelic variation at a small number of loci (typically two to three). Coat color variation in domestic animals is typically oligogenic. Finally, most phenotypes are determinedbyalargenumberofgenes(referredto as “polygenes”), as well as by non-genetic environmental and even stochastic effects. Such traits are referred to as “complex” or “multifactorial”. Production traits in livestock are typically complex, as are most of the common diseases afflicting human, including cancer, blood pressure,diabetes,etc. [Boxx:pedigreedrawingandallelicnomenclature] The simplest category of monogenic traits are the Mendelian traits. They are characterized by two phenotypicclasses(f.i.affectedbyageneticdefect versus non-affected) and segregate according to one of five basic Mendelian inheritance patterns: autosomal recessive, autosomal dominant, X- (or Z)-linked recessive, X- (or Z-) linked dominant, and Y-(ofW-)linked. As their name implies, autosomal recessive traits are due to recessive alleles at autosomal (or exceptionally pseudo-autosomal) loci. We will refertoarecessivealleleinlowercase(sayd),and use the “+” sign to label the corresponding “wildtype” allele. The defining feature of a recessive alleleisthatitsphenotypiceffectismaskedin+/d heterozygous or carrier individuals, which are “wild-type” in appearance. The trait is only apparentind/dhomozygousindividuals. The segregation of an autosomal recessive trait in an outbred pedigree is characterized by the following distinctive features: (i) affected Chapter2:GenesinpedigreesPage 10/14 individuals are usually born to unaffected but carrier parents, (ii) individuals of both sexes are equally affected, (iii) after birth of an affected offspringfromunaffectedparents,¼ofsubsequent full-sibs are affected, and (iv) an enrichment of parental consanguinity. Why does parental consanguinity increase the incidence of offspring affected with an autosomal recessive condition? All individuals carry of the order of ∼5 deleterious recessives. For any given gene, however, the frequencyofdeleteriousallelesinthepopulationis -3 verylow,oftheorderof10 .Thustheprobability that an offspring, from a mating between individuals that are not closely related, would be -3 affectedbyoneoffiveconditionsis∼5x½x10 ≈ 0.0025. However, the probability for an offspring oftwocousinstobeaffectedforatleastoneofthe ∼10 diseases carried by the two shared grand6 10 parents is ∼1-(1-(½) ) ≈ 0.15. In addition, this consanguinous offspring could be affected by a recessive condition not inherited from shared grand-parents. Why are some alleles recessive? Recessive alleles are typically “loss-of-function” mutations in genes forwhichonefunctionalcopyissufficienttoensure correct cellular/organismal operation (“haplosufficiency”).Nonsenseandframe-shiftmutations, forinstance,usually“knockthegeneout”(creating so-callednullalleles),butifthecopyinheritedfrom the other parent is normal, the heterozygous individualwillbehealthy. [Theoryofrecessivityforenzymes] The long list of genetic defects typically involves such recessive, loss-of-function mutations. The OMIM database (On-line Mendelian Inheritance in Man; www....) reports x such human defects. Causal genes and mutations have been identified for ... of these. The OMIA database (On-line Mendelian Inheritance in Animals; ...) reports x such animal defects. Causal genes and mutations havebeenidentifiedfor...ofthese. Autosomal dominant traits are due to dominant alleles at a (pseudo-) autosomal locus. We will typically refer to a dominant allele in upper case (say D), and use the “+” sign to label the corresponding “wild-type” allele. The defining feature of a dominant allele is that its phenotypic effect is apparent in +/D heterozygous or carrier individuals, as well as in D/D individuals. Only homozygous+/+individualshavenormal,wild-type appearance. In outbred populations, autosomal dominanttraitarecharacterizedby(i)thefactthat affected individuals have at least one affected parent, (ii) that they equally affect individuals of bothsexes,(iii)thattheyareequallytransmittedby both sexes, and (iii) one in two affected amongst offspringofaffectedparents(ifrarecondition). Alleles may be dominant for several molecular reasons. Mutations which result in a “gain-offunction” are typically dominant. An interesting example of such a gain-of-function is the so-called Pittsburghalleleinthehumanproteaseinhibitor1 gene (PI). PI encodes α1-antitrypsin, a protease whichnormallytargets(therebyinhibiting)elastase. PI null mutations underlie recessively inherited emphysema due to the exacerbated elastase activity.ThePIPittsburghallele,observedina14yearoldboy,hasaGtoTtransversionthatcauses a missense methionine to arginine substitution at position 358 (p.Met358Arg). As a result, the mutant α1-antitrypsin loses its affinity for elastase but targets coagulation factors, thereby causing a dominantly inherited bleeding disorder. Other examples of alleles that act dominantly as a result of a gain-of-function are mutations that cause membrane-bound receptors to become constitutively active. +++ Some “loss-of-function” mutations are dominant. Amongst these are “dominant negative” mutations in genes encoding subunitsofmultimericproteins.Dominantnegative mutations abrogate the normal function of the subunit but not its ability to integrate in the multimeric complex. When integrated, the mutated subunit acts as a poison, inactivating the complex. Imagine a complex like hemoglobin, comprising two alpha and two beta-subunits. An individual carrying a dominant negative α-globin allele would only have 25% of its hemoglobin tetramersdevoidofdefectivesubunits,whichmay causedisease.Examples:+++ Denovomutations. X-(orZ-)linkedrecessivetraitsareduetorecessive alleles of genes mapping to the X- (or Z-) specific portion of the gonosomes (i.e. not the pseudoautosomal region). While both sexes can be affectedinprinciple,individualswiththecondition are nearly exclusively heterogametic (for rare conditions). Indeed, hemizygous heterogametic individualsonlyrequiretheirsinglegenecopytobe mutant to be affected, while homogametic individualsneedtwomutantcopiestoexpressthe condition. Assuming that the frequency of the recessivealleleisp,thefrequencyofthecondition 2 is ∼p in heterogametic individuals yet ∼p in homogametic ones. For rare conditions, the resulting 1/p ratio is thus very large. Affected heterogametic individuals inherit the mutation from their usually asymptomatic carrier homogametic parent, which will often have affected heterogametic relatives. Affected homogametic individuals can result from the mating between an affected heterogametic and carrierhomogameticparent. Ofnote,ifthegeneissubjecttoX-inactivation(asis thecaseforthemajorityofgenesontheX-specific region),∼halvethecellsofa+/dcarrierXXfemale will have inactivated the + allele and therefore be Chapter2:GenesinpedigreesPage 11/14 deficient for the corresponding gene. If the gene productissharedbetweencells,thismayhavelittle if any consequence for the operation of the cell. However, if the encoded function is cellautonomous,∼halvethecellswillbedysfunctional. In such cases, mosaic XX females will have a mixture of healthy and deficient cells. This is for instance observed in women carrying loss-offunction mutations in the ... gene (causing anhidroticectodermaldysplasiaind/-menandd/d women), which exhibit dermal patches devoid of sweat glands adjacent to normally operating skin sectors. An example of a Z-linked recessive trait in animals is“Z-linkeddwarfism”inpoultry.... X- (or Z-) linked dominant traits are due to dominantallelesofgenesmappingtotheX-(orZ-) specific portion of the gonosomes. In outbred populations both sexes are affected, yet homogametic individuals are more often affected thanheterogameticones.Indeed,ifthefrequency of the dominant allele in the population is p, heterogametic individuals have a probability ∼p of being affected, while the probability for homogametic individuals is ∼2p(1-p). Hence, the ratio is 2(1-p) ∼ 2 for rare conditions. For rare conditions, offspring of affected homogametic individuals have a ½ probability to be affected, irrespective of their sex. On the contrary, all homogametic offspring of affected heterogametic parents will be affected while none of the heterogametic offspring will. As a result of Xinactivation+/Dfemalesareoftenmoremildlyand morevariablyaffectedthanmales. An example of a Z-linked dominant trait is “slow feathering”inpoultry.... Y-(W-)linkedtraits,involvinggenesmappingtothe Y-(W-)specificportionofthegonososmes,areonly expressed in the heterogametic sex and transmittedfromheterogameticparentstoalltheir heterogametic offspring. The only known example ofaY-linkedtraitis... For some supposedly Mendelian traits, the proportionofaffectedindividualsmaydeviatefrom expectation. This may for instance be caused by incompletepenetrance.Thepenetranceisdefined astheproportionofindividualsofagivengenotype that expresses the trait. For a regular autosomal recessive trait the penetrance of genotypes +/+, +/dandd/d(where+isthewild-typealleleandda recessive disease causing allele) is 0, 0 and 1. For some traits (including genetic defects) not all d/d individuals are affected. Such traits are said to be characterized by incomplete penetrance. Incomplete penetrance results from the fact that individuals with same genotype, f.i. d/d may have have been exposed to distinct environments or differ at other genes influencing disease outcome. Penetrance may differ between sexes or evolve with age, defining distinct liability classes. As an example, the penetrance of the autosomal dominant Huntington’s chorea in human is ..., ... and1for+/Dindividualsagedrespectively20,40 and70yearsofage. Deviations from expected Mendelian proportions also occur in the case of lethality of one of genotype. One such example is the tail-less phenotype of so-called Manx cats, caused by the dominant ... allele. Contradicting Mendel’s law, thematingoftwo+/...Manxcatsyields2/3rather than ¾ affected cats. This is due to the fact that the¼.../...dieinuterofromseveredevelopmental anomalies. Genes underlying Mendelian traits may be characterized by allelic heterogeneity, i.e. there maybemultipledistinctmutationsinthegene.In some instances, the different mutations will cause the same phenotype. One such example is “double-muscling” in cattle involving an allelic series of loss-of-mutations in the MSTN gene (cfr. above). In other instances, distinct mutations in the same gene may cause different phenotypes. Example... compoundheterozygotes Mendelian traits are typically binary, i.e. characterized by two distinct phenotypic classes. Somemonogenictraits,however,arecharacterized by more than two phenotypic classes. This is for instance the case when the phenotype of heterozygotes differs from that of either homozygotes, which can be due either to semidominant or co-dominant alleles. Semi-dominant alleles cause the heterozygotes to have a phenotype, measured on a continuous scale, intermediate between the two heterozygotes. As an example, the muscular development of cattle carrying one copy of MSTN loss-of-function alleles (causing full-blown double-muscling in homozygotes),isslightlyhigher(xresidualstandard deviations) than that of homozygous wild-type animals. MSTN loss-of-function alleles are therefore said to be semi-dominant or also partial recessives. Alleles are said to be codominant when heterozygotes express the phenotypes associated withbothalleles.Examplesofcodominancecanbe foundinthebloodgroups.ThehumanABOblood group system for instance, is the manifestation of three alleles of a gene (I) encoding a glycosyl A transferase. The I allele catalyzes the addition of N-acetylgalactosamine(=Aantigen)toaprecursor carbohydrate present on the surface of red blood B andothersomaticcells.TheI allele,differingfrom theIAalleleatfouramino-acidpositions,catalyzes the addition of galactose (=B antigen) to the the O sameprecursor.TheI alleleisanullalleleofthe Chapter2:GenesinpedigreesPage 12/14 transferase. The added N-acetylgalactosamine groups can be detected using anti-A antibody, while the galactose residues are evidenced using AB anti-B antibody. Red blood cells from I I heterozygous individuals simultaneously express AA boththeAandBantigenontheirsurface,fromI I AO BB andI I individualsonlytheAantigen,fromI I and BO OO I I individuals only the B antigen, and from I I individualsneithertheAnortheBantigen.[MHC] Another example of codominance is the roan phenotypeincattle.+++ Oligogenictraits When the phenotypic differences between individualsreflectallelicvariationatmorethanone yet a small number of loci, the trait is said to be oligogenic. Oligogenic traits can be classified according to the number of phenotypic classes (two, three or four). Oligogenic traits with two phenotypic classes involve complementation, suppressionandduplication,withthreephenotypic classes epistasis, and with four phenotypic classes modifiergenes. Complementation.Thesametraitmayreflectallelic variation at distinct loci in distinct families or populations. Such traits are said to be characterized by “locus heterogeneity”. As an example,inheriteddeafnessinhumanmaybedue torecessiveloss-of-functionmutationsinatleastx genes. This explains why deaf couples often have children with perfectly normal hearing. If the deafnessoftheparentsresultsfromhomozygosity for null alleles at two distinct loci, all children will be double carriers but have a functional allele at each locus therefore have normal hearing. The outcome whereby the mating between mutant parents yields 100% wild-type offspring is referred to as complementation. Complementation tests play an essential role in the genetic dissection of metabolicpathwaysinmodelorganisms.Imagine thataphenotype..explainwithexample]. Suppression. Alleles at some loci (say B) may suppress the effect of mutants alleles at another locus (say A). The suppressive alleles may be dominant or recessive. Assume an F2 generation segregatingattwolociAandB(F1genotypeAaBb). Assume also that the mutant phenotype depends ontherecessiveaallele.IntheF2generation,the proportionofmutantindividualswillbe3/16(aaB)incaseofrecessivesuppression,and1/16(aabb) incaseofdominantsuppression.Examples... Duplication. Some organisms contain more than two copies of all or par gene. This is particularly common in polyploid species, whose genome has undergone duplication or more in an ancestral species. Epistasis. Imagine a locus with two alleles A and a underlyingabinarytrait,animalshavingphenotype “A” (genotype A-) or “a” (genotype aa). A distinct locus (say B, with alleles B and b) is said to be epistatic(litterally“standingupon”)withrespectto locus A, if individuals with genotype B- (in which casetheepistasisissaidtobedominant)orbb(in which case the epistasis is said to be recessive) exhibit a distinct, third phenotype (say “B” or “b”) maskingeitherofthetwophenotypesdetermined by the A locus. In a F2 generation obtained by intercrossing double heterozygous F1 parents (genotypeAaBb),thethreephenotypes(A,aandB or b) are expected in proportions 3:1:12 corresponding to genotypes A-bb:aabb:--B- (dominant epistasis), or 9:3:4 corresponding to genotypes A-B-:aaB-:--bb (recessive epistasis). A well known example of recessive epistasis in domesticanimalsistheinteractionbetweenthe“X” and“E(xtension)”lociinlabradorretrievers.The“X” locus is characterized by the ... and ... alleles, explainingtheoccurenceofblack(genotype...)and brownorchocolate(genotype...)labradors.TheX locuscorrespondstothe...gene....The“E”locus isalsocharacterizedbytwoalleles:...and...Dogs with ... genotype will either be black or chocolate depending on their genotype at the ... locus. However labradors of ... genotypes will be yellow or golden irrespective of their genotype at the .. locus.TheElocuscorrespondstothe...gene.... Modifiergenes. Sex Polygenicinheritance(heritability) Parentalimprinting(callipyge) Mitochondrial:faithofpaternalmitochondria Epigeneticinheritance Chromosomal anomalies (aberrant euploidies, aneuploidies, isodisomies; translocations, inversions,duplications) Chapter2:GenesinpedigreesPage 13/14 Chapter2:GenesinpedigreesPage 14/14