Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
DNA Science A Hands-On Workshop Illustration from the booklet, Alzheimer’s Disease: Unraveling the Mystery, available from the ADEAR Center. Illustrator: Christy Krames. Spring 2007 Dr. Sarah C.R. Elgin Dr. Mark Johnston Dr. Elena Gracheva Dr. Kelly J. Wright Supported by a grant from Howard Hughes Medical Institute to Washington University Lab manual by Dr. April Bednarski 2 Table of contents: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 Laboratory Schedule 4 Lab safety & Learning to Use Micropipeters 5 Spooling purified DNA 6 Models of DNA Structure 11 PCR and genetic variation in humans. Alu. Part 1. 15 PCR and genetic variation in humans. Alu. Part 2. 23 Sterile techniques and transforming bacteria with DsRed. 25 Learning to clone DNA. Part 1 31 Learning to clone DNA. Part 2. 39 Learning to clone DNA. Part 3. 43 Chymotrypsin enzyme activity assay. 48 Yeast genetics 53 Reebops: Inheritance of traits. 59 Can sunscreen stop mutations? 65 Learning to use bioinformatics tools. 71 Glossary. 95 Website list for Bio 280 101 Genetic glossary 104 3 Laboratory Schedule Biology 280 Spring 2007 Date 1/19 1/26 2/2 2/9 2/16 2/23 3/2 3/9 3/23 3/30 4/6 4/13 4/20 4/27 Lab Lab Safety DNA spooling DNA models Alu PCR, part 1. Transforming bacteria with DsRed, part 1. Alu PCR, part 2. Transforming bacteria with DsRed, part 2. Yeast cloning, part 1. Yeast cloning, part 2. Yeast cloning, part 3. Chymotrypsin activity assay. Intro to bioinformatics. Tour of the GSC. Yeast genetics. Reebops. Sunscreen, part 1. Sunscreen, part 2. Bioinformatics: DNA sequence comparative analysis. Bioinformatics: protein structure. Bioinformatics: human disability. Paper reports. 4 Lab safety & Learning to Use Micropipeters Pipetmans™∗ are instruments used to accurately transfer small volumes (1µl to 1ml) of solution. Because of their accuracy, ease of use, and convenience in sterile techniques, they are practically universal lab tools. In this week’s lab, you will first learn how to properly use these instruments. After you feel comfortable accurately transferring small volumes onto a filter paper and from tube to tube, you will learn how to transfer liquids into a gel. These techniques will be used throughout our course. Use the picture below and the pipetmen on your desk to identify the following parts. Label each part. Barrellower The working end of the pipetman; a disposable tip is seated on the end of the barrel before each use. The part located at the top that is pressed and released to withdraw and expel liquid. Tip ejector buttonUsed to remove a tip from the barrel without direct handling of the disposable tip. Volume settingDisplays the volume the pipetman is currently set to deliver. Volume adjustment knobRotated to change the digital volume setting. Plunger- “Pipetman” is a brand name. The generic name for this instrument is “micropipet". In a research lab you are likely to hear these terms interchangeably. ∗ 5 At your lab bench you will find three pipetmans; each one is appropriate for a specific volume range. If you look at the dot on the top of the plunger of each pipetman, you will see a number that represents the maximum volume, in microliters (µl), that can be transferred by that pipetman. The minimum volume appropriate for each pipetman is typically ten percent of the maximum. The dot on the plunger is also color-coded and matches the color of the disposable tips used with that pipetman. The table below shows the volume ranges, expected accuracies, and the appropriate tips for each of the pipetmen you will be using. In most experiment, accuracy is important when transferring small volumes of liquid with a pipetman. A researcher needs to be sure that she is transferring the volume desired with a reasonable degree of accuracy. The research can be confident of this, provided two conditions are satisfied. First, the pipetman must have been calibrated and tested for accuracy (this is usually done on a yearly or semi-yearly basis). Second, the research must be using the pipetman properly. Pipetman P-1000 P-200 P-20 Volume Range (µl) 100-1000 20-200 1-20 Accuracy Disposable Tip Color ±10µl ±1µl ±0.5µl Blue Yellow Yellow Exercise 1: Familiarization with the pipetman/How to draw up and eject liquids Pick up a P-20 and se the volume setting for 10µl (reading from the top to the bottom it should be ‘100’). Push the plunger down and notice that there is a point that the plunger becomes more resistant and requires more effort to push further. This point is called the “first stop”. Notice that the plunger can be pushed well beyond the first stop until it reaches the “second stop”. Be sure you can feel the first stop point and notice how far the plunger travels before reaching this point. Reset the pipetman to 1.0µl (‘010). Again push the plunger to the first stop. You should notice that the plunger travels a much shorter distance before it reaches the first stop than it did when the pipetman was set for 10µl. 1. To transfer solution using a pipetman, set the volume dial to the desired volume by rotating the volume adjustment knob. Note: the volume setting should never be adjusted above the maximum volume specified for a particular pipetman. The maximum volume is the volume shown on the plunger. The table below shows how the volume display looks for each pipetman when it is set at its maximum volume. 6 Pipetman P-1000 P-200 P-20 Maximum Setting 1 0 0 2 0 0 2 0 0 2. Seat a disposable tip on the pipetman by placing the end of the barrel into a tip and pressing down firmly. 3. Push the plunger to the first stop and place the tip into the solution you are transferring. Slowly release the plunger to draw the desired volume of solution into the tip. Note: If you release the plunger too fast it will draw an inaccurate volume and plash solution into the barrel. Getting solution into the barrel can damage a micropipet. 4. Place the tip in the tube that is to receive the solution. Press the plunger to the first stop to expel the solution and then press to the second stop to blow out any residual fluid. 5. Continue to hold the plunger down until you withdraw the tip from the tube to avoid drawing the solution back into the tip. 6. Depending on the situation, the same tip may be sued again. In many cases, however, it is necessary (and vitally important) to use a new tip for each transfer. When you wish to eject the tip you are using, place the tip over the appropriate waste container and press the tip ejector button. If the tip is difficult to eject, it is likely that you are jamming the tips onto the pipetman harder than necessary. Exercise 2: Transferring small volumes with a pipetman Obtain a 4 x 6-inch piece of blotting paper and a microcentrifuge tube containing 2X loading dye. Following the protocol above, use a P-20 to spot the following volumes of dye directly onto the paper in a linear order: 1µl, 3µl, 5µl, 7µl, 10µl, 15µl, and 20µl. Spot each volume twice to check your consistency. Compare the spots you made with those of the instructor. If they don’t look right, check with your instructor to determine the cause. Exercise 3: Transferring volumes between tubes This experiment simulates manipulations that would be performed when mixing different volumes of several solutions. Real experiments using these 7 techniques will be done during this lab and for the rest of the semester. These include a bacterial transformation and preparation of bacterial plasmid DNA. 1. Obtain two microcentrifuge tubes and label them A and B. 2. Add the appropriate volumes of solutions I-IV to each tube following the table below: Tube A B Solution I 100µl 150µl Solution II 200µl 250µl Solution III 150µl 350µl Solution IV 550µl 250µl 3. You probably noticed that you have added a total of 1000µl to each tube. To check that you pipeted accurately, set your P-1000 to 1000µl and withdraw the solution mixture from the tube. Does the mixture just fill the tip? If there is solution left in the tube or air in the end of the tip, there may be a problem with your technique. If so, check with your instructor. Exercise 4: Loading small volumes into a gel On your bench you will find clear circular plates with a clear gel at the bottom. These plates also have a small amount of buffer or water in them so take care not to tilt the plate and pour out the liquid. If you look closely at the gel you will also notice that there are several small slits (wells) in the gel arranged in a straight line. Next week you will be loading real human DNA into the wells of a gel that is connected to a power supply which will separate the DNA by size. We will talk more about this later. For now we want you to practice putting a sample into a gel. 1. On your bench you will find tubes labeled “G” for gel. These tubes have 30µl of dye and water in them. 2. Load the following amounts of liquid into the wells in sequential order: 5µl, 10µl, 15µl. Tips: Draw up liquid by pushing to the first stop before inserting pipetman into the liquid you want to draw up. Again, draw up the liquid into the tip slowly to avoid any air bubbles in the tip. Any air bubbles can seriously affect the loading of a gel. Use your other hand to steady the pipetman as you insert the tip into the well of the gel. DO NOT jam the tip into the well as this will puncture the well and your sample will leak out underneath the gel. Instead, gently rest the tip against the side of well and slowly press the plunger allowing the sample to be ejected into the well. DO NOT press the plunger to the second stop (which means that you will still see some sample remaining in your tip) because this forces air into the well which can cause the sample already in the gel to be blown out. 8 Spooling Purified DNA INTRODUCTION Each of the 46 chromosomes in one of your cells contains one DNA molecule that is an inch or two long, though it is far too slender to be seen with any but the most powerful electron microscopes. If we could enlarge one of these DNAmolecules enough so that we could see it — let’s say so that it was about the same diameter as one of the hairs on your head — we would find that it was several miles long! In this exercise, we will take advantage of this long, thin, “threadlike” shape of DNA molecules to “spool” them, which is to say, we will wind them up on a wooden stick like a piece of thread. As you probably know, cotton fibers (which are individually a couple of inches long and quite thin) can be combined into one long, continuous thread, because they tend to stick to one another and line up side by side. The same thing can happen when DNA molecules come out of solution — if we pull on them from one end. We will do this by slowly twisting a stick in the region where DNA is beginning to precipitate. Each DNA molecule that is initially caught and wound around the stick will catch and pull on several other molecules, each of which will then catch and pull on others. Thus, if we work carefully, we can wind all of the DNAmolecules in a test tube into one long, continuous thread. Although DNAmolecules are so thin that you couldn’t possibly see one of them with your naked eyes, if you wind up many such molecules together (as you will in this exercise), the DNA becomes visible, and its properties can be studied. MATERIALS For each group of four students: 1 test tube of DNA 1 test tube of alcohol 1 wooden stick 9 PROCEDURE 1. Observe the alcohol and DNA solutions in the test tubes. Can you tell which one is the DNA? How? 2. Uncap the tubes and hold the tube that contains the DNA at an angle. 3. Carefully transfer the alcohol from its tube into the DNA tube. Pour very slowly, so that the alcohol does not mix with the DNA solution and stir it up. 4. Gently insert the wooden stick through the alcohol layer to the interface where the two liquids meet. Twirl the stick gently. 5. Now slowly lift the stick from the tube and observe the material clinging to it. How long a fiber can you pull from the tube? 10 Models of DNA Structure Stick Model Use the DNA models provided to answer the questions in your lab report (next page). Computer Tutorial Turn on your computers and open the Bio 280 home page: http://www.nslc.wustl.edu/courses/Bio280/bio280.html Open the ‘course contents’ section using the password. Click on the site DNA Structure by Eric Martz. Open item #1, Double helix by element: base pairs, hydrogen bonding. Click on the boxes at the bottom of the screen to highlight the Backbone, Bases, and H-bonds. Clicking on either the X Spin or the Y Spin box enables you to spin the molecule for different perspectives. (Remember to Reset the image after each manipulation.) Doing a Y spin in the Thick mode is a good way to see that the sugar-phosphate backbone is on the outside, the base pairs on the inside. Doing a Y spin in the Space fill mode (done by clicking on the both the Y Spin and the Space fill boxes) is a good way to see the major and minor grooves. Using the Y spin, follow the position of the major and minor grooves and check strand polarity. Use this tutorial to answer the questions in your lab report. 11 12 Bio 280 Lab Write-up: DNA Models Name____________ Date of Investigation: / / Investigation 1: DNA stick model In groups of two or three examine the DNA model on your bench. As you answer the following questions you should gain a better familiarity with the structure of DNA. 1. By comparing the DNA model before you with text-book structures, determine which colors on the model represent which atoms. Black=______Red=_______White=_____Yellow=_____ Blue=_____ 2. What are two quick and easy ways to distinguish A-T and G-C base pairs on this model? 3. How can purine and pyrimidine members of a base pair be distinguished? 4. Using a metric ruler, measure the distance (in cm) from the middle of one base pair to the middle of the next. distance = ________ cm a.) What is the scale of this model; i.e. how many angstroms are represented by one centimeter on the model? b.) What is the magnification factor; i.e. how many times bigger is this model compared to a real DNA molecule? 5. The average gene codes for a polypeptide approximately 333 amino acids long. Knowing that it takes 3 base pairs worth of DNA to code for each amino acid, what fraction of a typical gene is represented by this model? 6. If a typical gene’s worth of DNA was constructed on the scale of this model, approximately how many meters high would the model stand? 7. How many base pairs are there per revolution of this double helix? Calculate the degrees of rotation between adjacent base pairs in DNA. 13 8. Examine the deoxyribose units associated with a single base pair in the model. How do these sugars reveal the antiparallel nature of the sugarphosphate backbones? 9. What is the A+T/G+C ratio for this model? What is the A+G/C+T ratio? Why are they different? Investigation 2: Computer-based models After reviewing the basic B-form DNA structure, select an A-T base pair (click on the AT box). Moving the mouse while depressing the shift key allows you to adjust the size of the base pair to the field of view. Rotate the base pair (by moving your mouse) to achieve an ‘edge-on’ view of one ring structure at a time. 1. Are the purine and pyrimidine rings planar? Is the sugar ring planar? 2. Repeat your observations for the G-C base pair (by clicking on the GC box). Are the purine and pyrimidine rings planar?Is the sugar ring planar? Click Back on the tool bar to reach the DNA Structure (by Eric Martz) page. Click on item #4, Ends, Antiparallelism. Examine the ends of each strand. Identify the deoxyribose sugar backbone of each strand. Look at the deoxyribose sugar rings at the end of each strand. 14 PCR and Genetic Variation in Humans Alu Part 1 Adapted from Cold Spring Harbor Laboratory DNA Learning Center by the Washington University Science Outreach Program Background: Since its development two decades ago, polymerase chain reaction (PCR) has become a tool used almost universally by geneticists. PCR is used to quickly amplify, or create millions of copies of, specific regions of a particular segment of a DNA strand. PCR allows researchers to study very small amounts of DNA without resorting to laborious cloning procedures. The technique has had an impact in many areas of biology and has greatly facilitated the field of forensics. Although the DNA from different individuals is more alike than different, there are many regions of the human chromosomes that exhibit a great deal of diversity. Such variable sequences are termed polymorphic, meaning many forms. These polymorphic sequences provide the basis for genetic disease diagnosis, forensic identification, and paternity testing. In this lab exercise, PCR will be used to amplify a nucleotide sequence from human chromosome 8 to look for an insertion of a short DNA sequence called Alu, within the tissue plasminogen activator (TPA) gene. Alu sequences are thought to be derived from the 7SL RNA gene. This gene encodes for the RNA component of the signal recognition particle which functions in protein synthesis. Alu elements are approximately 300-bp in length and derive their name from a single recognition site for the endonuclease Alu I, located near the middle of the Alu sequence. An estimated 500-2000 Alu elements are thought to exist, and most of them are found in the human genome. A few have appeared relatively recently, within the last one million years, and are not present in all individuals. One such Alu element, called TPA-25, is found within an intron of the TPA gene. This insertion is dimorphic, meaning that it is present in some individuals and not in others. PCR can be used to screen individuals for the presence, or absence, of the TPA-25 insertion. In this experiment, oligonucleotide primers flanking the insertion site will be used to amplify a segment of the TPA gene. The expected product of the PCR reaction will be a 400-bp fragment when TPA-25 is present and a 100-bp fragment when it is absent. Each of the three possible genotypes – homozygous for presence of TPA-25 (400-bp fragment only), homozygous for the absence of TPA-25 (100-bp fragment only), and heterozygous (400-bp and 100-bp fragments) – will be able to be determined following electrophoresis of our PCR reaction products in an agarose gel. 15 Description of Lab Exercise Our source of template DNA will be a sample of several thousand cells obtained from inside your mouth. The cells will be suspended in a solution containing Chelex, a resin that binds metal ions (and that can also inhibit a PCR reaction). The cells will be lysed, or broken open, by boiling and then centrifuged to remove cell debris. A sample of supernatant containing your genomic DNA will be mixed with Taq DNA polymerase, oligonucleotide primers, the four deoxynucleotides (A, T, C, G), and the cofactor MgCl2. Temperature cycling will be used to denature the template DNA, anneal the primers, and extend a complementary DNA strand. The size of the amplification product(s) will depend upon the presence or absence of the Alu insertion at the TPA-25 locus on each copy of chromosome 8. We will compare the genotypes of different individuals by loading aliquots of your amplified samples into wells of an agarose gel. The lab instructor will load DNA size markers into other wells. The bands of the markers will be used to estimate the sizes of your PCR products. Following electrophoresis, amplification products appear as distinct bands in the gel – the distance a DNA fragment travels from the starting well is inversely proportional to its molecular size. The larger TPA product (containing the 300-bp Alu element) will not migrate as far from the well as the smaller TPA product (missing the 300-bp Alu element). In a successful experiment, one or two amplified bands will be observed, indicating that you are either homozygous or heterozygous for the TPA-25 locus. Experimental Protocol NOTE: PCR is extremely sensitive to contamination. A few foreign cells in your preparation can ruin your results. For this reason, it is extremely important to maintain sterility at all times while conducting the following procedures. A. Cell Extraction and DNA Isolation 1. Put on safety glasses and latex gloves. 2. Find one 1.5 mL microcentrifuge tube containing 10% Chelex (pronounced KEE-lex) and use a permanent marker to label the top or side with your 3 letter initials. (Note that Chelex is not actually a solution, but a slurry of resin coated beads in water. You will see the beads settled at the bottom of the tube.) 3. Your instructor will pass around a glass tube containing flat wooden sticks. These sticks have been autoclaved and are sterile. Your instructor will demonstrate how to remove a stick and keep it sterile. Open the glass tube and gently tap out a single wooden stick. 4. With the sterile end of your wooden stick, thoroughly scrape the inside of your cheeks. 16 5. Open your Chelex tube and put the end of your wooden stick (with cheek cells) into the tube. Gently twirl or stir your stick to dislodge the cells into the tube. Close the Chelex tube. Discard your wooden stick in the trash. 6. Finger vortex the Chelex tube for 10 seconds. 7. Place your 1.5 mL microcentrifuge tube in a floating rack in the boiling water bath for 10 minutes. 8. Place your boiled sample on ice for 2 minutes. 9. Centrifuge the sample for 30 seconds to pack the Chelex beads to the bottom of the tube. Your extracted DNA will remain in the supernatant above the beads. B. PCR Amplification 1. Put on safety glasses and latex gloves. Use a permanent marker to label the top of a sterile 0.6 mL microcentrifuge tube with your initials. This is your PCR reaction tube. 2. Using a P-20 micropipettor, pipet 10 µL of your extracted DNA sample into the reaction tube. Be careful to pipet only the supernatant of your DNA sample – avoid the Chelex beads. Any Chelex in the PCR reaction will inhibit the enzyme and ruin the reaction. 3. In the ice bucket on your bench you will find a tube labeled Amp Mix. Use a P-200 micropipettor to pipet 30 µL of this amplification mixture into your 0.6 mL PCR reaction tube. This mixture contains oligonucleotide primers, the four deoxynucleotides (A, T, C, G), Taq polymerase, and buffers. 4. Use a P-20 micropipettor to pipet 10 µL of MgCl2 solution to your 0.6 mL PCR reaction tube. 5. Your DNA is now ready for amplification in the PCR machine, or thermal cycler. Your instructor will collect your reaction tube, place it in the PCR machine, and start the program. The reaction will proceed as follows: 1 cycle 94˚C for 5 minutes 30 cycles 94˚C for 1 minute 58˚C for 2 minutes 72˚C for 2 minutes 17 18 Biology 280 Lab Write-up: Alu Name:________________________ Date of Investigation: Below is the sequence of human DNA called the TPA-25 locus that we are amplifying in class using Polymerase Chain Reaction or PCR. The sequence below also contains the Alu insertion. The sequence shows spaces every ten bases. 5’ CACATATTGT TTAAGGGTCC CTTTATTGCA TGTCTGACAA ACGCCTGTAA TCAGGAGATC AAAACTACAA TACTTGGGAG TGCAGTGAGC AGACTCCGTC ACTGGGTAAC AAGGTCAAGA GGTTGACTGA ATGGGATTGA TAATCCCAGC GAAAAGCAAG TGGCCTGTAA ATGATTTGTA CCCTATGAGA TCCCAGCACT GAGACCATCC AAAATAGCCG GCTGAGGCAG CGAGATCCCG TCAAAAAAAA AAAGTTAAAG TGAACTCGGT GTCTCCTTCT AAATCTTAAA ACTTTGGGAG GTCTACCAGT CCATTTAGTC AGAGTTCCGT TTAGAACACT TTGGGAGGCC CGGCTAAAAC GGCGTAGTGG GAGAATGGCA CCACTGCACT AAAAAAAAAA AGAAGTTCTC GTCCTCCCTC ACTCTTACAT CTCCTGGCCT –3’ TTTCCAACCT CTCAGCTGTT AACAGGACAG ACGGCCGGGC GAGGCGGGCG GGTGAAACCC CGGGCGCCTG TGAACCCGGG CCAGCCTGGG AAAAGAACAC CTAGGGTGGG CCAGCTCAGT GGCCTGTGAT GGTGTGGTGG AAATCCCAAG CTCCTGACAT CTCACAGTTC GCGGTGGCTC GATCACGAGG CGTCTCTACT TAGTCCTGGC AGGCGGAGCT CAACAGAGCG TACATTACTG GGTGTGCTGC GGTTTTCATT GTGGCTGAAA TGCATGCCTG The sequence of the primers that we used in this experiment are shown below: Primer 1: 5’-GTAAGAGTTCCGTAACAGGACAGCT-3’ (upstream primer) Primer 2: 5’-CCCCACCCTAGGAGAACTTCTCTTT-3’ (downstream primer) 1. Draw a line under the sequence in the TPA gene where each primer binds with the proper label. Remember: The Alu sequence above shows only one strand of the double-stranded gene fragment. Since the downstream primer actually binds to the complementary strand, you will need to determine either the complementary strand for the gene or for the primer in order to determine where it binds. 19 2. Now underline the region in the sequence shown that represents the Alu insertion and label the line. 3. If you were interested in amplifying only the region from nucleotide 65 to nucleotide 400, what 20 base pair primers would you use? Remember to label the 5’ and 3’ ends of each primer. (Assume the first nucleotide shown is nucleotide 1.) 3. As you know, humans are diploid, meaning they have two copies of every chromosome. It is possible then that some individuals have one chromosome with the insertion and one chromosome without. What would you expect to see if the DNA from such an individual was amplified and run on an agarose gel? (Draw a picture of the gel with labels.) Answer questions 4 – 7 after Part 2 of the Alu lab (running the gel). 4. What is your Alu genotype? 5. What is the distrubution of genotypes for the class? 20 6. If a crime had been committed, could you identify the perpetrator from a DNA sample using the Alu locus? (Assume that a DNA sample has been voluntarily provided by all suspects in the case.) Explain your reasoning. 21 22 PCR and Genetic Variation in Humans Alu Part 2 Adapted from Cold Spring Harbor Laboratory DNA Learning Center by the Washington University Science Outreach Program Background: This week you will run your amplified DNA on an agarose gel to separate alleles. If your amplification was successful, you will be able to determine your D1S80 genotype. We will then quantify occurrences of the alleles in our class. Day 2: A. Running an agarose gel 1. Your instructor will return your PCR reaction tube to you. Your reaction tube now contains your PCR product. 2. Put on safety glasses and latex gloves. Find a tube of Orange G loading dye. Use a P-20 micropipettor to add 10 µL of loading dye to your PCR reaction tube. 3. Use a P-200 micropipettor to load the entire 60 µL into one well of the 2.0% agarose gel. Record your name on the gel diagram sheet next to the appropriate well number for your PCR product. This is so you will know which lane of the gel contains your PCR product. 4. The gel should run for ~1.5 hours at 100V. B. Analyzing the gel When the electrophoresis is finished, your instructor will photograph the gels so the results can be visualized and analyzed. Be sure to note the number and size of any bands in your lane of the gel. 23 24 Sterile techniques and transforming bacteria with DsRed. Adapted from M.Grupe ‘What color is your colony?’ teacher manual, Washington University Science Outreach 2005 A. Practicing Microbiological Techniques Background: In this laboratory, we will be using the bacteria E. coli. To avoid contamination of your experiment with unwanted bacteria, it is necessary to use sterile techniques and work with all reagents carefully. Wipe your work area with bleach before and after your laboratory and dispose of used pipette tips, tubes, etc. in the containers provided by your instructor. Always wash your hands thoroughly after each laboratory exercise. 1. Put on your safety goggles. Observe your plate of bacteria. To avoid contamination, do not open the lid yet. Notice that the bacteria grow in clumps called colonies. Each colony started as one bacterium that then multiplied many times. All of the bacteria in a colony usually are identical. Bacteria grow and divide quite rapidly. Under the best conditions, E. coli cells can divide every 20 minutes or so. 2. Transfer a colony of bacteria into the tube of nutrient broth. Follow these steps: a. b. c. d. Carefully remove an inoculating loop from the package. Do not touch the loop to anything. Carefully tilt the lid of the petri dish containing the bacteria, without moving the lid away from the dish. This protects the agar from contamination with organisms floating in the air. Insert the loop end of your inoculating loop and touch a single colony of the bacteria with the tip. Withdraw the loop and replace the lid. Place the loop containing the bacteria into the tube of nutrient broth. Twist the loop to mix the E. coli with the broth. Check to be certain the bacteria have come off the loop. Remove the loop. Recap the tube. Dispose of the loop in the bleach solution provided at your lab station. 3. Next, you will practice transferring a bacterial suspension from a tube to a petri dish. a. b. Label the bottom of a petri dish containing sterile nutrient agar with your name and the date. Set the micropipettor to 100 µl. Using a sterile large tip, transfer from the tube 100 µl of the bacterial suspension that you just made 25 c. onto the agar in the petri dish. Dispose of the tip in the waste container. Using a new sterile loop, spread the broth containing the bacteria over the surface of the agar. Make a zigzag motion across the agar. Turn the plate 90˚ and repeat the zigzag motion. Dispose of the loop in the waste container. Put the cover on the plate and seal it with a piece of plastic wrap. Return your plate to your instructor. B. Transforming Bacteria: Part 1 1. Get a cup with ice in it. Locate the tube labeled “W.” This tube contains a very small amount of water. Put your initials on the tube. Gently tap the tube on the counter to shake the drop of water to the bottom of the tube. This tube will be your control tube. Place it in your cup of ice. 2. Locate the tube labeled “DNA.” This tube contains the DNA with the genes that will change a bacterial colony color. Put your initials on the tube. Place it in your cup of ice. 3. Locate the tube labeled “C.” It contain a chemical called calcium chloride, CaCl2. Calcium chloride makes bacteria “leaky,” so that big molecules like DNA can get inside the cells. Put your initials on the tube and place it in the ice. 4. Locate the petri dish culture of E. coli. (You may be sharing this with one or more groups.) 5. Use the inoculating loop to transfer a mass of bacteria to the tube labeled “C.” Then use the micropipettor to mix the bacteria with the liquid in tube “C.” To do this: a. b. c. d. e. f. fluid in g. Tilt the lid of the petri dish, holding it over the dish. Pick a colony off the surface of the agar with the loop, then close the dish Open tube “C,” put the loop with bacteria into the liquid, and gently spin the handle of the inoculating loop to knock off the clump of bacteria. Be sure the bacteria came off the loop! Dispose of the loop. Set your micropipettor to 250 µl. Place a sterile tip on the micropipettor. Pick up the tube that now holds the bacteria and gently pipette the and out to break up the clump of bacteria. Cap the tube and dispose of the tip. Put the tube with the bacterial mixture into the ice. 26 6. Add 250 µl of the bacterial suspension you just made to the tube containing water. a. b. c. d e. Set your micropipettor to 250 µl. Place a sterile tip on it. Transfer 250 µl of the bacterial suspension from tube “C” to tube “W.” Pipette in and out several times to mix. Place tube “W” on ice. Dispose of the tip. 7. Repeat the steps in 7 to add 250 µl of the fluid in tube “C” to tube “DNA”. 8. Start timing both tubes (“W” and “DNA”) at this point. Leave them on ice for 15 minutes. 9. At the end of 15 minutes, bring the tubes (still on ice) to the water bath. Check the temperature in the water bath to make sure it is 42˚C. Transfer the two tubes from the ice to the water bath. Leave the two tubes in the water bath for exactly 90 seconds. 10. Remove the tubes from the water bath and put them back on ice for one minute. After that, they can be at room temperature. 11. Locate the two petri dishes, one labeled “water,” the other labeled “DNA.” Put your group name and hour on the bottom of each dish. 12. Set the micropipettor at 250 µl. Place a new tip on the micropipettor and transfer 250 µl from tube “W” onto the agar in the plate dish marked “WATER”. Dispose of the tip in the waste container. 13. Using a sterile loop, spread the bacteria across the surface of the agar. Remember to zigzag, then rotate the dish and repeat. Dispose of the loop. 14. Place a new tip on the micropipettor and transfer 250 µl from tube DNA onto the plate marked DNA. 15. Using a sterile loop, spread the bacteria across the surface of the agar. Remember to zigzag, then rotate the dish and repeat. Dispose of the loop. Seal both dishes with plastic wrap and place them where your instructor directs. Incubate at room temperature. D. Viewing Plates Observe both of your agar plates and record what you see in the space below. Be sure to record when bacterial colonies can first be seen on each plate. Meanwhile, begin filling out the laboratory write up on the next two pages. 27 28 Biology 280 Lab Write-up: Sterile Techniques and Shine On Name:________________________ Date of Investigation: 1. In this laboratory we used molecular cloning techniques to isolate and amplify DNA fragments. How is DNA amplified in a cloning experiment? 2. How is this amplification different from a PCR experiment? 3. In a cloning experiment like this, what does it mean to say that a bacterial cell has been transformed? 4. How do we separate transformed cells from untransformed cells? Be specific. 5. What are the controls in this experiment? Why are they important? 6. What is the major trait we are trying to change in the bacteria? Extra point if you name both traits changed in this experiment 29 7. What is it that causes the bacteria to change color? 8. Does it matter which organism the piece of DNA we insert into the bacteria came from? Why or why not? 30 Learning to Clone DNA Part One: Construct Creation and Bacterial Transformation Background: Molecular cloning is a process by which DNA fragments are isolated and amplified. This technology is based on the ability to cut DNA fragments precisely (using restriction enzymes), join DNA fragments together, and make many copies of the new DNA. In this lab we will cut up yeast DNA (our source DNA) and join these DNA fragments (using a DNA ligation reaction) with a circular piece of DNA that makes copies of itself naturally when inserted into a bacterial cell. The circular piece of DNA is called an autonomously replicating vector or plasmid molecule. In this lab you will use a small (~3kb) double-stranded DNA vector called pBlueScript (pBS; see map below). Like all good vectors, pBS contains the necessary information to support its autonomous replication in a host (E.coli bacteria in this case). This plasmid is engineered to allow it’s host cell to grow in the presence of an antibiotic (ampicillin in this case). This trait will allow us to easily identify which cells contain our plamid. This plasmid also contains a region with “unique” cutting sites. These are called unique sites because they occur only once in the vector and provide useful positions to insert DNA fragments from the source organisms. For this experiment, you will be given a sample of pBS that has been cut once with the restriction endonuclease EcoR I to generate 5’- P AATTC G overhangs. The cut plasmid was subsequently treated with a phosphatase (an enzyme that cuts off phosphates), yielding a DNA fragment that looked like this: 5’- OH AATTC Now this cut in the circular piece of DNA will not seal back together, a process called vector re-ligation, but remain open to receive our fragment of source DNA. A DNA ligase reaction requires a 5’-phosphate and a 3’-hydroxyl as substrates. The source insert DNA in this lab will be from the yeast Saccharocyces 31 cerevisiae. You will be given a sample of S. cerevisiae DNA that has been cut with the restriction enzyme EcoR I but NOT treated with phosphatase. pBlueScript (2.96 kb) ampicillin resistance marker Multiple cloning site EcoRI DNA replication origin The source and vector DNA samples will be ligated in a small reaction containing buffer, ATP, and DNA ligase. As a negative control, each group will set up a ligation omitting the source DNA. After a brief room temperature incubation, the ligation products will be mixed with E. coli cells that have been pretreated with CaCl2 to make their cell membranes porous, thereby facilitating uptake of DNA molecules from the external media (cells prepared in this manner are known as ‘competent’ cells). The cell + DNA mixture will be ‘heat shocked’ to stimulate DNA uptake (by a poorly understood mechanism). The cells will then be allowed to recover from this harsh treatment in fresh non-selective media. Finally, the cell mixture will be spread on selective LB + ampicillin plates allowing the bacteria that have taken up plasmid DNA to replicate while killing the bacteria that failed to take up a plasmid. The result will be the growth of colonies, each colony representing an independent ‘transformation’ event. All bacteria in a colony are clones (i.e., genetically identical individuals derived from a single precursor) that harbor the same plasmid + insert combination. The actual molecular cloning experiment that you begin today will be continued over the next four weeks. Week One: You will perform a DNA ligation reaction with prepared bacterial plasmid vector and source (in this case, yeast genomic) DNA fragments. The 32 products of the ligation reactions will then be mixed with permeabilized (competent) E. coli host cells. Bacterial cells that take up plasmid vector DNA (+/- inserts) will be selected on antibiotic nutrient plates to recover individual clones. The day after lab, you will examine the transformation plates, count colonies and start a liquid bacterial culture using a clone that potentially contains an insert as an inoculum. Week Two: You will prepare plasmid DNA from your bacterial culture. Using this DNA, you will set up a restriction endonuclease digestion of your potential plasmid clone. You will also set up a sequencing reaction that will yield the sequence of your inserted yeast DNA fragment, if one is present. Week Three: You will analyze the results obtained from your restriction digestions and the results obtained from your DNA sequencing reactions. Experimental Protocol: (Work in groups of 2) Enz mix (buffer + DNA ligase) Vector DNA Source DNA dH20 ATP [5 mM] + insert ligation – insert ligation 4 µl 2 µl 2 µl 0 µl 2 µl 10 µl 4 µl 2 µl 0 µl 2 µl 2 µl 10 µl Ligation: 1. Find two tubes labeled E and label them with your initials. Label one tube as + (for ‘plus insert DNA’ ligation) and the other tube as – (for ‘minus insert DNA’ ligation). 2. To each ‘Enz’ tube add 2 µl of vector DNA from tube labeled V (REMEMBER TO CHANGE YOUR PIPET TIP AFTER EACH TRANSFER!!!) 3. To the ‘+’ tube add 2 µl of source DNA. For the control ligation, add 2 µl of dH20 to the ‘–’ tube. 4. Add 2 µl of ATP to each of the ligation tubes to start the reaction. *The reaction mixture set-up in steps 1-4 is summarized in the table above. 33 5. Tap the tube to mix the reaction and spin the tube briefly in the microcentrifuge to collect the liquid in the bottom of the tube. Incubate the reactions at room temperature for ≥15 minutes. 6. After room temperature incubation, place each ligation tube onto ice for 5 minutes to cool. Transformation 7. To each tube, add 100 µl of competent cells (always keep the cells cold and use sterile technique!). Quickly mix by gently tapping the tube and place the cells + DNA back on ice to incubate for 20 minutes. 8. ‘Heat shock’ the cells by placing the tubes in a 37˚C waterbath for 2 minutes. 9. To each tube, add 500 µl of LB liquid broth. Mix tube by tapping. Incubate at 37˚C for 10 minutes. 10. Remove 100 µl of liquid from each tube and spread on the surface of an LB + ampicillin plate. Incubate at 37˚C overnight. * In parallel, your lab instructor will carry out three additional control experiments. Competent cells that were not mixed with DNA (‘Cells only’) will be plated on 1) LB without antibiotic (a positive control for cell viability), and 2) LB + ampicillin (a control to confirm the efficiency of the antibiotic selection). In addition, competent cells mixed with unligated vector + insert DNA will be plated on LB + ampicillin as a negative control to determine the number of colonies that result from residual uncut vector DNA. 11)On the following day between Noon and 5 PM: Retrieve your plates and the additional control plates from the incubator in Rebstock 132. Count the colonies on each plate and record below: Plate: Cells only No amp Cells only plus amp vector + insert unligated –insert ligation + insert ligation Colony # After you record your data, pick ONE well-separated colony from the + ligation plate and inoculate an LB + ampicillin liquid culture. A workbench and sterile wooden sticks will be available in Rebstock 132 to start your cultures. 34 Remember to use sterile technique. Both you and your lab partner should each inoculate a culture. Place your inoculated tubes in the rack designated for your section. At the end of the day, your cultures will be incubated overnight at 37˚C and saved at 4˚C until next week at which time you will prepare plasmid DNA from your culture. Be sure to label your tube with your name and lab section! 35 36 Bio 297 Lab Write-up: Learning to clone DNA. Name___________________ Date of Investigation : Answer the following questions after Part 1: In the table below, indicate your results for this experiment. Count the number of colonies on each plate and record them. If there are more than 500 colonies just record ‘>500’; if there is a lawn record ‘lawn’. Plate Reaction contained: 1 Cells only Ampicillin in plate no 2 Cells only yes 3 Cells + vector, + insert, no ligase Cells + vector, no insert, + ligase Cells + vector, + insert, + ligase yes 4 5 Your results yes yes 1. If this experiment worked correctly, you will see colonies on Plate #5. Explain the presence of bacterial growth that you see on plates #1 through 4. Based on the presence or absence of bacterial growth on plates 1-4, can you say that your experiment was successful or not? Explain. ______________________________________________________________ ______________________________________________________________ ______________________________________________________________ ______________________________________________________________ ______________________________________________________________ ______________________________________________________________ ______________________________________________________________ _________________________________________________ 2. Using your notes from class and any outside materials you need, make a diagram illustrating the first three initial phases of our cloning experiment. a. Ligation phase (Draw everything you know about vector and insert. How are they cut, phosphatated, and ligated?) b. Transformation c. Plating 37 Answer the following questions after Part 2: 1. Using your notes from class and any outside materials you need, draw the DNA isolation experiment in two steps. 1) What did the cell look like before the plasmid was isolated? 2) What was the result after the plasmid isolation (miniprep experiment)? 2. Using your notes from class and any outside materials you need, draw a picture of the following: a. The vector cut with EcoRI, no insert. b. The vector cut uncut (3 possibilities) c. The vector cut with EcoRI, 1 insert. 38 Learning to Clone DNA Part Two: Clone Characterization Background: Last week you picked a single colony off of your ‘+ insert ligation’ plate and inoculated a liquid culture. Each colony should represent a single clone. In this case, the clone is a particular DNA molecule. If the experiment was successful, the clone should be a chimeric molecule comprised of the pBS vector and at least one of the ~3 X 103 EcoR I fragments present in the source yeast genome. This week, you will isolate plasmid DNA from your inoculated culture and cut the DNA with the restriction enzyme EcoR I. The aim of this exercise is to determine the structure of the clone, enabling you to answer the following questions: 1) do you have a single insert, multiple inserts or no insert? 2) can the insert(s) be ‘cut out’ of the vector with EcoR I as expected? 3) what is the size of the inserted fragment or fragments? To purify your cloned DNA, you will use a standard ‘alkaline lysis’ bacterial plasmid miniprep procedure that allows isolation of plasmid DNA free of the bacterial chromosomal DNA. The protocol involves five steps. First, the bacterial cells are harvested by centrifugation. Second, the cells are resuspended in a buffered solution (#1) and lysed by the addition of Solution #2 containing NaOH and sodium dodecyl sulfate (SDS, a common detergent. Check the fine print on most shampoo bottles.). The NaOH also serves to denature double-stranded chromosomal and plasmid DNA. The third step is the neutralization (with Solution #3) of the alkaline lysis solution to allow doublestranded DNA to reform. The strands from each small circular plasmid DNA will remain interlocked and will quickly reanneal to reform a double-stranded plasmid. However, the complementary strands of the larger linear chromosomal DNA fragments will not reanneal properly and will become irreversibly denatured and insoluble. In addition, the potassium ions in the neutralization solution (#3) cause the detergent to precipitate with insoluble denatured proteins and improperly reassociated chromosomal DNA, carbohydrates and cellular debris. This unwanted material is pelleted by centrifugation in the fourth step, leaving a supernatant containing the desired plasmid DNA. The fifth step is the collection of the plasmid DNA by ethanol precipitation and centrifugation. The plasmid DNA pellet will be resuspended in a dilute buffer (TE) plus RNase to degrade unwanted RNA. After isolating plasmid DNA from your bacteria, you will set up a restriction endonuclease digestion. A portion of your newly isolated plasmid DNA will be added to a tube containing a mixture of EcoR I and buffer. Your restriction digests will be incubated for several hours at 37˚C and the resulting fragments size-fractionated by agarose gel electrophoresis. Next week in lab, you will analyze the data from your section. 39 In the final step of the lab today, you will set up a reaction that will enable sequencing of your inserted DNA fragment. The sequencing protocol we will employ makes use of a technique with which you should already be familiar: PCR. Your plasmid + insert DNA will be used as a template to synthesize additional copies of this DNA. As in a typical PCR reaction, the DNA polymerase being used is stable at high temperatures and allows copies of your template to be generated after ‘melting’ a double stranded sample into single strands. Both deoxynucleotide triphosphates and dideoxynucleotide triphosphates will be included along with a DNA polymerase, your DNA template and a primer (complementary to your plasmid DNA, usually to a region within the multiple cloning site). As you remember from lecture, dideoxynucleotide triphosphates are missing a 3’-OH group. Without this 3’-OH group, another nucleotide cannot be added to the lengthening DNA chain; therefore, polymerization of the new DNA strand stops when one of these nucleotides is used. The ratios of dNTPs to ddNTPs has been formulated so that polymerization is terminated by incorporation of a ddNTP at least once at each base, resulting in the production of many DNA strands, each of which is of unique length, differing in size by one base. The differently-sized products that are generated by this sequencing protocol can be separated by gel electrophoresis. Each dideoxynucleotide is ‘labeled’ with a colored dye specific to the nucleotide (for example, dideoxyadenosine triphosphate is labeled with a green dye while ddGTP is labeled blue), and the color of each piece of DNA is detected and recorded as it’s run off the gel. A computer analyzes this color data and produces a print out of your DNA sequence. Of course, the actual reading of the sequence will not be done by you; this process is all automated. All you need to do is set up the reaction, put it in the PCR machine, and analyze your results! Experimental protocol: (Work in pairs) DNA isolation 1. Transfer 1000 µl of your overnight culture to a clean microcentrifuge tube. 2. Pellet cells by centrifugation for 1 minute. 3. Remove the supernatant using a P-1000 pipetman. Make sure there is no excess liquid remaining on the cell pellet. 4. Add 250 µl of Solution #1 to the cell pellet. 5. Vortex the tube to resuspend the cell pellet. Make sure that no pellet remains visible on the bottom of the tube. 6. Add 250 µl of Solution #2. Close the cap and tip the tube back and forth to mix (Do not vortex!). Let the tube sit at room temperature for one minute. The mixture should become clear (loss of turbidity due to cells lysis) and viscous (long DNA polymers spill out of the broken cells). 40 7. Add 350 µl of Solution #3. Close the cap and tip the tube back and forth again to mix (Do not vortex here either). You should see a yellow-white precipitate form. Continue tipping back and forth for 30 seconds. 8. Spin the tube in a high speed microcentrifuge (on the side bench) for 10 minutes to pellet the debris (including chromosomal DNA, proteins, carbohydrates). 9. Carefully remove the entire volume of the supernatant and transfer to a clean spin column / microcentrifuge tube. Be careful not to trasfer any of the pellet or floating particles. Discard the tube with the precipitate in the biohazard waste. 10. Spin the DNA-binding spin column / microcentrifuge tube in the microcentrifuge on your bench for one minute. All of the liquid will be forced through the column and any DNA in the liquid will bind to the resin on the column. Discard the liquid in the microcentrifuge tube and put the column back into the same tube. 11. Transfer 700 µl of Solution #4 onto the column. Spin the tube in the microcentrifuge on your bench for one minute. This step serves to wash any non-specifically bound material off of the column. The plasmid DNA remains bound to the column during this was step. 12. Discard the liquid in the microcentrifuge tube and replace the column back in the same tube. 13. Spin the column / microcentrifuge tube (empty) in the high speed microcentrifuge (on the side bench) for one minute. This spin results in the removal of any residual liquid from the column. The plasmid DNA remains bound to the column. 14. Discard the microcentrifuge tube and place the column (still with your plasmid DNA) in a new, labeled microcentrifuge tube. 15. Elute (wash off) your plasmid DNA from the column by pipeting 35 µl of dH2O onto the column. Spin the column / microcentrifuge tube in the microcentrifuge on your bench for one minute. Your plasmid DNA is now in the liquid in the microcentrifuge tube; the column can be discarded. Use this DNA to set up both the restriction digestion and the sequencing reactions as specified below. Restriction digestion 1. Transfer 5 µl of your plasmid DNA into a clean microcentrifuge tube marked E for EcoR I digestion. Transfer another 5 µl of your plasmid DNA into a second microcentrifuge tube marked U for the uncut control. Add your initials and section letter onto the side or top of the tube for later identification. 2. Add 15 µl of EcoR I restriction digestion mix to the tube labeled ‘E’. Add 15 µl of Mock restriction digestion mix (buffers but no enzyme) to the tube labeled ‘U’. This mock digestion serves as a negative control to determine if the EcoR I digestion worked. 41 Sequencing reaction 1. Transfer 10 µl of your plasmid DNA into a clean 0.5 ml (small) microcentrifuge tube marked with your initials and section letter on the top. 2. Add 2 µl of primer to the tube. 3. Add 8 µl of BigDye TerminatorTM reagent to the tube. This reagent is a commercially-obtained mix containing a modified Taq DNA polymerase, buffers, MgCl2, dNTPs and labeled ddNTPs. Close the cap and place the tube in the ice bucket on your bench. Your reaction tubes will be collected by your instructor and will be placed in the thermal cycling machine along with those from students in your section for the sequence cycling reaction. When the cycling is complete, they reaction products will be precipitated and sent to the sequencing facility in Rebstock for the automated reading of your sequence. The sequencing results will be sent to your lab instructor and you will have the opportunity to analyze them in the ‘Genomics and DNA Sequence Homology Analysis’ lab. 42 Learning to Clone DNA Part Three: Restriction Analysis Background: Bacterial restriction enzymes were originally discovered as part of a defense system against infection by bacterial viruses. These restriction enzymes function to break up or hydrolyze pieces of DNA with specific DNA recognition sequences. This activity is useful in preventing infection because the bacteria can partially hydrolyze the DNA of an infecting virus, if the virus has the sequence that is recognized by the restriction enzyme. You can see, then, that having one restriction enzyme does not completely prevent a bacterium from infection by bacterial viruses because each restriction enzyme only cleaves the DNA of viral strains that contain the specific DNA recognition sequence. To increase their resistance to infection, many bacteria synthesize multiple restriction enzymes. Restriction enzymes are named for the organism from which they are first isolated and characterized. The first three letters of the enzyme name comes from the name of the bacterial species, and is optionally followed by a Roman numeral indicating the order in which this enzyme was isolated from that organism relative to other restriction enzymes from the same organism. The name part is usually pronounced rather than spelled out, but the Roman numeral is always pronounced by the cardinal number (one, two, three, etc.). For example, EcoRI (pronounced: E-coh-are-won) is derived from E. coli strain R, and it was the first enzyme isolated from this strain; Bgl II (bagel-too) was the second restriction enzyme isolated from Bacillus globigii. In today’s lab we will separate our restriction digest by gel electrophoresis. We will continue with lecture and when our samples have run far enough to analyze the results we will talk about them as a class. Experimental Protocol: 1. After you’ve retrieved your restriction digest from last class, spin the microcentrifuge tube briefly to bring any condensed liquid down to the bottom. Add 3 µl of loading dye. 2. Load the entire sample onto the gel. 3. Run all the samples in a 1.5% agarose gel for ~1hour at 120 volts. 43 Genomics and Homology Analysis Background: Often times when doing biological research, an investigator clones a gene, sequences it, and using the internet, discovers that it has never been characterized before. While it is exciting to discover a new gene further research becomes complicated by the fact that the function of this new gene product is not likely known. This problem can be approached in a number of different ways, the simplest of which is to look at gene sequences from other organisms and compare them with the sequence of the novel gene. If a gene with a similar sequence is found, and if the function of that similar gene product has already been described, it might give a clue as to the function of the newly discovered gene product. In this way, a scientist might be able to hypothesize about the function of the new gene based upon its homology with another species’ gene(s) of known function. In today’s lab, you will learn to analyze DNA sequences relative to other known sequences. We will be looking at the sequence data we obtained from the cloning experiment from the past few weeks. Procedure Part A: Searching for information about your cloned DNA What gene is it? Is it a gene at all? 1. Go to the Saccharomyces Genome Database at www.yeastgenome.org 2. Select the BLAST program from the links along the bar at the top of the page. When the BLAST page opens, you should see a box to enter your sequence data. You can just copy and paste the sequence results (in FASTA format) into that box. Next, change the Filter to “none.” Finally, hit “Run BLAST” to scan your sequence against the yeast genome. FASTA format and the BLAST program will be explained in class. When you submit your sequence for BLAST, the program will search the DNA sequence of the entire yeast genome to find the sequence you obtained from cloning. When it gets a match, a page will open with the results of your match and will contain links that you can use to find out more about the sequence that your clone matches (for example, gene name, location in the genome, possible function etc.) 3. On the results page, you will see some matching scores as well as a probability number (P-value). You can think of the P-value as the probability that the hit you found was found by random chance and is not significant (so you want the P-value to be very low!). Start filling out the Sequence Information Form as you complete the following steps. 44 4. It is hard to see much information about the sequence on this page, so select “ORF Map” to see a map of the open reading frames in this area of the genome. 5. The match to your sequence is located between the red dotted lines. If there is a red or blue box in that location, you have cloned part of a gene. The key to the colors used in this diagram is directly below the diagram. The gene names are short combinations of letters and numbers near the red or blue box. If the dotted lines do not intersect with a gene, skip to step 7. We will talk more about non-coding DNA in the next activity. 6. Click on the colored box representing the gene, and a new page will come up with information about that gene. Use this information to answer the questions about the gene product on the Sequence Information Form. 7. If you have not cloned part of a gene, open a sequence file from List A (a folder on the course webpage) and repeat steps 1 – 6 on your own, filling out the Sequence Information Form as you go. If your clone WAS part of a gene, open a sequence file from the “List B” folder. Repeat steps 1 – 6 with this sequence on your own and fill out the Sequence Information Form for this sequence. Procedure Part B: Why didn’t everyone clone a gene? Which organism has more non-gene DNA, yeast or humans? 1. We will use “Map Viewer” from the NCBI website to explore the yeast and human genomes. Go to http://www.ncbi.nlm.nih.gov/mapview/ 2. Once at this site, select “saccharomyces cerevisiae” from the phylogenetic tree. You should now be viewing an interactive map of the yeast chromosomes. 3. Select one of the sequences you studied from the Sequence Information Form. Click on the chromosome number where this sequence was located. From this screen record the number of base pairs shown as well as the total number of genes. 4. As a class, we will perform a calculation to determine the percent of base pairs in the chromosome that are part of a gene. Next, we will repeat steps 2 – 4, looking at a human chromosome. Are humans or yeast more efficient with their use of DNA? 45 Human Genome vs. Yeast Genome Fact Table: Number of chromosomes Total size (bp) Total genes (estimated) Repetitive elements Pseudogenes % of genome that is coding Human 22 + X and Y Yeast 16 3,000,000,000 ~25,000 12,067,000 ~6,200 43% ~8,000 ~1% 10% 48 (non-gene ORF’s) ~50% The average length of a gene is ~1000 bp (although some yeast genes are much shorter and some human genes are much longer). Based on this length, what is the maximum number of genes the human and yeast genomes could have? Which genome is closer to having its maximum number of genes? 46 Sequence Information Form Sequence Cloned in Class Sequence information: Name of file P-value of top hit from the BLAST search Chromosome and coordinates (bp) Gene name Crick or Watson strand? Gene product: What is the function of the gene product? Where is the gene product found in the cell? Can yeast survive without this gene? 47 Chymotrypsin Enzyme Activity Assay By April Bednarski Introduction: Chymotrypsin is a digestive enzyme that can break peptide bonds in proteins. It is part of a class of enzymes called proteases. Other members of this class are involved in blood clotting, development, and inflammation. Proteases are present in plants as well as animals. Papain is another protease and can be isolated from papayas. In our bodies, the substrate for chymotrypsin is a specific type of peptide found in proteins that contains a bulky hydrophobic group. In this experiment, the activity of chymotrypsin will be measured using the substrate, GPNA (glutaryl-L-phenylaniline 4-nitroaniline). When chymotrypsin cleaves the peptide bond in GPNA, p-nitroaniline (PNA) is released. PNA is a yellow substance, so we can use UV spectroscopy to detect product formation. At the end of each of your assays, you should notice that the solution becomes more yellow. Enzymatic reaction: O OH NO2 O O HN H N H NO2 Chymotrypsin + GP H2 N GPNA (Glutaryl-L-phenylalanine 4-nitroaniline p-nitroaniline (absorbs strongly at 410 nm General Procedure: Each enzymatic assay will be set up with a total volume of 5 mL in a cuvette (or test tube) and activity will be measured by monitoring the absorbance of each assay at 410 nm in a Spectronic 20 spectrophotometer. Each assay will contain a final concentration of 10 µM chymotrypsin in 100 mM sodium phosphate buffer (NaH2PO4). For each assay, first combine the substrate and buffer in the cuvette and mix well. Set the cuvette in the Spec 20 and set the A410nm to zero. Initiate the reaction and start timing as you add the enzyme (0.5 mL of chymotrypsin). Record the A410nm at 1, 2, 3, 4, and 5 minutes. For each assay, the substrate concentration will increase. We will see how the amount of substrate added affects enzyme activity. Do you think you will see an infinite increase in enzyme activity as you add more and more substrate? Why might the enzyme activity reach a constant maximum level? 48 Procedure: Mix the NaH2PO4, DMF, and substrate for each assay in a cuvette according to the table below. DO NOT ADD ENZYME YET! Lab each cuvette with the assay number (A1 – A5). Use buffer which has been pre-warmed to 30˚C. (each assay will total 5 mL): Assay # NaH2PO4 (pH (substrate 7) concentration) A5 (1.5 mM) 4 mL A4 (1.2 mM) 4 mL A3 (0.9 mM) 4 mL A2 (0.6 mM) 4 mL A1 (0.3 mM) 4 mL DMF Substrate in DMF Enzyme 0.0 mL 0.1 mL 0.2 mL 0.3 mL 0.4 mL 0.5 mL 0.4 mL 0.3 mL 0.2 mL 0.1 mL 0.5 mL 0.5 mL 0.5 mL 0.5 mL 0.5 mL Set up the enzyme, assay cuvettes, and data table next to the spectrophotomer. Make sure the spectrophotometer is warmed up and has been zeroed. For each assay, follow the procedure below: (Note: after adding the enzyme, work as quickly as you can since the reaction has started!) 1. Place cuvette in the Spec 20, set to zero. 2. Add the enzyme, gently mix by shaking the cuvette lightly, place back in the Spec 20, and start timing. 3. Record A410nm at 0 min, 1 min, 2 min, 3 min, 4 min, and 5 min in the data table below. 4. After 5 min, you can stop timing. Repeat this procedure with the next assay. Data Table: Assay # A1 A2 A3 A4 A5 Substrate conc. 0.3 mM 0.6 mM 0.9 mM 1.2 mM 1.5 mM A410nm 0 min A410nm 1 min A410nm 2 min A410nm 3 min A410nm 4 min A410nm 5 min From this table, plot A410nm (y-axis) versus time (x-axis) for each assay. You will have a total of five lines on your plot (each assay is a different line). Label each line. Use this plot to answer the questions on your lab write-up. Hand in this plot with your write-up. 49 Data Plot 50 51 Biology 280 Lab Write-up: Chymotrypsin Assays Name:________________________ Date of Investigation: 1. Which assay reached maximum absorbance the quickest and which assay took the longest? 2. What do you predict the line on the plot would look like for an assay with 3 mM substrate? Describe below, or lightly sketch on your plot. Explain your answer. 3. Why would differences in substrate concentration lead to a difference in enzyme activity? Hint: think about what needs to happen for the enzyme to make product out of the substrate. 4. The slope of each line on your plot starting at (0,0) and extending along the assay line for as long as it is straight, is called initial velocity and is represented by V0. What trend can you see from comparing the slope of this line for each assay? 52 Yeast Genetics Adapted from Modern Genetics for All Students, Washington University in St. Louis, Science Outreach, 2000. T137, S112 Background: Today we will do a real genetic experiment, with a real, live model organism. The model organism we will use is baker’s yeast. Yes, this is the same organism that the bakers of Bunny Bread use to make that spongy white stuff for your peanut butter and jelly sandwiches! Baker’s yeast is a unicellular fungus with a life cycle that at first seems very different from that of the more familiar animals and plants. As different as it looks at first glance however, you will notice that the yeast life cycle does resemble the life cycle of animals and plants in several very important regards. It involves haploid and diploid cells and thus meiosis and recombination. Indeed, yeast follows the same basic rules of inheritance that we do, even though its haploid cells bear no resemblance to the sperm and egg cells of animals. Because yeast can complete its life cycle in less than a day (under optimum conditions), it can be used to perform many different genetic experiments in a very short time. As a result, it is one of the most popular model organisms for geneticists who are interested in studying the basic principles of genetics that apply to all eukaryotic organisms. In this exercise, we will use red and white strains of yeast to study a simple genotype-phenotype relationship that until now we had only encountered as a theoretical concept. In addition to strains that differ in color, we will also need strains that differ in mating type so that they will be able to fuse to make diploids. Yeast geneticists call the two mating types of yeast a (small a) and _(alpha). But to be sure that we do not confuse the two strains in our experiment, we will use the terms A (capital A) and _(alpha). In a sense, the two mating types, A and _, are to yeast as males and females are to animals. Your job will be to formulate a hypothesis about the genetic basis for the color difference that you will observe between the haploid strains you will cross, use that hypothesis to make a prediction about what color the diploids will be, and then mate four strains of yeast to test your hypothesis. THE LIFE CYCLE OF YEAST Haploid and diploid yeast cells look similar, and both can divide mitotically to form large colonies. Haploid cells come in two mating types, which we will call A and _. As long as these two mating types are kept apart, they continue to grow and divide in the haploid state. But if they make contact, they fuse to form an A/_ diploid. Under most conditions, the A/_ diploids will grow and divide continuously, forming colonies. Under certain nutritional conditions, however, A/_ diploids undergo meiosis to produce new A and _ haploids. If the A and _ cells are not separated at once, they fuse again to make new A/_ diploids. At first this might seem like a 53 waste of effort, but it is not. Because various alleles will have been sorted out and recombined at random in the process, the new generation of A/_ diploids will include individuals that are genetically different from those in the earlier generation, and one or more of these new variants might be better adapted to the nutritional conditions that now exist. Procedure: Mating Type A R Mating Type A W R R R W W W 1 2 3 4 Making the Genetic Crosses Do not open your petri dish until instructed to do so. 1. With the dish still closed, draw four circles and number them 1 to 4 (fig. 3). 2. Get a sterile toothpick. Lift the agar-containing part of the petri dish, and — while keeping the dish upside down — gently rub the Red Mating type A colony with the end of the tooth pick. Now rub the toothpick on the agar lightly in the center of circle 1. You want to deposit only a small number of cells on the agar — barely enough to see. If you have more than this, try to scrape off the excess. Discard the toothpick in the waste jar. 3. With a new sterile toothpick gently rub the Red Mating type _ colony and then rub the toothpick in the same region of circle 1 where you rubbed with the Red Mating type A toothpick. Mix the two kinds of yeast cells together with the toothpick, but avoid stabbing the agar. Discard the toothpick in the waste jar. 4. Repeat steps 3 and 4 for the three empty circles, so as to cross the White Mating type A and Red Mating type _ strains in circle 2, the Red Mating type A and White Mating type _ strains in circle 3, and the two white strains in circle 4. 5. Place your petri dish in the incubator or where your teacher directs you. 54 Biology 280 Lab Write-up: Yeast Genetics Name:________________________ Date of Investigation: Part 1: 1. Describe the appearances of the four colonies of haploid yeast cells at the beginning of the experiment. 2. Formulate a hypothesis about the genetic difference that causes the difference in appearance of the red and white yeast strains. 3. Which color trait do you think will be dominant, or do you think that they will be codominant? Why? 4. Based on the above hypotheses, what do you predict the color phenotypes of the diploids will be in the following four crosses that you have set up? Red Mating type A x Red Mating type Red Mating type A x White Mating type White Mating type A x Red Mating type White Mating type A x White Mating type 55 Part 2 Observing Your F1 (A/_ Diploids) 1. Get your petri dish and observe your results. Even though there are no sperm and eggs involved, a Punnett square diagram can be used to record and analyze the results of yeast crosses such as the ones you performed. Use your observed results to fill in the blanks on the Punnett square below. Mating Type α Mating Type A R W R W Genotype______________ Genotype______________ Phenotype_____________ Phenotype ____________ Genotype______________ Genotype______________ Phenotype ____________ Phenotype ____________ 2. Are any of your results unclear? If so, indicate which ones, describe what you see, and provide a good explanation for these results. 3. What ratio of phenotypes did you observe as a result of the four crosses you performed? 4. What does this indicate about which allele is dominant and which is recessive? 5. Is this what you predicted? 56 6. In the table below, list what you predicted and what you observed for each of the four crosses. Cross Predicted Phenotype of Diploid Observed Phenotype of Diploid Red Mating type A by Red Mating type α Red Mating type A by White Mating type α White Mating type A by Red Mating type α White Mating type A by White Mating type α 6. If your predicted and observed phenotypes do not agree, how can you account for that, and can you propose a good hypothesis to account for the results you actually observed? 7. If you have come up with a new hypothesis, can you think of a way to test it? 57 58 Reebops: Inheritance of Traits Adapted from Modern Genetics for All Students, Washington University in St. Louis, Science Outreach, 2000. T119, S90 Introduction: Reebops are imaginary creatures that were invented by Patti Soderberg at the University of Wisconsin. We think that this is a good “model” system for studying heredity. As you create baby Reebops from marshmallows and other objects, they can help you see how the visible traits of a baby are related to the combination of genes that it inherited from its mom and dad (and why all the kids in the family don’t always look alike) Have fun Reebopping! Procedure: 1. You and your lab partner will receive an envelope that contains 14 red chromosomes that belong to Mom Reebop and 14 green chromosomes that belong to Dad Reebop. Decide which of you will act as Mom and which will act as Dad. Place your chromosomes on the table in front of you, letter side down. Your lab partner should do the same with the other set of chromosomes. 2. Arrange your 14 chromosomes into pairs by length and width. Select one chromosome from each of your seven pairs and place all seven in a special “gamete” (egg or sperm) pile. Your lab partner should do the same. The leftover chromosomes should now be returned to the envelope. What type of cell division has just occurred? 3. Combine the seven red and seven green chromosomes from the two gamete piles to form a “baby” pile. Now each Reebop baby will have 14 chromosomes just like Mom and Dad did. But half will be red and half green, indicating that half came from Mom and half from Dad. 4. Line up the chromosomes contributed to the baby by Mom and Dad in pairs of similar size, letter side up. You will see that each chromosome in a pair carries a gene of similar type (same letter of the alphabet). Some chromosome pairs might carry the same allele (either both capital letters or both lower case), indicating that the baby is homozygous (has two alleles of the same type) for the kind of gene carried on that chromosome. Other chromosome pairs might carry one dominant (capital letter) allele and one recessive (lower-case) allele, indicating that the baby is heterozygous (has two alleles of different type) for the kind of gene carried on that chromosome. 59 The combination of genes carried on these seven chromosome pairs defines your Reebop baby’s genotype (genetic constitution). Record this genotype on the lines below. ___ ___ ___ ___ ___ ___ ___ ___ ___ ___ ___ ___ ___ ___ 5. Refer to the Reebop Genotype-Phenotype Conversion Table on page S94 to determine your baby’s phenotype. Record the phenotype on the lines below, keeping the phenotypic traits in the same order as the genes you listed in step 4. ____________ ____________ ____________ ____________ ____________ ____________ ____________ 6. You are now ready to construct your Baby Reebop. Collect the body parts that you will need and return to your bench to build your baby. Genotype Phenotype DD Dd dd Three body segments Three body segments Two body segments AA Aa aa Two antennae One antenna No antennae NN Nn nn Red nose Orange nose Yellow nose EE Ee ee Two eyes Two eyes One eye MM Mm mm Three green humps Two green humps One green hump TT Tt Curly tail Curly tail 60 tt Straight tail LL Ll ll Blue legs Blue legs Red legs Note: Toothpicks function as the bones and ligaments that hold the Reebops together. Reebop Problem Solving: 1. Define the following terms and give an example of each from this activity. (You may refer to the Genetic Glossary.) allele: genotype: phenotype: homozygous: heterozygous: 2. If a Reebop female with a red nose and a Reebop male with a yellow nose marry and have children, what genotype and phenotype for nose color will their children have? (You may refer back to the Reebop Genotype-Phenotype Conversion Table.) genotype ______________________________ phenotype______________________________ 3. If a Reebop female with one antenna and a Reebop male with no antennae marry and have children, what genotypes and phenotypes might their children have with respect to number of antennae? genotype ______________________________ phenotype______________________________ 4. If a Reebop female with one antenna and a Reebop male with one antenna marry and have children, what is the probability that they will have a baby with no antennae? 5. If a Reebop female with two green humps and a Reebop male with two green humps marry and have children, what is the probability that their first baby will have two green humps? 61 6. If a Reebop female with three green humps and a Reebop male with three green humps marry and have children, what is the probability that they will have a baby with two green humps? 7. If a Reebop baby has a straight tail, but both of his parents have curly tails, what are genotypes of the two parents? Class Reebop Data Collection: Fill in the NUMBER of Reebops found in your class with the following heritable traits: Antennae Nose color Humps One _____ Red _____ One Two _____ Orange _____ _____ Two _____ None _____ Yellow_____ Three _____ Eyes Segments Tail Leg color One _____ Two _____ Curly _____ Blue _____ Two _____ Three _____ Straight_____ Red _____ 62 Biology 280 Lab Write-up: Analysis of Reebop Findings Name:________________________ Date of Investigation: 1. Describe the phenotypes of Mom and Dad Reebop. 2. Using the information in the Reebop Genotype-Phenotype Conversion Table, list all the possible genotypes that would produce the phenotypes exhibited by Mom and Dad. 3. How many of the Reebop babies in your class have the same phenotypes as Mom or Dad? 4. Do any two babies in your class have exactly the same phenotypes? 5. Why do some Reebop babies have traits that are not seen in either Mom or Dad? 6. Which Reebop traits are dominant? 7. Which Reebop traits exhibit codominance? 8. Use the information you have about the phenotypes of all of the Reebop babies in your class to figure out what the genotypes of Mom and Dad Reebop are. 9. If you know the genotype of the parents, is it possible to predict all of the possible genotypes of babies that they might produce? 63 10. If you know the genotype of the parents, is it possible to predict the genotype of any particular baby, such as their first one? 11. The Reebops appear to have only one gene on each chromosome. Do you think this is true of real, living organisms? 64 Can Sunscreen Stop Mutations? Adapted from Modern Genetics for All Students, Washington University in St. Louis, Science Outreach, 2000. T184, SP164 and WU Bio 2960 Spring 2004 Lab Manual. BRING IN YOUR OWN SUNSCREEN FOR THIS LAB, IF YOU LIKE. UV as a Mutagen You will use ultraviolet light (UV) to induce mutations in bacteria. UV is present in sunlight as radiation that (as the name suggests) is just beyond the violet end of the visible spectrum. UVis absorbed by, and damages, DNA, and therefore it is a powerful mutagen (mutation-causing agent) and carcinogen (cancer-causing agent). Fortunately, however, the ozone layer of earth's upper atmosphere filters out most of the UV that streams toward earth from the sun. If the ozone layer were to disappear, the surface of the earth would become uninhabitable. Indeed, some scientists believe that the drastic decline in frog and salamander populations that has been observed around the world in the past ten years is probably a result of the thinning of the ozone layer. It is also believed that this thinning has been caused by chlorofluorocarbons (CFCs), such as Freon, which were once used as refrigerants and propellants in hair-spray cans. Even at present ozone levels, the ability of organisms to live at the surface of the earth is due to their remarkable ability to repair DNA damage that has been caused by UV. The importance of DNA repair is highlighted by a genetic disease called xeroderma pigmentosum (XP) in which a critical enzyme required for DNA repair is nonfunctional. People with XPcharacteristically have many skin abnormalities, and most of them die of skin cancer before age 20. As important as they are, however, our DNA repair systems are inadequate to fully protect us against the carcinogenic effects of the UV rays in sunlight or tanning salons. It is now recognized that a single sunburn as a child or teenager can increase the risk of skin cancer many years later. Because UV is absorbed by the top few layers of skin cells in humans, however, UV radiation cannot penetrate to our ovaries or testes, where our germ cells are stored. Thus, UV damage in humans is largely restricted to cells in the skin and eyes. Although it is the principal cause of human skin cancer, UV does not cause heritable mutations that can be passed on to our offspring. In contrast, small one-cell organisms, such as bacteria, do not have the benefit of a protective layer of skin, and any non-fatal UV-induced mutations that they experience are invariably passed on to their progeny. We will take advantage of that fact in this exercise to study the mutagenic effects of UV on bacteria. How do the mutations occur? Nucleic acids absorb UV light strongly, and UV irradiation of DNA results in a wide variety of chemical alterations. One of the most important of these alterations is the formation of covalent bonds between adjacent pyrimidine residues on a strand of DNA, resulting in the formation of pyrimidine dimers. 65 As shown in Figure 1 below, the pyrimidines in such dimers are connected by cyclobutane rings. Pyrimidine dimers present a severe problem to the cell, because they block the progression of DNA replication and transcription. As you might expect, mechanisms have evolved to remove pyrimidine dimers from DNA. These repair mechanisms include general excision repair and a second mechanism, called photoreactivation. In the latter, pyrimidine dimers are split apart by an enzyme that utilizes energy from visible light. This photoreactivating enzyme (PRE; DNA photolyase) is one of very few proteins known in biology able to absorb and utilize the energy in visible light. Figure 1: A thymine dimer As in the tobacco experiment, you will be working with the Gram-negative bacterium Serratia marcescens. You will monitor the same two effects of the mutagenesis, death and loss of the red pigment, prodigiosin. Procedure: 1. Label 8 NA-plates as follows: 0 Dark, 15 Dark, 30 Dark, 0 Light, 15 Light, 30 Light, 30 Sunscreen. The numbers refer to the number of seconds of UV short-wave light exposure. For the 30 sunscreen plate only, draw a line through the center of the back of the plate; label one side +SS and the other SS (to designate with/without sunscreen). Write your initials on each plate. 2. Spread 100 µl of the suspension of Serratia bacteria onto the surface of each of your labeled plates. 3. Arrange your plates by exposure time and take them to the UV lamp for irradiation. For the "30 Sunscreen" plate, place a piece of sunscreen-coated Saran wrap over the plate so that the sunscreen coating covers the +SS half of the plate. Irradiate the sunscreen plate along with the "30 dark" and “30 light" plates. Note: remove the plastic lids from the plates before irradiation. 4. After UV exposure, the "dark" plates will be placed directly in a dark, room temperature incubator; the "light" plates will be incubated at room temperature in the presence of white light for ≥ 6 hours. After this initial light exposure all plates will be incubated in the dark at room temperature. 66 5. After the plates incubate at room temperature for about 48 hours, they will be placed in a refrigerator to arrest colony growth. Next week you will examine the plates and complete the analysis of this experiment. 67 68 Biology 280 Lab Write-up: Effectiveness of Sunscreen Name:________________________ Date of Investigation: 1. What was the purpose of exposing the bacterial cells to UV light? 2. What do the numbers 10, 12 and 15 refer to in our mutagenesis experiment? 3. A) What is the most common kind of mutation caused by UV light exposure? B) What repair mechanisms are available to repair this kind of mutation? 4. What was the purpose of putting some plates in the dark and some plates in the light? 69 5. Describe the control that was used in the UV mutagenesis experiment. Complete Questions 6 –8 after you obtain your results. 6. Do you think the Saran wrap had an effect on the experiment? How could you find out? 7. Describe your results. Include number of colonies and frequency of white colonies. Control plate: 0 Dark 0 Light 10 Dark 10 Light 15 Dark 15 Light 15 + Sunscreen 15 - Sunscreen 8. Discuss your results. Describe whether the results were expected. Offer possible explanations for any unexpected results. Finally, assess the sunscreen for the ability to protect against UV- caused mutations. 70 Learning to use Bioinformatics tools. This lab is designed around bioinformatics tools that are widely used in biomedical research today. Over the past few years, the number of freely available software programs and web-based research tools has increased dramatically. Knowing how to use these tools is very important to research and health professionals in order to access and interpret the constantly growing amounts of genomic and proteomic information. These bioinformatics tools as well as genomic information are freely available on the web to anyone who knows how to access them. Bio280 introduces these computer-based research tools in a context that reinforces concepts presented during the Bio280 lecture. You will be given your own gene to study during the 3 lab sessions. This gene will be the focus of all your guide sheets and final report. There will be one other person working on the same gene in each lab section. You are encouraged to help each other, but work individually. All the projects are similar in their structure and format, even though they are designed around different genes. The basic process for using web-based research tools is the same for all projects. 71 List of Projects: Lung Cancer K-Ras Cytochrome P450 1A1 Apoptosis Caspase 1 Superoxide Dismutase 1 Metabolic Disease Phenylalanine Hydroxylase Hypoxanthine-guanine phosphoribosyltransferase Cholesterol Biosynthesis HMG-CoA Reductase Low Density Lipoprotein Receptor Mitochondrial Diseases ATP Synthase 6 Cytochrome c Oxidase 1 Report: You will be compiling printouts from each lab period into a report showing what you have found out about your gene using the web-based resources. You will also write a summary of your findings. This report will be due at the last lab session Report format: Final report should contain: 1. Title page with: Your name and date Protein name Lab section number 2. Summary (1 page, single-spaced, see “Sample Summary” on the 3055 Website) 3. Multiple sequence alignment (annotated) 4. Swiss-Pdb Viewer figures with labels 5. Don’t include anything extra 72 Laboratory 1 - Introduction COX-2 (PTGS2) Tutorial On the first day of lab, we will be working through a tutorial on the web-based bioinformatics programs that you will be using for your research projects. You will also be receiving your project packet, which will contain the introduction, specific directions, and reading assignments for your project. There are ten possible projects, and you will be working on a project that is the same as one of the people at the computer next to you. Although you must each complete your own research project, you are encouraged to collaborate, discuss your projects, and help each other during the lab. The tutorial is based on the enzyme cyclooxygenase-2 (COX-2), which also has the name prostaglandin synthase-2 (PTGS2). You can read more about this protein on the next page. In this tutorial, the bioinformatics tools from the NCBI (National Center for Biotechnology Information) website will be introduced. NCBI is a division of the National Institute of Health (NIH). These tools include Gene, GenBank, RefSeq, and PubMed. Gene is a database of genes in which each entry contains a brief summary, the common gene symbol, information about the gene function, and links to websites, articles, and sequence information for that gene. GenBank is a historical database of gene sequences, which means it contains every sequence that was published, even if the same sequence was published more than once. Therefore, GenBank is considered a redundant database. RefSeq is a database of sequences that is edited by NCBI and is NON-redundant, meaning that it contains what NCBI determines is the strongest sequence data for each gene. Finally, we will be learning to use ClustalW, which is a multiple sequence alignment program. It allows you to enter a series of gene or protein sequences that you believe are similar and may be evolutionarily related. These sequences are usually obtained by performing a BLAST search. ClustalW then aligns the sequences, so that the lowest number of gaps is introduced and the highest numbers of similar residues are aligned with each other. ClustalW uses a scoring matrix similar to BLOSUM-62, which is explained in your text and will be presented in lecture. 73 Introduction to COX-2 (PTGS2) The enzyme we will be focusing on has two names. It is called prostaglandin H2 synthase-2 and cyclooxygenase-2 (COX-2). COX-2 has been thoroughly studied because of its role in prostaglandin synthesis. Prostaglandins have a wide range of roles in our body from aiding in digestion to propagating pain and inflammation. Aspirin is a general inhibitor of prostaglandin synthesis and therefore, helps reduce pain. However, aspirin also inhibits the synthesis of prostaglandins that aid in digestion. Therefore, aspirin is a poor choice for pain and inflammation management for those with ulcers or other digestion problems. Recent advances in targeting specific prostaglandin-synthesizing enzymes have lead to the development of Celebrex, which is marketed as an arthritis therapy. Celebrex is a potent and specific inhibitor of COX-2. Celebrex is considered specific because it doesn’t inhibit COX-1, which is involved in synthesizing prostaglandins that aid in digestion. This is a remarkable accomplishment given the great similarity between COX-1 and COX-2. This achievement has paved the way for developing new therapies that bind more specifically to their target and therefore have fewer side effects. Understanding the enzyme structures of COX-1 and COX-2 helped researchers develop a drug that would only bind and inhibit COX-2. Many of the types of information and tools used by researchers for these types of studies are freely available on the web. In this tutorial, and throughout this lab course, you will be introduced to the databases and freely available software programs that are commonly used by professionals in research and medicine to study genes, proteins, protein structure and function, and genetic disease. 74 GenBank Entry Date of latest modification The unique name for the gene locus. It was originally designed to help group entries with similar sequences. The first letter usually is related to the organism. Number of base pairs Molecule type Organismal division of GenBank A unique number for the sequence. A new GI number is given if the sequence changes in any way. Publication that discusses the data in the entry. Information about the gene sequence. CDS = nucleotide coding sequence Base count (A,T,G, and C) and actual sequence follow. 75 SwissProt Entry SwissProt Accession Number Click here before printing 76 SwissProt Entry Continued: Guide sheet questions focus on “Comments” and “Features” Sections: Amino acid residue numbers Active site residues, disulfide linkages, residues bound to carbohydrates, etc. Mutations found Sources disagree Best sequence known for this protein 77 The Sequence Manipulation Suite: Clear contents of the box Submit sequence Choose reading frame for translating Paste FASTA formatted nucleotide sequence from Word document here. Obtain FASTA formatted translation of nucleotide sequence 78 Taken from http://www.ebi.ac.uk/clustalw/index.html ClustalW Submission Form: We want the human sequence on top, so output should be in order of input, not order of aligned. Change this to “input.” Choose “full” Align with numbers, so we can find particular residues in the alignment. No color for easier printing. Enter FASTA formatted sequences from Word document using “copy” and “paste.” Choose “Run” to submit. 79 Background on PTGS1 (Cox-1) and PTGS2 (Cox-2): Follow these directions to access the entries for PTGS1 and PTGS2 in the “Gene” database at the NCBI Website: A. First, go to the NCBI homepage using the link on the lab webpage, or by going to: http://www.ncbi.nlm.nih.gov B. Select “Gene” from the database pulldown menu. Type “PTGS” in the search box, then click “Go.” C. Scan the results for the “Homo sapiens” entries. There should be one called “PTGS1” and one called “PTGS2.” D. Select each entry by clicking on its name, then read the paragraph under the “Summary” section for each entry. Read the “Summary” section for both of these genes, then answer the questions below. 1. PTGS1 and PTGS2 are isozymes. Isozymes catalyze the same reaction, but are separate genes. What types of reactions do PTGS enzymes catalyze? Also, what pathway are these enzymes a part of? 2. How is the expression of PTGS1 and PTGS2 different? 80 Part 1 – Getting sequence information and viewing database entries NCBI – Gene 1. Go back to the “Gene” entry for Homo sapiens PTGS2. 2. What is the gene name? 3. What is the GeneID number? 4. Where in the human genome is this gene located? 5. What is the RefSeq accession number for the mRNA sequence of Homo sapiens prostaglandin-endoperoxide synthase 2?__________________. 6. What is the RefSeq accession number for the Homo sapiens PTGS2 protein sequence (called “product”)?_________________. 7. Open the RefSeq protein entry, then choose “FASTA” from the pulldown menu. Copy the sequence (including the title line designated by the “>” symbol) and paste it into a Word document. 8. In Word, select the “Replace” tool under the EDIT menu. In the “find” box, type “^p” to find all paragraph marks. Don’t type anything into the “replace” box. Then click “Replace All.” This will eliminate all the paragraph marks in the document. If you still see white spaces in the sequence, use the same procedure, but type “^w” in the “find” box to represent white spaces. 9. You now should add back a paragraph mark (hit return) after the title line (that starts with “>”) and before the sequence starts. Save the file as PTGS2prot.doc on your desktop. Note: Please review the entry for “FASTA” in the Glossary (at the end of this manual). Understanding this definition will be very important for working with the bioinformatics programs. To learn more, go to: http://www.ncbi.nlm.nih.gov/blast/fasta.shtml 81 Swiss-Prot Entry 10. Go to the Expasy website and search for the Swiss-Prot entry for PTGS2. (Hint: use the gene name to search and be sure to select the HUMAN protein from the search results). 11. Write at least three alternate names for this protein. 12. Where in the cell is this protein located? 13. What types of drugs target this protein? Sequence Manipulation 14. Go to the Sequence Manipulation Suite (http://bioinformatics.org/sms/). 15. Click on “Translate” under “DNA Analysis” heading from the menu. 16. Clear the data entry box by hitting “Clear”. 17. Copy the mRNA sequence from your Word file and Paste it into the data entry box on the Sequence Manipulation website. 18. Select “Reading Frame 3” and “direct” from the pull-down menus, then click “Submit”. 19. When the Output window opens, review it and make sure it doesn’t contain too many stop codons (which are represented with a “*”). 20. Compare this sequence with the sequence in the “PTGS2prot.doc”. What are the first residues that are the same in the sequences? Do the sequences look like they are the same? (Hint: protein sequences should start with a methionine, M.) 82 Part 2 – Multiple Sequence Alignment with ClustalW 21. On the course website under “COX2 Tutorial”, there should be a file called “ClustalWseq”. Click on this link to download the pdf file to your desktop. To open the file in Acrobat Reader, hold the curser over the file on your desktop, hold down the “Control” key on your keyboard as you click and hold with the mouse. A list of programs should appear on your desktop. Highlight “Adobe Acrobat,” then release the mouse. The file should open in Adobe Acrobat. If you have the file open in “Preview,” you will not be able to select multiple pages of text at the same time, so close the file and open it in Acrobat Reader as described above. Using the “Text select” tool, select all the text and copy or go under the “Edit” menu and “Select All.” This file contains six FASTA formatted sequences of PTGS2 from different organisms. The top sequence is the human PTGS2 protein sequence you have been working with. 22. Go to the ClustalW website and enter (by using “copy” and “paste”) all of the FASTA formatted sequences into the data entry box. Select “input” for the Output order so the human sequences will stay at the top in the alignment. Press “Run.” 23. Copy the alignment and paste it into a Word document. To make this file readable, do the following things: a. Go to “Page Set-up” under “File” and change the page orientation to landscape. b. Select all text and change to “Courier” font, size 10. Courier is the best font for alignments because all the letters are the same width. c. Save this file to the desktop as “ClustalW.doc” 24. Review the alignment. What symbols are used for positions in the alignment that contain identical, highly homologous, homologous, and non-homologous residues? Are the residue numbers mentioned in steps 14 and 15 conserved? Why would you expect them to be conserved? 83 Laboratory 2 - Project Research Guide Sheet 1 Working with Primary Protein Structure Information The research projects begin by using some of the same tools you used last week in the COX-2 tutorial. Guide Sheet 1 will provide directions for the web-based research you will be doing today. First you will search for your gene in the Gene database. Gene will contain the RefSeq sequence for your protein, which you will download in FASTA format. FASTA format is defined in your Glossary. Be sure to review this definition before you begin on Guide Sheet 1. You will continue to learn about your protein using the SwissProt database. The SwissProt database is a database maintained by the Swiss Bioinformatics Institute and contains entries for thousands of proteins. You can search for the protein you are studying by using the gene name given in Gene. The SwissProt entry contains some of the same information that you found in Gene, but also contains a lot of information about the protein sequence, structure, and function that is summarized in a fairly easy-to-read format. The ultimate goal for today’s lab is to create a multiple sequence alignment for your protein using ClustalW. You will use this alignment to identify the protein mutation, to observe regions of high sequence conservation, and to map secondary structure predictions. The protein mutation is important to identify since it is the basis for your project and is important for understanding the link between the protein and the disease you are studying. The regions of high sequence conservation are important because they often correspond to regions in the protein that are important to the protein’s function. Next week, you will be studying the crystal structure of your protein and will be able to check the accuracy of the secondary structure predictions that you mapped onto your alignment. The program used for making the secondary structure predictions is called PSIPRED. If your protein is a membrane protein, you will also have MEMSTAT predictions, which predict which regions of your protein, are imbedded in a membrane. 84 Guide Sheet 1 Part 1 – Obtaining the basics: Getting sequence information and viewing the SwissProt and GenBank entries for your protein Directions: Follow this guide sheet and answer the questions in your project packet that accompany this guide as you work through each section. Be sure to refer to hints specific to your project in your project packet. Translating your patient’s cDNA 1. Obtain the mutant cDNA sequence from the course website. Open the file and copy the sequence. 2. Go to the Sequence Manipulation Site (http://bioinformatics.org/sms/). 3. In the menu to the left, Click on “Translate” found under the heading “DNA analysis”. 4. Clear the search box, then paste your patient’s cDNA sequence into the search box. Choose a reading frame from the pull-down menu (the project manuals indicate which reading frame to choose). Click “Submit.” 5. You should be able to find the sequence of your protein by finding the first methionine (M), then continuing until you see the first “*” which is a stop codon. Copy the protein sequence in that region, starting with the first “M” and paste it into a word document. Save the document in your folder on the desktop. Now you have saved the file of the mutant protein sequence. NCBI – Gene 6. Using Gene on the NCBI website, find the entry for the protein you are studying by searching with the protein name. Be sure to select the Homo sapiens protein from the list of results. Answer question 1. 85 7. Open the entry for the RefSeq protein sequence and save the sequence in FASTA format to your desktop. Name this file “wildtypeprot.doc” so that you will know it is the un-mutated protein sequence. Swiss-Prot Entry 8. Go to the ExPASy website and search for the SwissProt entry for your protein using either the protein name the gene name. Be sure to select the human protein from the list of results. Make sure the information in the entry is the same as you saw in the Gene entry. If your protein is an enzyme, the EC number is a good way to double-check. You may want to record the SwissProt entry number in case you want to find this entry again. Part 2: BLAST sequence: Finding homologous proteins Protein-protein BLAST 9. Perform a BLAST search using the RefSeq protein sequence (the unmutated protein sequence). First, select PSI-PHI BLAST. Then paste the FASTA formatted protein sequence in the search box. Select the nrprotein database. Click “BLAST” to begin. On the next page that appears, select “Format.” You may need to wait a few minutes before the results page opens. After obtaining the results, choose 5 sequences from various positions in the results. The goal is to choose a variety of sequences that differ in evolutionary distance from the human protein. Be sure not to choose any sequences that are human, since they are the same as your search sequence. For each of the five sequences, click on the sequence name to view the GenBank entry for the sequence. Then view the sequence in FASTA format. Copy and paste all the FASTA formatted sequences into the same Word file and save it to the desktop. At the beginning of this file, add your mutant protein sequence, also in FASTA format. 10. This Word file will be used to create the multiple sequence alignment, so the formatting is very important. The format for this file should be like the example used in the tutorial for COX-2. Review the entry for FASTA format in the Glossary. You should end up with a Word file that contains the 5 sequences from the BLAST search plus the un-mutated human protein sequence and your mutant sequence for a total of 7 sequences. Each sequence should be in FASTA format and contain a title line (starting with >, then text, then a return). Shorten the text to contain JUST the species information so it will fit in the alignment (next step). For example, you should erase the “gi” line and add in something simpler like “pig,” “cow,” etc. Your mutant sequence should read “>mutant”. At the 86 end of each title, be sure to press return to separate it from the rest of the sequence. Part 3 – Multiple Sequence Alignment 11. Go to the ClustalW website and enter (by using “copy” and “paste”) all your FASTA formatted sequence into the data entry box. The default parameters will work for us, except for the output order. a. Select “input” for the Output order b. Press “run” 12. Save the alignment by copying and pasting the alignment into a Word document. At first the alignment will look broken up. Follow these steps to make it readable again. a. Select all b. Change the font to size 10 and Courier c. Change the page set-up to landscape d. Save the file to your desktop 13. Scroll through the alignment and make sure none of the blocks of sequences are separated by a page break. Save and print the alignment. We will be working with this alignment next week and it will be part of your final report. 14. On your alignment, compare the wild-type protein sequence with your mutant sequence and mark any differences that you observe. Secondary Structure Predictions – using PSIPRED 15. To save time, the PSIPRED and MEMSTAT predictions for your protein are already available for you in the same format as we received them. Obtain these results from the course website under your project name and print them out. The Glossary provides more information on how to submit requests for PSIPRED and MEMSTAT predictions if you want to use these tools outside this course. From this lab, you should save the multiple sequence alignment for your Final Report. The multiple sequence alignment should have the secondary structure predictions and the mutation mapped. NEXT WEEK: Do the structure problem set for next week. It’s important to view the problem set online from the lab website in order to see the structures in color! 87 Laboratory 3 - Project Research Guide Sheet 2 Investigation of the Crystal Structure The goal of this lab is to analyze the crystal structure for your protein in order find the functionally important areas of your protein and predict the effect of the mutation on protein function. You may determine that the mutation makes the protein more active, less active, or that the mutation is likely to have no effect of protein function. To do these investigations, we will be using the protein structure-viewing program called DeepView/Swiss-Pdb Viewer. This program has some similarities to programs you have used in previous course, like Chime, and Protein Explorer. DeepView can easily model mutations and is easy to learn to use in one day. It is not a web-based program, but both the Mac and PC versions can be downloaded for free from the ExPASy website. There are also DeepView tutorials available for free from the ExPASy website. At the beginning of lab, a brief tutorial on how to use DeepView will be given using the COX-2 protein. The first step today is to obtain the data file of the crystal structure for your protein. In some cases, the crystal structure has not been solved for your exact protein, so you will analyze the structure and model the mutation in a homologous protein. In this case, the amino acid numbering will be slightly different, but details are given in your project packets. The data files for crystal structures are called pdb files and are all stored in the Protein Data Bank. You can search the Protein Data Bank website for your protein and download the pdb file to your desktop. In analyzing the crystal structure, the focus is on identifying the noncovalent interactions (Van der Waals, H-bonds, and ionic bonds) of the amino acid side chain before and after it is mutated. How do the interactions change or how are they maintained when the amino acid is mutated? Be sure to review both the distances and nature of these non-covalent interactions before coming to this lab. Crystal structures are the results of experiments, and so it is important to consider experimental error when analyzing a crystal structure. One way experimental error is reported in a crystal structure is by giving the resolution. Resolution of a crystal structure is the accuracy of the prediction of each atom location. For example, if the resolution is 2 angstroms, you can be confident that the atom is located within a 2 angstrom radius of where it is shown in the pdb file. This is important to consider when measuring distances, since each distance measured in a crystal structure is actually ± the resolution of the crystal structure. 88 Tool Bar Transverse Movement (along plane of screen) Rotate Zoom Distance Tool measure the distance between two atoms in Angstroms Identity Tool - the residue type and number will appear in the display for the atom you select Radius Tool - the side chains of residues within a certain distance (you specify) will appear around the atom you select. Re-center Tool - the display will re-center around the atom you select and will rotate around that atom when you use the rotate tool. Mutate Tool - When you click on this tool, then an atom in the display, you can mutate a side chain to another type and it will appear in the display in the lowest energy conformation 89 Panel under : “Prefs” “Ribbons” Make sure there is a check in this box to show a cartoon ribbon in the display In this panel, you can choose a different color for each side of the 3-dimensional ribbon as well as different colors for the helices, sheets, and coils (loops). There should be checkmarks as shown here for each color box. When you click on the “Color” button, a panel appears to choose a color. It is a good idea to choose the same color for each of the top, bottom and sides of all helices. You don’t need to change any of the other settings in this panel for this course. Click “OK” when finished. 90 Control Panel This column needs to be checked to show a substrate or an amino acid side chain. Pdb file name Display side chain of amino acid Show label for residue and number in display Show spacefilling dots for this residue Chain secondary structure: h = helix b = beta sheet Checkmarks make residues show up in ribbon format in the display amino acid three letter code residue number Clicking on the box and selecting a color will color the amino acid. Leaving it as is will be cpk color mode. (see Glossary) 91 Guide Sheet 2 Obtaining 3D Structure Information: Searching for Structure Files (pdb files): Data files that contain the three-dimensional coordinates for protein structures are called “pdb” files. Pdb files are named in 4 characters (numbers and letters). These files are stored in the Protein Data Bank (www.rcsb.org/pdb). You can perform a search to find the structure you are looking for either from the NCBI website using their “Structure” database, or search directly at the Protein Data Bank website. The search results are a little easier to understand from the NCBI website. If you know exactly what you are looking for, you can directly search the Protein Data Bank. Follow the directions in your project manuals to get directions for finding the pdb file and analyzing the structure in Swiss-Pdb Viewer/DeepView. 92 Laboratory 4 - Project Research Guide Sheet 3 Investigating the biological impact of the mutation - using the OMIM and KEGG databases For this lab session, your research will focus on investigating the biological impact of the mutation you are studying. To do this, you will use the OMIM and KEGG websites. OMIM stands for the Online Mendelian Inheritance in Man database. The OMIM database was started at John Hopkins University and is now maintained by NCBI and can be found through a link on the NCBI homepage. The OMIM database contains entries for both diseases with known genetic links and entries for the genes that have been linked to a disease. Each OMIM entry is a summary of the research that has been performed on the disease or gene and contains links to the research articles that it summarizes. You will be able to read about the clinical and biochemical research that has been performed related to the mutation you are studying. Each link in the OMIM entry will open an abstract from the PubMed database. PubMed is a literature database, and is also maintained by NCBI. PubMed is a searchable database of medical and life science journal articles. Most of the abstracts for these articles can be accessed through PubMed, but in order to access the entire article, you need to go to each individual journal website and have a subscription to the journal. The WashU library system has subscriptions to electronic versions of many of these journals that you can access through the E-journal link on the WashU library home page. Most journals have their articles available online as .pdf files for articles published between 1995 to present. However, the older articles must still be accessed through the paper versions stored in libraries. The other database that you will be using is the KEGG database. KEGG stands for the Kyoto Encyclopedia of Genes and Genomes. It is a database of metabolic pathways that is maintained by a research institute in Japan. It contains all the known metabolic and signaling pathways. Each protein in the pathway and each small molecule metabolite (ex. ATP) has its own entry in the database that can be accessed by clicking on the protein or metabolite in the pathway figure. By using this website, you can make predictions about what would happen to downstream events in the pathway if the protein you are studying is either less active or more active. In order to write the summary for your final report, you will need to use the information from this lab’s research as well as what you learned from studying the crystal structure of your protein in order to draw a conclusion about what the biological impact is of the mutation you are studying. This conclusion should be supported by the web-based research you have performed, but will most likely require further clinical and biochemical research in order to be proven or disproved. 93 Guide Sheet 3 Using Web-based resources to investigate the biological impact of the mutation and its possible role in human disease OMIM search: The OMIM (Online Mendelian Inheritance of Man) database contains short, referenced reviews about genetic loci and genetic diseases. It can be a very useful resource for finding out what type of research has been done on a gene or a disease. KEGG search: The Gene entry for your gene that you viewed during the first day of your research (Guide Sheet 1) will contain a link for the KEGG pathway(s) related to your protein. Scroll down to the “Additional Links” and select the “KEGG pathway” link. KEGG stands for Kyoto Encyclopedia of Genes and Genomes. The www.kegg.com site contains a database of metabolic maps. Follow the directions in your project manuals to answer Guide Sheet 3 questions. 94 Glossary. BLAST – Basic Local Alignment Search Tool – A program that compares a sequence (input) to all the sequences in a database (that you choose). This program aligns the most similar segments between sequences. BLAST aligns sequences using a scoring matrix similar to BLOSUM (see entry). This scoring method gives penalties for gaps and gives the highest score for identical residues. Substitutions are scored based on how conservative the changes are. The output shows a list of sequences, with the highest scoring sequence at the top. The scoring output is given as an E-value. The lower the E-value, the higher scoring the sequence is. E-values in the range of 1^-100 to 1^-50 are very similar (or even identical) sequences. Sequences with E-values 1^-10 and higher need to be examined based on other methods to determine homology. An E-value of 1^-10 for a sequence can be interpreted as, “a 1 in 1^10 chance that the sequence was pulled from the database by chance alone (has no homology to the query sequence).” This program can be accessed from the NCBI homepage or: http://www.ncbi.nlm.nih.gov/BLAST Reference: Altschul, S.F., Gish, W., Miller, W., Myers, E.W. & Lipman, D.J. (1990) "Basic local alignment search tool." J. Mol. Biol. 215:403-410. BLOSUM – Block Scoring Matrix - A type of substitution matrix that is used by programs like BLAST to give sequences a score based on similarity to another sequence. The scoring matrix gives a score to conservative substitutions of amino acids. A conservative substitution is a substitution of an amino acid similar in size and chemical properties to the amino acid in the query sequence. Discussed in the Berg text, p.175 – 178. Bioinformatics - Bioinformatics is a field of study that merges math, biology, and computer science. Researchers in this field have developed a wide range of tools to help biomedical researchers work with genomic, biochemical, and medical information. Some types of bioinformatics tools include data base storage and search programs as well as software programs for analyzing genomic and proteomic data. ClustalW – A program for making multiple sequence alignments. http://www.ebi.ac.uk/clustalw/index.html W. R. Pearson (1990) “Rapid and Sensitive Sequence Comparison with FASTP and FASTA” Methods in Enzymology 183:63 - 98. 95 Conserved – when talking about a position in a multiple sequence alignment, “conserved” means the amino acid residues at that position are identical throughout the alignment. Conservative residue change – when talking about a position in a mulitple sequence alignment, a “conservative change” is when there is a change to a homologous amino acid residue. cpk coloring mode - This coloring mode colors based on atom identity: red = oxygen blue = nitrogen orange = phosphorous yellow = sulfur gray = carbon DeepView/Swiss-Pdb Viewer – a program for viewing 3-D structures. It loads “.pdb” files, which contain the 3-D coordinates for molecular structures. SwissPdb Viewer is easy and free to download on any computer (Mac or PC) and can be used no matter what Browser you are using. It is fairly easy to learn to use at the basic level, however, it also has very advanced capabilities that can be useful in research. It is also a nice program to use with PovRay, which allows you to make graphic files from pdb information. This is important when making figures for a presentation, report, or journal article. If you would like to download SwissPdb Viewer for your own computer, the program is available for free and is easy to download from the website, “us.expasy.org/spdbv”. A help manual is also available here if you have further questions that aren’t addressed in this course. http://us.expasy.org/ To run this program with Mac OSX, you must first change the monitor settings. a. Open “System Preferences” on your computer. b. Double click on the “Displays” icon. c. On the right-hand side of the panel, choose “thousands” of colors from the list (changing it to “thousands” from “millions”). d. Then close System Preferences and then open Swiss-Pdb Viewer. Names of some other structure viewing programs: RasMol (www.openrasmol.org) Kinemage (www.kinemage.biochem.duke.edu) Protein Explorer (www.proteinexplorer.org) EC number - Enzyme Committee number - Given by the IUBMB (International Union of Biochemistry and Molecular Biology) classifies enzymes according to the reaction catalyzed. An EC Number is composed of four numbers divided by 96 a dot. For example the alcohol dehydrogenase has the EC Number 1.1.1.1 ExPASy – Expert Protein Analysis System - A server maintained by the Swiss Institute of Bioinformatics. Home of SWISS-PROT, the most extensive and annotated protein database. The Swiss-Pdb Viewer protein-viewing program is also available at this site for free download. http://us.expasy.org/ FASTA – A way of formatting a nuleic acid or protein sequence. It is important because many bioinformatics programs require that the sequence be in FASTA format. The FASTA format has a title line for each sequence that begins with a “>” followed by any needed text to name the sequence. The end of the title line is signified by a paragraph mark (hit the return key). Bioinformatics programs will know that the title line isn’t part of the sequence if you have it formatted correctly. The sequence itself does NOT have any returns, spaces, or formatting of any kind. The sequence is given in one-letter code. An example of a protein in correct FASTA format is shown below: >K-Ras protein Homo sapiens MTEYKLVVVGAGGVGKSALTIQLIQNHFVDEYDPTIEDSYRKQVVIDGETCLLDI LDTAGQEEYSAMRDQYMRTGEGFLCVFAINNTKSFEDIHHYREQIKRVKDSEDVP MVLVGNKCDLPSRTVDTKQAQDLARSYGIPFIETSAKTRQGVDDAFYTLVREIRK HKEKMSKDGKKKKKKSKTKCVIM To learn more, go to: http://www.ncbi.nlm.nih.gov/blast/fasta.shtml GenBank - a database of nucleotide sequences from >130,000 organisms. This is the main database for nucleotide sequences. It is a historical database, meaning it is redundant. When new or updated information is entered into GenBank, it is given a new entry, but the older sequence information is also kept in the database. GenBank belongs to an international collaboration of sequence databases, which also includes EMBL (European Molecular Biology Laboratory) and DDBJ (DNA Data Bank of Japan). In contrast, the RefSeq database (see entry) is non-redundant and contains only the most current sequence information for genetic loci. The GenBank database can be searched at the NCBI homepage: http://www.ncbi.nlm.nih.gov/ Gene – an NCBI database of genetic loci. This database used to be called LocusLink. Entries provide links to RefSeqs, articles in PubMed, and other descriptive information about genetic loci. The database also provides information on official nomenclature, aliases, sequence accession numbers, phenotypes, EC numbers, OMIM numbers, UniGene clusters, map information, 97 and relevant web sites. Access through the NCBI homepage by selecting “Gene” from the Search pulldown menu. Genome – The entire amount of genetic information for an organism. The human genome is the set of 46 chromosomes. Homologous – When referring to amino acids, a homologous amino acid is similar to the reference amino acid in chemical properties and size. For example, glutamate can be considered homologous to aspartate because both residues are roughly similar in size and both residues contain a carboxylic acid moiety which gives them similar chemical properties. KEGG – Kyoto Encyclopedia of Genes and Genomes – This website is used for accessing metabolic pathways. At this website, you can search a process, gene, protein, or metabolite and obtain diagrams of all the metabolic pathways associated with your query. You will see a link to the KEGG entry at the end of the Gene entry for a gene. http://www.genome.ad.jp/kegg/ NCBI – National Center for Biotechnology Information – This center was formed in 1988 as a division of the NLM (National Library of Medicine) at the NIH (National Institute of Health). As part of the NIH, NCBI is funded by the US government. The main goal of the center is to provide resources for biomedical researchers as well as the general public. The center is continually developing new materials and updating databases. The entire human genome is freely available on this website and is updated daily as new and better data becomes available. The NCBI homepage: http://www.ncbi.nlm.nih.gov NCBI also maintains an extensive education site, which offers online tutorials of its databases and programs: http://www.ncbi.nlm.nih.gov/About/outreach/courses.html OMIM - Online Mendelian Inheritance in Man – a continuously updated catalog of human genes and genetic disorders, with links to associated literature references, sequence records, maps, and related databases. Access through the NCBI homepage or: http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=OMIM Protein Data Bank – (PDB) – A database that contains every published 3-D structure of biological macromolecules. It contains mostly proteins, but also DNA and RNA structures. Also see RCSB. http://www.rcsb.org/pdb/ 98 A pdb file is a file containing the three-dimensional coordinates (x,y,z) for each of the atoms in the protein. This type of file is made using the data obtained from either an X-ray crystallography experiment or an NMR experiment. Once you have pdb file of a protein, you can open the file in various structure viewing programs to view the protein structure. Proteome – the entire set of expressed proteins for an organism. This term is commonly used to discuss the set of proteins that are expressed in a certain cell type or tissue under specific conditions. PSIPRED – a server for predicting secondary structure from protein sequences. The predictions are made based on a database of known secondary structures for protein sequences. These predictions are estimated to be correct ~80% of the time. This server can also be used to predict transmembrane segments. http://bioinf.cs.ucl.ac.uk/psipred/ McGuffin LJ, Bryson K, Jones DT. (2000) The PSIPRED protein structure prediction server. Bioinformatics. 16, 404-405. Jones DT. (1999) Protein secondary structure prediction based on positionspecific scoring matrices. J. Mol. Biol. 292: 195-202. PubMed – a retrieval system containing citations, abstracts, and indexing terms for journal articles in the biomedical sciences. PubMed contains the complete contents of the MEDLINE and PREMEDLINE databases. It also contains some articles and journals considered out of scope for MEDLINE, based on either content or on a period of time when the journal was not indexed, and therefore is a superset of MEDLINE. http://www.ncbi.nlm.nih.gov/ RCSB – Research Collaborative for Structural Bioinformatics – A non-profit consortium that works to provide free public resources and publication to assist others and further the fields of bioinformatics and biology dedicated to study of 3D biological macromolecules. Members include Rutgers, San Diego Supercomputer Center, University of Wisconsin, and CARB-NIST (at NIH). RefSeq - NCBI database of Reference Sequences. Curated, non-redundant set including genomic DNA contigs, mRNAs, proteins, and entire chromosomes. Accession numbers have the format of two letters, an underscore bar, and six digits. Example: NT_123456. Code: NT, NC, NG = genomic; NM = mRNA; NP = protein (See NCBI site map for more of the two letter codes). 99 Sequence Manipulation Suite – a website that contains a collection of webbased programs for analyzing and formatting DNA and protein sequences. http://bioinformatics.org/sms/ SNP = Single Nucleotide Polymorphism. synonymous change– The nucleotide change results in NO change in amino acid. non-synonymous change – The nucleotide change DOES result in a change in amino acid. heterozygosity – A measure of the genetic variation in a population with respect to one locus. Stated as the frequency of heterozygotes for that locus. 100 Website List for Bio 280 http://www.nytimes.com The Tuesday edition of the New York Times has a Science section. You can also search for past science and health-related articles at this site. You have to register, but registration is free. http://www.madsci.org/libs/libs.html A resource for getting started using science resources on the WWW. http://science.education.nih.gov/ NIH educational resources on many genetics and health-related topics. http://www.genome.gov Homepage for the National Human Genome Research Institute (NHGRI) http://www.doegenomes.org/ Genome program of the U. S. Department of Energy Office of Science: There is a lot of helpful information about different genome projects. http://www.the-scientist.com/home This website offers coverage of the latest developments in life sciences research, technology and business. You have to register, but registration is free. http://www.sciam.com/ This is the web of Scientific American magazine which contains good reviews of current research in science and health and is written for a broad audience. http://www.sciencemag.org/ This is the Science magazine which publishes the most up-to-date discoveries of scientific fields. Written for a scientific audience. http://www.nature.com/ This website is the Nature magazine. Both the Science and the Nature cover the newest scientific information. Written for a scientific audience. http://www.cell.com/ The Cell magazine contains a lot of latest discoveries of biological fields. Written for a scientific audience. http://www.science.gov/ Use the browsable index for a broad range of topics from Agriculture and Food to Science Education. Searchable by keyword. http://science.howstuffworks.com/ 101 Searchable database for simple explanations on a wide range of topics, including cell biology, DNA science, and genetics. Easy to understand. A good place to start, but explanations are aimed at a K – 12 audience. http://www.biology.arizona.edu/ Online resources to supplement biology courses. http://www.cellsalive.com/toc.htm Animations for cell biology. http://www.accessexcellence.org/AB/GG/ A searchable database of curriculum tools for K – 16. http://www.hhmi.org/biointeractive/ Howard Hughes Medical Institute online curriculum resources on health-related topics. http://www.hgc.gov.uk/business_consultations2maintext.rtf Background information for identification of individuals: Unit 1 A discussion document on the storage, protection and use of personal genetic information from Great Britain. http://www.biology.washington.edu/fingerprint/dnaintro.html This page was created as a class project at the University of Washington to provide to the Internet basic information on the structure and function of DNA as it relates to DNA fingerprinting. http://whyfiles.org/ Searchable database for articles on the “science behind news.” http://www.sciencenews.org/ A weekly online news magazine from the journal Science. News on current research written for a broad audience. http://www.geneticalliance.org/DIS/ A public information search tool for genetic disease information http://www.bbc.co.uk/go/science/news1a/-/genes/ Track the major developments in gene science since Charles Darwin published "The Origin of Species": http://www.nabt.org/sub/contact/reps.asp The National Association of Biology Teachers (NABT) http://www.nsta.org/chapters The National Science Teachers Association (NSTA) 102 http://www.exploratorium.edu/edunews From the Exploratorium in San Francisco. Aimed at K-12 students and teachers. http://sciwebserver.science.mcmaster.ca/biology/CBCN/genetics/mac_dnafp.htm A short article about DNA fingerprinting and conservation genetics http://www.blackwell-synergy.com/links/doi/10.1046/j.15231739.1994.08030744.x/abs/ A short abstract which demonstrates uses for DNA fingerprinting in conservation genetics http://www.rothamsted.bbsrc.ac.uk/notebook/courses/guide/ A Beginner's Guide to Molecular Biology. Great pictures and explanations for basic molecular biology concepts. http://jura.wi.mit.edu/bio/education/BOA-2000/left.htm Has several great links to background information in DNA science and protein science. IMPORTANT: Note that all of the above web sites are maintained by academic sources, government agencies, or reviewed journals. These sources generally subject the work that they post to peer review, and are usually reliable- although mistakes do occur! Be aware that many other web sites are set up by people who do not set a high standard for accuracy, or who have a particular opinion on a topic. Please check the source of any web site that you use as a reference for your papers. 103 GENETIC GLOSSARY allele: one of two or more forms of a gene that can exist at a single locus. chromosome: a structure in the nucleus of a eukaryotic cell that contains a linear array of many genes. A chromosome is composed of a single DNA double helix molecule wound around many protein molecules that stabilize it and regulate its function. codominant: refers to a pair of alleles, both of which exert an effect on the phenotype when they are present together. In codominance, the heterozygote has a phenotype different from that of either homozygote and sometimes (but not always) is intermediate in phenotype. diploid: having two complete sets of chromosomes, one set derived from the mother and one from the father. dominant: refers to an allele that has the same effect on the phenotype whether it is present in the homozygous or heterozygous condition. (Thus, if A is a dominant allele, individuals with the AA and Aa genotypes have the same phenotype.) genotype: the specific combination of alleles that an individual possesses at one or more loci. haploid: having only one set of chromosomes (as in a sperm or egg nucleus). heterozygous: having two different alleles at a particular locus. homozygous: having two identical alleles at a particular locus. incomplete dominance: a form of codominance in which the heterozygote is about half-way between the two homozygotes in phenotype. (For example, if homozygous plants have red or white flowers and the heterozygous plant has pink flowers, the situation is sometimes called incomplete dominance. But it is just a special type of codominance.) locus: a region of a chromosome or DNA molecule where a particular kind of gene, coding for a particular kind of protein, is located. Different variants at a single locus are known as alleles. meiosis: the “reduction division” in which a diploid cell divides to produce haploid cells that will function as gametes (eggs or sperm). 104 phenotype: the outward appearance of an individual with respect to one or more traits that is associated with some particular genotype. recessive: refers to an allele that has no effect on the phenotype unless it is present in the homozygous condition. recombination: the process in which two haploid sets of chromosomes are brought together in a pair of gametes to produce a new diploid offspring. Usually this new diploid will be different in genotype from both of its parents. 105