* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download Bio research bio and fromatics lab - BLI-Research-Synbio
Molecular cloning wikipedia , lookup
Copy-number variation wikipedia , lookup
Cell-free fetal DNA wikipedia , lookup
Zinc finger nuclease wikipedia , lookup
Saethre–Chotzen syndrome wikipedia , lookup
Pathogenomics wikipedia , lookup
Gene therapy of the human retina wikipedia , lookup
Transposable element wikipedia , lookup
Human genetic variation wikipedia , lookup
Epigenetics of diabetes Type 2 wikipedia , lookup
Cre-Lox recombination wikipedia , lookup
Human genome wikipedia , lookup
No-SCAR (Scarless Cas9 Assisted Recombineering) Genome Editing wikipedia , lookup
Frameshift mutation wikipedia , lookup
Non-coding DNA wikipedia , lookup
Metagenomics wikipedia , lookup
Genome evolution wikipedia , lookup
Gene expression profiling wikipedia , lookup
Epigenetics of neurodegenerative diseases wikipedia , lookup
Neuronal ceroid lipofuscinosis wikipedia , lookup
Gene expression programming wikipedia , lookup
Gene desert wikipedia , lookup
Gene nomenclature wikipedia , lookup
Gene therapy wikipedia , lookup
Nutriepigenomics wikipedia , lookup
Point mutation wikipedia , lookup
Vectors in gene therapy wikipedia , lookup
Genetic engineering wikipedia , lookup
Public health genomics wikipedia , lookup
History of genetic engineering wikipedia , lookup
Genome (book) wikipedia , lookup
Genome editing wikipedia , lookup
Site-specific recombinase technology wikipedia , lookup
Therapeutic gene modulation wikipedia , lookup
Helitron (biology) wikipedia , lookup
Designer baby wikipedia , lookup
BLAST that Gene Student Name: ____________________ Adapted from Stephen H. Wefer, "The American Biology Teacher" Objective: In this activity you will use the website for the National Center for Biotechnology Information to identify a sequence of bases from a DNA sample. Background: The NCBI contains a database of genes from multiple organisms that have been sequenced and identified. The work of a number of scientists across a wide variety of research areas provides the information compiled in this database. The tool used in this activity is BLAST - Basic Logical Alignment Search Tool, which allows a user to submit a sequence of DNA bases (our old friends A, T, G and C) into the search engine, BLAST will search the database and find the gene (if any) associated with that sequence. Furthermore, the user can access information about the gene or the disease it causes. PreLab Questions: Answer in the blank space beneath each question 1. What are the four nucleotides that make up the DNA code? Adenine, Thymine, Cytosine, Guanine 2. What are the three parts of a nucleotide? 3. What is a gene? A gene is the set of DNA in a person that make up who they are. 4. Where do your genes come from? Genes come from parents DNA 5. Where are genes located? Genes are located in stem cells and sequences of DNA 6. Explain how DNA determines the traits of an organism. Your answer should include the words nucleotide, gene, codon, mRNA, ribosome and protein. Transcription is when RNA polymerase binds to a specific sequence called a promoter. When a codon is formed in a DNA sequence the ribosome comes along and attaches to the codon and helps create proteins. A gene is made up of many nucleotides which make up the sequence of DNA. 7. What could happen to an organism if it inherited a gene that coded for a defective protein? Why? The organism wouldn’t be able to make proteins or they themselves would have the defective protein because of the genetic traits passed down from parent to offspring. Instructions You will be given a nucleotide sequence found in real human DNA that is associated with a genetic disease when mutated. Your job is to compare the sequences you are given with the nucleotide sequence of most known genes, using the BLAST tool to search genetic databases. 1. Go to the homepage for the NCBI ( www.ncbi.nlm.nih.gov ) 2. Click on the word "BLAST" located in the list of links on the right of the page. 3. Scroll down until you find the heading "Nucleotide BLAST" and click the link. 4. This time we will practice using a part of the gene sequence that codes for hemoglobin. When a mutation occurs in this gene a person can wind up with Sickle Cell Anemia. Cut and Paste the sequence below into the blast window. GGG ATG AAT AAG GCA TAT GCA TCA GGG GCT GTT GCC AAT GTG CAT TAG CTG TTT GCA GCC TCA CCT TCT TTC ATG GAG TTT AAG ATA 5. When you have finished entering your sequence, click the radio button for OTHERS and then click on the BLAST button. 6. You will then see a screen asking you to wait 10-20 seconds. Don’t click on anything just relax and wait patiently for the search to conclude. 7. After the search has ended, scroll down past the box with all of the pink bars in it and find the words "Sequences Producing Significant Alignment". 8. Listed in order are the closest matches with your DNA sequence. You should notice that the blast search will return results for all genomes currently mapped (Several prokaryotes, humans, rats, chimpanzees, cows, pigs, chickens and puffer fish to name a few) 9. Please take a moment to be awed by the similarity found in the DNA code despite the outward physical diversity of organisms. 10. Click the Blue G block to far right of the first human listing to enter the gene information page. (If there is no G box click on the blue reference number preceding the closest match. This will tell you the name of the gene and its abbreviation if available). 11. You should now be on a screen that has the following information 12. This is the Gene Summary. It tells you that this sequence is officially recognized. HBB is the official Symbol for this gene. If you read further it will tell you that it is found in humans and that a mutated version of it causes Sickle Cell Anemia. 13. On the right you will see a link to PHENOTYPES. Click the link and look for information about Hb SS Disease under summary from GeneReviews. This will open more information about sickle cell. 14. Next click on the link to the “Map Viewer” in the right column under the “Links” heading. 15. Map Viewer page. Across the top of the page, you will notice the numbers 1-22 XY. These numbers represent the chromosomes found in humans. 16. While we are here lets take a moment to go over chromosomal nomenclature 11p15.5– This means that the HBB gene is found on chromosome 11 on the short (p) arm in region 15.5. the fancy word for this is Locus which is science talk for location 17. Your next step will be to click on the OMIM link in the pink information bar. This will take you to a page with lots of information about your gene and what it does. So that is your basic tour of the NCBI. There is a lot of other information about genes and DNA posted on this website some of which requires 3 or 4 PhDs to understand so we are going to move on. 18. Using the same steps as before you are going to explore the human genome. Your first gene sequence will be found by counting off by eights. Your second and (possibly third) gene sequence is of your own choosing but cannot be the same as the people directly to your left or right, front or back. 19. Fill out the attached data sheets for both of your gene sequences. Gene Sequences: You will be assigned 2 to 3 of the following sequences to BLAST Gene Sequence 1 ATG GCG ACC CTG GAA AAA GCT GAT GAA GGC CTT CGA GTC CCT CAA GTC CTT CCA GCA GCA GCA GCA GCA GCA GCA GCA GCA GCA GCA GCA GCA GCA GCA GCA GC Gene Sequence 2 ATG GCG GGT CTG ACG GCG GCG GCC CCG CGG CCC GGA GTC CTC CTG CTC CTG CTG TCC ATC CTC CAC CCC TCT CGG CCT GGA GGG GTC CCT GGG GCC ATT CCT GGT GGA GTT CCT GGA GGA GTC TT Gene Sequence 3 ATG CTC ACA TTC ATG GCC TCT GAC AGC GAG GAA GAA GTG TGT GAT GAG CGG ACG TCC CTA ATG TCG GCC GAG AGC CCC AGC CCG CGC TCC TGC CAG GAG GGC AGG CAG GGC CCA GAG GAT GGA G Gene Sequence 4 ATG TTT TAT ACA GGT GTA GCC TGT AAG AGA TGA AGC CTG GTA TTT ATA GAA ATT GAC TTA TTT TAT TCT CAT ATT TAC ATG TGC ATA ATT TTC CAT ATG CCA GAA AAG TTG AAT AGT ATC AGA TTC CAA ATC T Gene Sequence 5 ATG CGT CGA GGG CGT CTG CTG GAG ATC GCC CTG GGA TTT ACC GTG CTT TTA GCG TCC TAC ACG AGC CAT GGG GCG GAC GCC AAT TTG GAG GCT GGG AAC GTG AAG GAA ACC AGA GCC AGT CGG GCC Gene Sequence 6 ATG CCG CCC AAA ACC CCC CGA AAA ACG GCC GCC ACC GCC GCC GCT GCC GCC GCG GAA CCC GGC ACC GCC GCC GCC GCC CCC TCC TGA GGG ACC CAG AGC AGG ACA GCG GCC CGG AGG AC Gene Sequence 7 ATG TTG TGCAAT ATC CAT CTA CTG TAG TTA AGA TAT TCA GTA GTT TGT TTT TCA TAA GCA TGT AAT TGA TCA TAT TTC TGC CAA GGA TGT GCC TTC AAC TTT ATA ATT ATA GTG TTG TAA AAT ATT TTT GTC TG Gene Sequence 8 ATG CCA TCT TCC TTG ATG TTG GAG GTA CCT GCT CTG GCA GAT TTC AAC CGG GCT TGG ACA GAA CTT ACC GAC TGG CTT TCT CTG CTT GAT CAA GTT ATA AAA TCA CAG AGG GTG ATG GTG GGT GAC CTT Adapted from: Wefler, S. H. 2003. Name that gene: An authentic classroom activity incorporating bioinformatics. . The American Biology Teacher 65:8610–613. BLAST Data Sheet Name: Period: Type your answers directly into the tables below #1 Gene Sequence Number Abbreviation or name of gene Genetic disease associated with gene Chromosome number (location of gene) Describe symptoms of disease #2 Gene Sequence Number Abbreviation or name of gene Genetic disease associated with gene Chromosome number (location of gene) Describe symptoms of disease 3 Psen2 alzheimers 1q42.13 Emotional behavior, language, memory, perception, thinking and judgment 7 ATP7A Menkes syndrome Xq21.1 weak muscle tone (hypotonia), sagging facial features, seizures, mental retardation, blue sclera and developmental delay. #3 Gene Sequence Number Abbreviation or name of gene Genetic disease associated with gene Chromosome number (location of gene) Describe symptoms of disease Once you have completed your BLAST searches answer the questions on the next page. Analysis 1. In a BLAST search how do you know which sequence of DNA was closest to your unknown sequence? The percentage on the right side would tell you how close it was to matching. 2. If you had more nucleotides in your sequence to enter into BLAST, (say 1000 instead of 100), do you think you would get more specific or less specific matches? Why? Less specific because there is less of a chance that another sequence would have both nucleotides. 3. Why do scientists all over the world check BLAST when they sequence a new region of DNA? They’re curious to know if there is another sequence like that already in existence. If not it’s a great find and could lead to new discoveries but if it’s already been discovered then it saves them time from researching it more. 4. Why would a biologist studying human genetic disorders want to see related genetic entries for mice, cats, dogs etc? To compare the genetic sequences and find any abnormalities or similarities. 5. How does the information in the NCBI help to illustrate the interrelation of all organisms? Explain. It shows how a lot of the genetic code is similar but the difference of a couple of letters can allow two organisms to be more similar or less and it shows the percentage of similarity within each genetic sequence. 6.What do you think the potential consequences are for insurance companies knowing DNA sequences of individuals? Should our genetic information be shared with companies, employers, future spouses, doctors? Why or Why not? DNA sequences reveal a lot of about a person and what genetic traits they have. If insurance companies got hold of each individuals they’d know who is more likely to have a terminal disease and therefore live a shorter life effecting there insurance policy. No, genetic information should remain privet to the individual and only be shared if they want. It can effect how people see you if you happen to have a bigger chance at having a disease or disorder. And if you’re a carrier of a disease or disorder then some people wouldn’t want to have kids with you because they don’t want their kids to have those disorders.