Download Bio research bio and fromatics lab - BLI-Research-Synbio

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Molecular cloning wikipedia , lookup

Copy-number variation wikipedia , lookup

Cell-free fetal DNA wikipedia , lookup

Mutation wikipedia , lookup

Zinc finger nuclease wikipedia , lookup

Saethre–Chotzen syndrome wikipedia , lookup

Pathogenomics wikipedia , lookup

Gene therapy of the human retina wikipedia , lookup

Transposable element wikipedia , lookup

Human genetic variation wikipedia , lookup

Epigenetics of diabetes Type 2 wikipedia , lookup

Cre-Lox recombination wikipedia , lookup

Human genome wikipedia , lookup

No-SCAR (Scarless Cas9 Assisted Recombineering) Genome Editing wikipedia , lookup

Frameshift mutation wikipedia , lookup

Non-coding DNA wikipedia , lookup

Metagenomics wikipedia , lookup

Genome evolution wikipedia , lookup

Gene expression profiling wikipedia , lookup

Epigenetics of neurodegenerative diseases wikipedia , lookup

Neuronal ceroid lipofuscinosis wikipedia , lookup

Gene expression programming wikipedia , lookup

RNA-Seq wikipedia , lookup

Gene desert wikipedia , lookup

Gene nomenclature wikipedia , lookup

Gene therapy wikipedia , lookup

Nutriepigenomics wikipedia , lookup

Genomics wikipedia , lookup

Point mutation wikipedia , lookup

Vectors in gene therapy wikipedia , lookup

Genetic engineering wikipedia , lookup

Gene wikipedia , lookup

Public health genomics wikipedia , lookup

History of genetic engineering wikipedia , lookup

Genome (book) wikipedia , lookup

Genome editing wikipedia , lookup

Site-specific recombinase technology wikipedia , lookup

Therapeutic gene modulation wikipedia , lookup

Helitron (biology) wikipedia , lookup

Designer baby wikipedia , lookup

Microevolution wikipedia , lookup

Artificial gene synthesis wikipedia , lookup

Transcript
BLAST that Gene
Student Name: ____________________
Adapted from Stephen H. Wefer, "The American Biology Teacher"
Objective: In this activity you will use the website for the National Center for Biotechnology
Information to identify a sequence of bases from a DNA sample.
Background: The NCBI contains a database of genes from multiple organisms that have been
sequenced and identified. The work of a number of scientists across a wide variety of research areas
provides the information compiled in this database. The tool used in this activity is BLAST - Basic
Logical Alignment Search Tool, which allows a user to submit a sequence of DNA bases (our old
friends A, T, G and C) into the search engine, BLAST will search the database and find the gene (if
any) associated with that sequence. Furthermore, the user can access information about the gene or the
disease it causes.
PreLab Questions: Answer in the blank space beneath each question
1. What are the four nucleotides that make up the DNA code?
Adenine, Thymine, Cytosine, Guanine
2. What are the three parts of a nucleotide?
3. What is a gene?
A gene is the set of DNA in a person that make up who they are.
4. Where do your genes come from?
Genes come from parents DNA
5. Where are genes located?
Genes are located in stem cells and sequences of DNA
6. Explain how DNA determines the traits of an organism. Your answer should include the words
nucleotide, gene, codon, mRNA, ribosome and protein.
Transcription is when RNA polymerase binds to a specific sequence called a promoter. When a
codon is formed in a DNA sequence the ribosome comes along and attaches to the codon and helps
create proteins. A gene is made up of many nucleotides which make up the sequence of DNA.
7. What could happen to an organism if it inherited a gene that coded for a defective protein? Why?
The organism wouldn’t be able to make proteins or they themselves would have the defective protein
because of the genetic traits passed down from parent to offspring.
Instructions
You will be given a nucleotide sequence found in real human DNA that is associated with a genetic
disease when mutated. Your job is to compare the sequences you are given with the nucleotide
sequence of most known genes, using the BLAST tool to search genetic
databases.
1. Go to the homepage for the NCBI ( www.ncbi.nlm.nih.gov )
2. Click on the word "BLAST" located in the list of links on the right of the
page.
3. Scroll down until you find the heading "Nucleotide BLAST" and click the
link.
4. This time we will practice using a part of
the gene sequence that codes for
hemoglobin. When a mutation occurs in
this gene a person can wind up with
Sickle Cell Anemia. Cut and Paste the
sequence below into the blast window.
GGG ATG AAT AAG GCA TAT GCA TCA GGG GCT GTT GCC AAT GTG CAT TAG CTG
TTT GCA GCC TCA CCT TCT TTC ATG GAG TTT AAG ATA
5. When you have finished entering
your sequence, click the radio
button for OTHERS and then click
on the BLAST button.
6. You will then see a screen asking
you to wait 10-20 seconds. Don’t
click on anything just relax and
wait patiently for the search to
conclude.
7. After the search has ended, scroll
down past the box with all of the
pink bars in it and find the words
"Sequences Producing
Significant Alignment".
8. Listed in order are the closest
matches with your DNA sequence.
You should notice that the blast
search will return results for all
genomes currently mapped
(Several prokaryotes, humans, rats,
chimpanzees, cows, pigs, chickens
and puffer fish to name a few)
9. Please take a moment to be awed by the similarity found in the DNA code despite the outward
physical diversity of organisms.
10. Click the Blue G block to far right of
the first human listing to enter the
gene information page. (If there is
no G box click on the blue reference
number preceding the closest match.
This will tell you the name of the
gene and its abbreviation if
available).
11. You should now be on a screen that has the following information
12. This is the Gene
Summary. It tells you
that this sequence is
officially recognized.
HBB is the official
Symbol for this gene. If
you read further it will
tell you that it is found
in humans and that a
mutated version of it
causes Sickle Cell
Anemia.
13. On the right you will
see a link to PHENOTYPES. Click the link and look for information about Hb SS Disease under
summary from GeneReviews. This will open more information about sickle cell.
14. Next click on the link to the “Map Viewer” in the right column under the
“Links” heading.
15. Map Viewer page. Across the top of the page, you will
notice the numbers 1-22 XY. These numbers represent the
chromosomes found in humans.
16. While we are here lets take a moment to go over
chromosomal nomenclature
11p15.5– This means that the HBB gene is found on chromosome
11 on the short (p) arm in region 15.5. the fancy word for this is
Locus which is science talk for location
17. Your next step will be to click on the OMIM link in the pink information bar. This will take you to
a page with lots of information about your gene and what it does.
So that is your basic tour of the NCBI. There is a lot of other information about genes and DNA posted
on this website some of which requires 3 or 4 PhDs to understand  so we are going to move on.
18. Using the same steps as before you are going to explore the human genome. Your first gene
sequence will be found by counting off by eights. Your second and (possibly third) gene sequence
is of your own choosing but cannot be the same as the people directly to your left or right, front or
back.
19. Fill out the attached data sheets for both of your gene sequences.
Gene Sequences: You will be assigned 2 to 3 of the following sequences to BLAST
Gene Sequence 1
ATG GCG ACC CTG GAA AAA GCT GAT GAA GGC CTT CGA GTC CCT CAA GTC CTT CCA GCA GCA GCA GCA GCA GCA GCA GCA
GCA GCA GCA GCA GCA GCA GCA GCA GC
Gene Sequence 2
ATG GCG GGT CTG ACG GCG GCG GCC CCG CGG CCC GGA GTC CTC CTG CTC CTG CTG TCC ATC CTC CAC CCC TCT CGG CCT GGA
GGG GTC CCT GGG GCC ATT CCT GGT GGA GTT CCT GGA GGA GTC TT
Gene Sequence 3
ATG CTC ACA TTC ATG GCC TCT GAC AGC GAG GAA GAA GTG TGT GAT GAG CGG ACG TCC CTA ATG TCG GCC GAG AGC CCC
AGC CCG CGC TCC TGC CAG GAG GGC AGG CAG GGC CCA GAG GAT GGA G
Gene Sequence 4
ATG TTT TAT ACA GGT GTA GCC TGT AAG AGA TGA AGC CTG GTA TTT ATA GAA ATT GAC TTA TTT TAT TCT CAT ATT TAC ATG
TGC ATA ATT TTC CAT ATG CCA GAA AAG TTG AAT AGT ATC AGA TTC CAA ATC T
Gene Sequence 5
ATG CGT CGA GGG CGT CTG CTG GAG ATC GCC CTG GGA TTT ACC GTG CTT TTA GCG TCC TAC ACG AGC CAT GGG GCG GAC GCC
AAT TTG GAG GCT GGG AAC GTG AAG GAA ACC AGA GCC AGT CGG GCC
Gene Sequence 6
ATG CCG CCC AAA ACC CCC CGA AAA ACG GCC GCC ACC GCC GCC GCT GCC GCC GCG GAA CCC GGC ACC GCC GCC GCC GCC
CCC TCC TGA GGG ACC CAG AGC AGG ACA GCG GCC CGG AGG AC
Gene Sequence 7
ATG TTG TGCAAT ATC CAT CTA CTG TAG TTA AGA TAT TCA GTA GTT TGT TTT TCA TAA GCA TGT AAT TGA TCA TAT TTC TGC
CAA GGA TGT GCC TTC AAC TTT ATA ATT ATA GTG TTG TAA AAT ATT TTT GTC TG
Gene Sequence 8
ATG CCA TCT TCC TTG ATG TTG GAG GTA CCT GCT CTG GCA GAT TTC AAC CGG GCT TGG ACA GAA CTT ACC GAC TGG CTT TCT
CTG CTT GAT CAA GTT ATA AAA TCA CAG AGG GTG ATG GTG GGT GAC CTT
Adapted from: Wefler, S. H. 2003. Name that gene: An authentic classroom activity incorporating bioinformatics. . The
American Biology Teacher 65:8610–613.
BLAST Data Sheet
Name:
Period:
Type your answers directly into the tables below
#1
Gene Sequence Number
Abbreviation or name of
gene
Genetic disease associated
with gene
Chromosome number
(location of gene)
Describe symptoms of
disease
#2
Gene Sequence Number
Abbreviation or name of
gene
Genetic disease associated
with gene
Chromosome number
(location of gene)
Describe symptoms of
disease
3
Psen2
alzheimers
1q42.13
Emotional behavior, language, memory, perception, thinking and
judgment
7
ATP7A
Menkes syndrome
Xq21.1
weak muscle tone (hypotonia), sagging facial features, seizures,
mental retardation, blue sclera and developmental delay.
#3
Gene Sequence Number
Abbreviation or name of
gene
Genetic disease associated
with gene
Chromosome number
(location of gene)
Describe symptoms of
disease
Once you have completed your BLAST searches answer the questions on the next page.
Analysis
1. In a BLAST search how do you know which sequence of DNA was closest to your unknown
sequence?
The percentage on the right side would tell you how close it was to matching.
2. If you had more nucleotides in your sequence to enter into BLAST, (say 1000 instead of 100), do
you think you would get more specific or less specific matches? Why?
Less specific because there is less of a chance that another sequence would have both nucleotides.
3. Why do scientists all over the world check BLAST when they sequence a new region of DNA?
They’re curious to know if there is another sequence like that already in existence. If not it’s a great
find and could lead to new discoveries but if it’s already been discovered then it saves them time from
researching it more.
4. Why would a biologist studying human genetic disorders want to see related genetic entries for
mice, cats, dogs etc?
To compare the genetic sequences and find any abnormalities or similarities.
5. How does the information in the NCBI help to illustrate the interrelation of all organisms? Explain.
It shows how a lot of the genetic code is similar but the difference of a couple of letters can allow two
organisms to be more similar or less and it shows the percentage of similarity within each genetic
sequence.
6.What do you think the potential consequences are for insurance companies knowing DNA sequences
of individuals? Should our genetic information be shared with companies, employers, future spouses,
doctors? Why or Why not?
DNA sequences reveal a lot of about a person and what genetic traits they have. If insurance
companies got hold of each individuals they’d know who is more likely to have a terminal disease and
therefore live a shorter life effecting there insurance policy. No, genetic information should remain
privet to the individual and only be shared if they want. It can effect how people see you if you happen
to have a bigger chance at having a disease or disorder. And if you’re a carrier of a disease or disorder
then some people wouldn’t want to have kids with you because they don’t want their kids to have those
disorders.