Download Name: : - Ms. Poole`s Biology

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Gene nomenclature wikipedia , lookup

Gene wikipedia , lookup

Pathogenomics wikipedia , lookup

NEDD9 wikipedia , lookup

Expanded genetic code wikipedia , lookup

Protein moonlighting wikipedia , lookup

DNA barcoding wikipedia , lookup

Therapeutic gene modulation wikipedia , lookup

Koinophilia wikipedia , lookup

Sequence alignment wikipedia , lookup

Microevolution wikipedia , lookup

Helitron (biology) wikipedia , lookup

Genetic code wikipedia , lookup

Metagenomics wikipedia , lookup

Genomics wikipedia , lookup

Artificial gene synthesis wikipedia , lookup

Point mutation wikipedia , lookup

Transcript
Cladograms and Genetics Name: _______________________
Taxonomy is the study of the classification. In biology, taxonomy deals with the
classification of organisms into groups based upon their similarities and differences –
using their genetic information (DNA or protein sequences) and their anatomical
features.
In this activity, you will use protein sequences (sequences of amino acids) to determine
the evolutionary relationship between organisms. We can use protein sequences to
determine evolutionary relationships because DNA codes for a particular protein
sequence, so differences in protein sequences mean differences in DNA sequences.
The number of amino acid substitutions between two organisms shows the differences
between the organisms themselves. The greater the number of substitutions between
two organisms, the longer ago the two organisms diverged from a common ancestor.
Scientists believe that cytochrome c has evolved at a fairly constant rate. This rate of
change is the basis for a “molecular clock.” This rate of mutation can be a helpful tool in
trying to determine how organisms have evolved.
Recent advances in biochemistry have exposed powerful molecular tools for
constructing phylogenetic trees. Protein, mRNA, rRNA, and genomic sequencing are
molecular techniques commonly used in identifying and characterizing organisms.
Molecular sequence data can be used to calibrate “molecular clocks” based on the rate
of mutations in protein and nucleic acid sequences. The rationale of molecular clocks
is that the observed number of changes in an amino acid or nucleotide sequence
may be approximately proportional to the evolutionary time elapsed. The National
Center for Biotechnology Information houses a public database of sequenced
nucleotides and proteins and develops sequence analysis shareware:
http://www.ncbi.nlm.nih.gov/
Protein data
The amino acid sequence of cytochrome c has been analyzed in over 100 eukaryotic
species, and the molecular data support the notion that cytochrome c is an
evolutionarily conservative protein. Shown below are two protein sequence
comparisons for cytochrome c: between human and either rat or yeast.
Substitutions within the primary structure of cytochrome c are relatively constant over
time, which makes characterizing cytochrome c a potentially useful molecular clock. For
example, note the cytochrome c sequence similarity between humans and rats (91%) is
much higher than between humans and yeast (64%). This indicates that the
evolutionary path leading to the human species diverged with that of yeast far before its
divergence from rat. A comparison of cytochrome c sequences from different species
has been used to order the divergence of species in relative time.
Determining the absolute timing of events using molecular clocks attracts skepticism
for several reasons. The fact that different proteins evolve at different rates makes
determining the divergence rate of a particular protein problematic. Also, changes in
generation time or metabolic rate may affect a mutation rate, making molecular clocks
less predictable.
Method of protein sequencing and alignment
A common approach to protein sequencing involves repeatedly cleaving a polypeptide
(using a protein-digesting enzyme), separating the fragments using chromatography,
and sequencing the resulting small fragments. The entire protein sequence is then
reconstructed by matching regions of sequence overlap seen in the small fragments.
Algorithms may be used to compare protein sequence data against a database. These
algorithms provide local information (within a defined region) or global information
(which accounts for gaps or other aberrations that may mask multiple regions of
similarity). This technology has greatly facilitated the grouping of related proteins into
families (an example is the serine protease family of enzymes, which includes trypsin,
chymotrypsin, and elastase).
The significance of protein sequencing
Direct protein sequencing is an indispensable tool for several reasons. For example, the
position of disulfide bonds can be resolved using protein sequencing techniques. Also,
protein modifications, such as the excision of residues or the covalent attachment of
other groups, can be detected. These changes in protein sequence and structure would
evade detection by nucleic acid sequencing efforts. One drawback to protein sequencing
is that only those genes that code for proteins are under scrutiny. This is often a small
fraction of an organism’s total amount of molecular information.
Directions: Constructing a Cladogram
1. Find the human, rhesus monkey, kangaroo, snapping turtle, bullfrog, and tuna on
the "Amino Acid Sequences in Cytochrome-C Proteins from 20 Different Species"
chart provided. Highlight or underline their names. NOTE: THE SEQUENCES
ARE SPLIT INTO TWO CHUNKS. YOU NEED EXAMINE BOTH!
2. Compare the human amino acid sequence with each of these five animals. Do this
by counting the number of times an amino acid in that animal’s protein is
different from the same amino acid position of the human sequence. Write that
information in the table below.
Number of amino acid differences between human and …
Rhesus Monkey
Kangaroo
Snapping Turtle
Bull Frog
Tuna Fish
3. Use the above information to construct a cladogram using genetic information.
The fewer the differences between the species, the more closely related they are.
The cladogram on the following page uses anatomical differences, rather than
protein and DNA sequences.
The cladogram on the next page is organized using anatomical (body) features. Notes:
1. The foramen magnum is the opening in the skull where the spine attaches.
2. Plancental mammals give live birth.
3. An aminion is a support structure for embryos.
Directions for BLAST Searches
1. Go to http://www.ncbi.nlm.nih.gov/. From the drop-down menu next to the
search bar, select “Protein.”
2. In the search bar, type: cytochrome c homo sapiens. This will search for the
human sequence of cytochrome C. Click ‘search.’
3. From your search results, select the first search result, which should be a 105 aa
sequence. Click ‘FASTA’ to see the amino acid sequence.
4. Copy the FASTA sequence beginning with the term ‘>gi|11128019 …’
5. Go to http://blast.ncbi.nlm.nih.gov/. Select ‘protein BLAST.’
6. Copy the FASTA sequence into the search box. Remove the line break mid-amino
acid sequence, so that it’s as one phrase.
7. For ‘search set,’ choose ‘non-redundant protein sequence.’ It may be selected as
the default. For ‘program selection,’ make sure that ‘blastp’ (protein-protein
BLAST) is selected. I would select ‘show results in a new window,’ so that you
can go back to your specified search.
8. Select ‘BLAST.’ Results may take several seconds to a minute to appear. If results
take longer than 2 minutes, let Ms. Poole know.
9. The results page has two sections. The first section is a graphical display of the
matching sequences. Scroll down to the section titled “Sequences producing
significant alignments.” The species in the list that appears below this section are
those with sequences identical to or most similar to the gene of interest. The
most similar sequences are listed first, and as you move down the list, the
sequences become less similar to your gene of interest.
10.
If you click on a particular species listed, you’ll get a full report that includes
the classification scheme of the species, the research journal in which the gene
was first reported, and the sequence of bases that appear to align with your gene
of interest.
11.
If you click on a particular species listed, you’ll get a full report that includes
the species’ classification scheme, the research journal in which the gene was
first reported, and the sequence of bases that appear to align with your gene of
interest. Click to open the species in a new window! (Otherwise, you wont be
able to return to your search.)
12.
If you click on the link titled “Distance tree of results,” you will see a
cladogram with the species with similar sequences to your gene of interest
placed on the cladogram according to how closely their matched gene aligns
with your gene of interest.
13.
Recall that species with common ancestry will share similar genes. The more
similar genes two species have in common, the more recent their common
ancestor and the closer the two species will be located on a cladogram.
14.
Select TWO of the other organisms from the cladogram and view their
alignment with human cytochrome C, either by running a BLAST with each or by
searching through your original BLAST.
15.
For each BLAST query, consider the following:
The higher the score, the closer the alignment.
The lower the e value, the closer the alignment.
Sequences with e values less than 1e-04 (1 x 10-4) can be considered related
with an error rate of less than 0.01%.
16.
Do these results match the results you came to from analyzing your amino
acid sequences? From your anatomical features cladogram?
Analysis Questions
Answer the following on a SEPARATE piece of paper.
1. Turn in your LABELED cladogram with each species’ scientific name. (Yes, you
will have to look them up and have them properly formatted. For names that
include more than one species, pick one and label accordingly. Google is your
friend.) (5 points)
2. Why do scientists use scientific names – rather than common names – as a way
of referring to organisms? List and explain at least three reasons! (3 points)
3. Does the cladogram organized by genetic information (and confirmed through
BLAST) agree with the cladogram organized by anatomical features? Why or
why not? Do organisms with fewer anatomical traits in common also have fewer
amino acids in common? Explain! (2 points)
4. Chickens and turkeys are both birds and have the same sequence of amino acids
in their cytochrome-c protein. Remember the definition of species! Explain in at
LEAST a paragraph how two species can have identical cytochrome-c and still be
different species. Remember the biological definition of species! (5 points)
5. How can protein sequencing and comparison be used to evaluate organisms’
evolutionary relationships? Discuss the benefits and drawbacks of using protein
sequences, rather than DNA sequences and anatomical features, to establish
organisms’ evolutionary relationships. (5 points)