Download Restriction Digestion and Analysis of Lambda DNA

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Mutagen wikipedia , lookup

Mutation wikipedia , lookup

DNA repair wikipedia , lookup

Nutriepigenomics wikipedia , lookup

Frameshift mutation wikipedia , lookup

Comparative genomic hybridization wikipedia , lookup

DNA wikipedia , lookup

Site-specific recombinase technology wikipedia , lookup

Zinc finger nuclease wikipedia , lookup

Cancer epigenetics wikipedia , lookup

DNA profiling wikipedia , lookup

Whole genome sequencing wikipedia , lookup

Designer baby wikipedia , lookup

DNA polymerase wikipedia , lookup

Human genome wikipedia , lookup

RNA-Seq wikipedia , lookup

DNA damage theory of aging wikipedia , lookup

DNA barcoding wikipedia , lookup

Nucleosome wikipedia , lookup

Primary transcript wikipedia , lookup

Gene wikipedia , lookup

Vectors in gene therapy wikipedia , lookup

DNA vaccination wikipedia , lookup

Replisome wikipedia , lookup

Microevolution wikipedia , lookup

No-SCAR (Scarless Cas9 Assisted Recombineering) Genome Editing wikipedia , lookup

DNA sequencing wikipedia , lookup

Molecular cloning wikipedia , lookup

Genomic library wikipedia , lookup

SNP genotyping wikipedia , lookup

Extrachromosomal DNA wikipedia , lookup

History of genetic engineering wikipedia , lookup

United Kingdom National DNA Database wikipedia , lookup

Epigenomics wikipedia , lookup

Point mutation wikipedia , lookup

Nucleic acid double helix wikipedia , lookup

DNA supercoil wikipedia , lookup

Gel electrophoresis of nucleic acids wikipedia , lookup

Non-coding DNA wikipedia , lookup

Genealogical DNA test wikipedia , lookup

Nucleic acid analogue wikipedia , lookup

Cell-free fetal DNA wikipedia , lookup

Cre-Lox recombination wikipedia , lookup

Microsatellite wikipedia , lookup

Genome editing wikipedia , lookup

Bisulfite sequencing wikipedia , lookup

Therapeutic gene modulation wikipedia , lookup

Metagenomics wikipedia , lookup

Deoxyribozyme wikipedia , lookup

Artificial gene synthesis wikipedia , lookup

Helitron (biology) wikipedia , lookup

Genomics wikipedia , lookup

Transcript
DNA Sequencing Analysis (Edvotek)
Introduction
The purpose of this exercise is to introduce students to bioinformatics. In order to gain experience in
database searching, students will utilize the free service offered by the National Center for
Biotechnology (NCBI) which can be accessed on the WWW. These exercises will involve using BLASTN,
where nucleotide sequence will be compared to other sequences in the nucleotide database.
Procedure Background
Bioinformatics is a new field of biotechnology that is involved in the storage and manipulation of DNA
sequence information from which one can obtain useful biological information. Almost routinely, data
from DNA sequence analysis is submitted to Data bank searches using the World Wide Web (WWW) yo
identify genes and gene products.
For sequence analysis, four separate enzymatic reactions are performed, one for each of the four
nucleotides. Each reaction contains the DNA polymerase, the single-stranded DNA template to be
sequenced to which a synthetic primer has been hybridized, the four deoxyribonucleotide triphosphates
(dATP, dGTP, dCTP, dTTP), often ad isotopically labeled deoxynucleotide triphosphate, such as 32P, and
the appropriate DNA sequencing buffer. The reactions contain the dideoxytriphosphate reactions as
follows: the “G” reaction contains dideoxyGTP, the “C” reaction dideoxyCTP, the “A” reaction
dideoxyATP, the “T” reaction dideoxyTTP. The small amounts of dideoxynucleotide concentrations are
carefully adjusted so they are randomly and infrequently incorporated into the growing DNA strand.
Once a dideoxynucleotide is incorporated into a single strand, DNA synthesis is terminated since the
modified nucleotide does not have a free 3’ hydroxyl group on the sugar which is the site of the addition
of the next nucleotide in the chain. The incorporation of the dideoxynucleotide allows the generation of
the nested DNA fragments and makes possible to determine the position of the various nucleotides in
DNA. A particular reaction will contain millions of growing DNA strands. Each fragment is terminated at
a different position corresponding to the random incorporation of the dideoxynucleotide.
The products from the G, A, T, and C reactions are separated by a vertical DNA polyacrylamide gel. It is
important to note that the strand being sequenced will have the opposite Watson/Crick base. As an
example, the G reaction in tube one will identify the C nucleotide in the template being sequenced. After
electrophoretic separation is complete, autoradiography is performed. The polyacrylamide gel is placed
into direct contact with a sheet of X-ray film. Since the DNA fragments are labeled with 32P, their
position can be detected by a dark exposure band on the sheet of x-ray film. In addition to 32P, nonisotopic methods of using fluorescent dyes and automated DNA sequencing machines are replacing the
traditional isotopic methods.
The sequence deduced from the autoradiogram will actually be the complement of the DNA strand
contained in the single-stranded DNA template. This DNA sequencing procedure is called the Sanger
“dideoxy” method named after the scientist who developed the procedure.
Data from DNA sequencing is of limited use unless it can be converted to biologically useful information.
Bioinformatics therefore is a critical component of DNA sequencing. It evolved from the merging of
computer technology and biotechnology. The widespread use of the internet has made it possible to
easily retrieve information from the various genome projects. In a typical analysis, as a first step, after
obtaining DNA sequencing data a molecular biologist will search for DN sequence similarities using
various data banks. Such as search may lead to the identification of the sequenced DNA or identify its
relationship to related genes. Protein coding regions can also be identified by the nucleotide
composition. Likewise, noncoding regions can be identified by interruptions due to stop codons. The
functional significance of new DNA sequences will continue to increase and become more important as
sequence information continue to be added and more powerful search engines become readily
accessible.
Experiment Procedures
1. Type: www.ncbi.nlm.nih.gov to log on to the NCBI web page. Click on BLAST.
2. New menu should now appear. This page presents choices for which database is being searched,
and which variety of BLAST to use for the searching. For our purpose, which is a nucleotide
sequence (not a protein amino acid sequence), so click on nucleotide blast.
3. Next, under Nucleoide BLAST, click on standard Nucleotide-Nucleotide BLAST (blastn).
The new screen should appear. There are three selections on this screen: Enter Query Sequence,
Choose Search Set, and Program Selection.
4. To get started entering the nucleotide sequence, start typing the sequence (either in capital or
lower-case letters) shown in screen above under Enter Query Sequence. Be careful to type
exactly! Use the default selections and then click on BLAST query box.
5. Once the BLAST query box has been clicked, you will receive reply regarding the status of your
search results. Once your entry is in the BLAST Queue, you will be assigned an ID# that can be
used to check your results at a later time (“Retrieve results for an existing Request ID” under the
BLAST menu).
6. Sometimes the search is busy. If your results are not ready at this point, try again a bit later or
log on at another time and use the same ID number.
As a general rule, identical nucleotide sequence spanning greater than 21bp between two
samples, usually indicates that the sequences are related or identical. One clear exception to this
rule, is a long stretch of Adenines or Thymines, which may correspond to the polyA tail found at
the 3’ end of all eukaryotic mRNAs.
Part A - Radiographs
Exercise #1
Familiarize yourself with the autorads by reading the DNA sequence for sample #1.

Start at the arrow and read up the gel for 30 nucleotides. Record the sequence. Submit it
to NCBI using the BLASTN program.
A few notes on reading a sequencing gel:

It is critical that you do not confuse lanes when reading the sequence.

Reading a sequence gel requires that you read the nucleotides in the 5’ to 3’ direction.
This can be accomplished by reading up the gel.

Notice that for the most part that the spacing and intensity of most of the bands is fairly
constant. Ignore lightly colored bands choosing only the darker ones. Occasionally the
sequence will be dark and all four lanes will be of relatively similar intensity. This is called
a DNA sequencing compression and is common when there are stretches of G and Cs.
Results to be obtained for Sample #1:
a. Provide sequence as read on gel (do not forget polarity - 5ʹ and 3ʹ ends)
b. Name of the gene
Exercise #2
Now that you have familiarity with the entry and submission process, read the DNA sequence analysis
from the autoradiograph corresponding to number 2. Notice that it is sometimes difficult to judge the
spacing and strongest intensity of the band in each lane and therefore you need to use your best
judgement.
Now begin the exercise by reading the DNA sequence for sample number 2 approximately 6cm from the
bottom of the strip. It should read as follows:
5’ … GGACGACGG … 3’ (including this sequence, read a series of 30 nucleotides)
Submit it to NCBI using the BLASTN program.

Remember that DNA sequence is always entered in the 5’ to 3’ direction.

DNA is double-stranded and contains a top (5’ to 3’) and bottom (3’ to 5’) strand
(sometimes this corresponds to the coding and noncoding strands).

If an exact band at a position is ambiguous, you can enter an N which denotes it could be
either A, C, G, or T.
After receiving the BLAST results, scroll down to look at the entries that have nucleotide matches with
your query sequence. Again remember that when two sets of nucleotide sequences show identity of a
21 base pair continuous strand, it usually indicates that the sequences are related or identical.
Results to be obtained for Sample #2:
a. Provide sequence as read on gel (do not forget polarity - 5ʹ and 3ʹ ends)
b. Name of the gene
Exercise #3

DNA entries can be further accessed by clicking on the Genbank accession number.

The information shown describes the DNA sequence and/or gene, the contributing
scientist’s name and information such as the protein and the amino acid sequence for
which it codes.

Read the DNA sequence from sample number 3. Start at the bottom of the strip and
record the DNA sequence (30 nucleotides).
Results to be obtained for Sample 3:
a. Provide sequence as read on gel (do not forget polarity - 5ʹ and 3ʹ ends)
b. Name of the gene
Part B – automated sequencing
In this instance, nucleotides are fluorescently labeled, and are run in a single lane. As each DNA
fragment (in one nucleotide increments) passes a beam, the color is detected, and the computer prints
out a visual representation. A = green, C = blue, G = black, and T = red (although red looks more
orange on this particular printouts).
Exercise #1-4
Each sequence (#1 - #4) has multiple lanes. You only need to sequence ONE lane per sample, but
make sure you provide in your write up which lane you used.
Record a 70 nucleotide sequence for each sample.
a. Provide the sequence as read on the printout ( do not forget polarity)
b. Name the gene
Thought questions
Your group as a whole does NOT need to agree on these, and, since they are opinion-based, there is no
right or wrong answer. If there are conflicting opinions, include all in your writeup.
Question #1: Should we do prenatal tests for diseases that are currently incurable?
Question #2: Should we test our children for adult-onset diseases?
Question #3: Should experiments dealing with human fetal tissues be allowed? (usually obtained from
POC – products of conception {tissue retrieved from either spontaneous or elective abortions, or
miscarriages})
Question #4: Direct-to-consumer testing refers to testing that is marketed to individuals through
mediums like the Internet, radio, and television. These kits often involve doing a cheek cell smear and
mailing the sample back for testing. In SOME cases, a genetic counselor or other health care provider
may be available to explain results and answer questions. Should this type of testing continue to be
available? Why or why not?