Download Simple tandem repeats in mammalian genomes

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Epigenetics of neurodegenerative diseases wikipedia , lookup

Epigenomics wikipedia , lookup

Short interspersed nuclear elements (SINEs) wikipedia , lookup

Human genetic variation wikipedia , lookup

Oncogenomics wikipedia , lookup

Non-coding RNA wikipedia , lookup

Epigenetics of diabetes Type 2 wikipedia , lookup

Cancer epigenetics wikipedia , lookup

No-SCAR (Scarless Cas9 Assisted Recombineering) Genome Editing wikipedia , lookup

Extrachromosomal DNA wikipedia , lookup

Public health genomics wikipedia , lookup

Deoxyribozyme wikipedia , lookup

Cre-Lox recombination wikipedia , lookup

Genetic engineering wikipedia , lookup

Copy-number variation wikipedia , lookup

Gene desert wikipedia , lookup

Genomic imprinting wikipedia , lookup

Polycomb Group Proteins and Cancer wikipedia , lookup

Genomic library wikipedia , lookup

Ridge (biology) wikipedia , lookup

Gene expression programming wikipedia , lookup

Pathogenomics wikipedia , lookup

Long non-coding RNA wikipedia , lookup

Biology and consumer behaviour wikipedia , lookup

Primary transcript wikipedia , lookup

Point mutation wikipedia , lookup

Vectors in gene therapy wikipedia , lookup

Metagenomics wikipedia , lookup

Nutriepigenomics wikipedia , lookup

Transposable element wikipedia , lookup

Genome (book) wikipedia , lookup

Genomics wikipedia , lookup

Minimal genome wikipedia , lookup

Human genome wikipedia , lookup

Epigenetics of human development wikipedia , lookup

Gene expression profiling wikipedia , lookup

Site-specific recombinase technology wikipedia , lookup

Non-coding DNA wikipedia , lookup

Gene wikipedia , lookup

Designer baby wikipedia , lookup

Genome editing wikipedia , lookup

History of genetic engineering wikipedia , lookup

Microevolution wikipedia , lookup

Therapeutic gene modulation wikipedia , lookup

Genome evolution wikipedia , lookup

RNA-Seq wikipedia , lookup

Helitron (biology) wikipedia , lookup

Artificial gene synthesis wikipedia , lookup

Microsatellite wikipedia , lookup

Transcript
Simple repeats in DNA sequences may
regulate gene expression
Cornelia Lange
The sequencing of the human genome lead to some surprises concerning the number of genes.
The number of human genes seems to be around 30,000, much less than expected. This
number is not a lot higher than the number of genes of “lower” organisms, like the worm
Caenorhabditis elegans with its more than 19,000 genes or the fruit fly Drosophila
melanogaster with about 13,000 genes. This led to the assumption that the difference between
humans and other organisms is not so much due the number of genes, but more to how these
genes function.
DNA molecules are made up from four different bases arranged in different sequences,
much as letters are arranged into words. It's the sequence of the bases that contains the
information, and is decoded into RNA and proteins. DNA regions where a small sequence of
bases is repeated over and over again - CTCCTCCTCCTCCTC for instance, containing five
repeats of the sequence CTC - are called microsatellites. For some microsatellites, therefore
called "polymorphic", the number of repeats varies in different individuals
"Genes" are defined as those parts of DNA-molecules that specify (encode) RNA or
proteins. Only around 3% of the human genome encodes proteins, the rest consists of regions
that encode RNA as well as non-coding regions. Some of the non-coding regions, regulatory
sequences, specify how the genes are going to be expressed.
Particular proteins (transcription factors) bind to such regulatory sequences, thereby
regulating gene expression. There is strong evidence that microsatellites can be part of
regulatory sequences. Since they are often polymorphic, this may be a source of genetic
variation in regulating gene expression in closely related species.
In this project I searched for polymorphic microsatellites located close to genes in the
human genome, because in this positions they have the highest probability of having a
regulatory function. I used a database that was created to detect short sequences repeated
directly after each other (tandem repeats) in the human genome. To test if they were
polymorphic I used two methods. One was DHPLC (Denaturing High Performance Liquid
Chromatography). This is a separation technique where DNA is forced to flow through a
column under a high pressure. The time it takes DNA to pass through the column varies
depending on its length and shape. If a DNA fragments contains a polymorphic microsatellite,
this will show up as a special pattern in the results. The second method was to determine the
base sequence in DNA fragments including the microsatellite from different individuals. The
sequences were then compared to see if they had a different number of repeats of the
microsatellite. Using these methods I could create a dataset (including gene name, gene
identity, microsatellite position, orientation, motif and number of repeats) of altogether 24
polymorphic microsatellites, which can be used for further studies on microsatellites.
To see how the number of repeats affected gene expression, I studied one particular
microsatellite, (CTC)n (n repeats of the sequence CTC). The different alleles (gene variations)
(CTC)8, (CTC)10 and (CTC)11 showed different expression of the gene they are adjacent to. I
tested whether a transcription factor bound differently to these different alleles. But using my
experimental set-up, I could not clearly confirm any differences.
Degree project in biology, University of Uppsala, fall 2002
Examensarbete i biologi, 20 p
Department of Genetics and Pathology, Uppsala University
Supervisor: Claes Wadelius