MGF 110 Figure Legend
[Figure labels: “MGF 110 ortholog group”; “Reference length of this ortholog”]
Ortholog columns and gene size:
A gene box of the same width as its column heading is approximately the
reference length stated in that heading.
[Figure labels: “MGF 110-4L ortholog present in Mkuzi_1979 strain”; “MGF 110-1L
ortholog missing in these connected strains”]
Genes:
The above figure indicates that the MGF 110-4L ortholog is missing in strains
Pretorisuskop_96_4, Warmbaths, and Warthog, but present in Mkuzi_1979 and
GEO_2007|1.
[Figure labels: “Gene: Mkuzi_1979 - 006, Assigned Ortholog: MGF 110-1L”;
“Gene: Warmbaths - 009, Assigned Ortholog: MGF 110-4L. The assigned ortholog
(4L) is incorrect. The correct ortholog is 3L.”]
Gene labels:
The annotation of each gene is in two parts: the currently assigned ortholog
group followed by the corresponding gene number of the connected strain.
If the ortholog group is incorrectly annotated, it is labelled in red.
[Figure labels: “ORF for this gene is located on the reverse strand”;
“3’/Carboxy Terminus”; “5’/Amino Terminus”]
Gene Orientation:
MGF 110 has only “L” orthologs, which are transcribed on the reverse strand (drawn
3’ to 5’ from left to right). Their gene boxes point to the left.
[Figure labels: “Size: ~414 bp”; “Size: less than 414 bp. Amino terminus
truncation”; “Size: less than 414 bp. Carboxy terminus truncation”]
Relative gene size and truncations:
A smaller or larger gene box indicates the size difference of the gene relative
to other genes of the same ortholog.
A 5’/amino terminus that is not aligned with the amino terminus of the
full-length genes indicates an amino terminus truncation of this gene.
A 3’/carboxy terminus that is not aligned with the carboxy terminus of the
full-length genes indicates a carboxy terminus truncation of this gene.
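The reading rule above can be sketched in code. This is only an illustration, not the method used to draw the figure: the function name, the (left, right) column coordinates, and the tolerance parameter are all assumptions. It assumes “L” genes drawn pointing left, so the right edge of a box is the 5’/amino terminus and the left edge is the 3’/carboxy terminus.

```python
def classify_truncation(gene, reference, tolerance=0):
    """Classify a gene box against the full-length reference box.

    gene and reference are hypothetical (left, right) positions in the
    alignment. For leftward-pointing "L" genes the right edge is the
    5'/amino terminus and the left edge is the 3'/carboxy terminus.
    """
    gene_left, gene_right = gene
    ref_left, ref_right = reference
    labels = []
    # Right edge falls short of the reference: amino terminus is missing.
    if ref_right - gene_right > tolerance:
        labels.append("amino terminus truncation")
    # Left edge starts inside the reference: carboxy terminus is missing.
    if gene_left - ref_left > tolerance:
        labels.append("carboxy terminus truncation")
    return labels or ["full length"]


# Invented coordinates against a ~414 bp reference box.
reference_box = (0, 414)
print(classify_truncation((0, 414), reference_box))
print(classify_truncation((0, 300), reference_box))
print(classify_truncation((100, 414), reference_box))
```

A non-zero tolerance would let boxes that differ from the reference by only a few columns still count as full length, which matches the “approximately” wording of the legend.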
[Figure labels: “These ORFs are overlapping in different reading frames”; “This
MGF 110-11L ortholog is split into two smaller ORFs due to frame shift”; “Grey
box: In-frame deletion”]
Special Note Gene Features:
The 5’ end of the ORF for the top two 11L genes overlaps with the 3’ end of
the ORF for L60-011 in different reading frames.
Two smaller gene boxes underneath a single heading represent the
fragmentation of an ortholog into two smaller ORFs.
The bottommost 11L ortholog shown in the above diagram has several large
in-frame deletions in the gene when compared to the aligned genomes.
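As a rough illustration of what “overlapping in different reading frames” means for two reverse-strand ORFs, the sketch below compares genomic coordinates. The function names and all coordinates are invented for the example and are not taken from the aligned genomes. It assumes 1-based inclusive (start, end) coordinates and that both ORFs lie on the reverse strand, so the first base of each ORF sits at its highest coordinate.

```python
def overlap_length(orf_a, orf_b):
    """Number of shared bases between two (start, end) inclusive intervals."""
    return max(0, min(orf_a[1], orf_b[1]) - max(orf_a[0], orf_b[0]) + 1)


def reverse_strand_frame(orf):
    """Frame of a reverse-strand ORF, set by its highest coordinate."""
    return orf[1] % 3


def overlap_in_different_frames(orf_a, orf_b):
    """True if the ORFs share bases but are read in different frames."""
    shares_bases = overlap_length(orf_a, orf_b) > 0
    return shares_bases and reverse_strand_frame(orf_a) != reverse_strand_frame(orf_b)


# Invented example: the 5' end (high coordinates) of the first ORF runs
# into the 3' end (low coordinates) of the second.
downstream_orf = (100, 399)
upstream_orf = (380, 700)
print(overlap_length(downstream_orf, upstream_orf))
print(overlap_in_different_frames(downstream_orf, upstream_orf))
```

Two ORFs that overlap in the same frame on the same strand would share codons across the overlap, so a different-frame result is what allows each ORF to end at its own stop codon.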
[Figure labels: “Fusion between MGF 110-13L amino terminus and 11L carboxy
terminus separated by deletion (grey box)”; “MGF 110-13L/14L Overlap”; “Fusion
between complete MGF 110-14L, 13L amino terminus, and 11L carboxy terminus
separated by deletion (grey box)”]
MGF 110 Fusion:
Genes encoded by an open reading frame that aligns across multiple ortholog
loci due to genomic deletions are labelled in the above diagram as “fusion” genes
and are represented by black gene boxes. The size and alignment of a black gene
box relative to a given ortholog group show how much of that ortholog is fused,
and which region.
The fused ortholog groups are labelled along the deletion box connecting the
fragments.
The absence of a deletion box in a gene fusion indicates that the deletion
connecting the two ORFs is only a few base pairs long, too small to resolve on
this diagram.
MGF 110-13L/14L overlap:
The 3’ end of the 14L ORF overlaps with the 5’ end of the 13L ORF. This is
represented by the absence of space between the two ortholog headings, so that
all genes under these headings are drawn overlapping.
[Figure labels: “MGF 110-10L skipped”; “Only MGF 110-10L assignment”]
MGF 110-10L Missing:
The MGF 110-10L ortholog is annotated only in the Warmbaths strain, and that
gene appears to be a truncated 11L ortholog. It is quite short due to an amino
terminus truncation. It is possible that, when other genes were compared to this
gene, it appeared to form its own ortholog group, causing any subsequent
assignments to be made to a new ortholog.
[Figure labels: “Fusion between MGF 110-7L and MGF 360-6L”; “Likely gene
rearrangement between these two genes but hard to tell”; “Fusion between small
region between MGF 110-9L and 11L and MGF 360-6L”]
Cross-Diagram Fusions:
Trunc - 014 [MGF 110-7L/MGF 360-6L Fusion Protein]:
This gene is a fusion between the MGF 110-7L ortholog and MGF 360-6L. The
amino terminus of this fusion is not shown since it is outside the scope of this
diagram.
The annotated ortholog for this gene is “Truncated MGF 360 protein”, which
has been shortened to “Trunc”. However, the actual ortholog identity is most likely a
fusion between the two MGF orthologs.
MGF 360-5L  MGF 360-6L:
This gene is a fusion between the MGF 360-6L amino terminus and a short
non-MGF sequence between MGF 110-9L and 11L. Because this gene has mostly
MGF 360-6L character and only a short region aligns to the sequence between
MGF 110-9L and 11L, the gene is given an MGF 360-6L ortholog assignment. Because
the carboxy terminus fragment is small, the gene was labelled on the connecting
deletion box. The amino terminus of this fusion is not shown since it is outside
the scope of this diagram.
Pretorisuskop 11L and 13/14L alignment:
These two Pretorisuskop genes do not align very well in the region where they are
positioned in the above diagram. In a dotplot analysis, it appears that 022 aligns
best with the 13L/14L orthologs and 023 aligns best with the 11L ortholog. This
supports a potential gene rearrangement in this genome, but because of the high
similarity between the 11L and 13L/14L orthologs, this is hard to determine.
[Figure label: “Unannotated gene”]
Warmbaths MGF 110-14L ortholog:
The Warmbaths strain has an ORF that aligns with the 14L orthologs but
was not annotated.